Subscriber access provided by NEW MEXICO STATE UNIV
Characterization of Natural and Affected Environments
Screening Priority Factors Determining and Predicting the Reproductive Toxicity of Various Nanoparticles Zhan Ban, Li Mu, Qixing Zhou, Anqi Sun, and Xiangang Hu Environ. Sci. Technol., Just Accepted Manuscript • DOI: 10.1021/acs.est.8b02757 • Publication Date (Web): 30 Jul 2018 Downloaded from http://pubs.acs.org on July 30, 2018
Just Accepted “Just Accepted” manuscripts have been peer-reviewed and accepted for publication. They are posted online prior to technical editing, formatting for publication and author proofing. The American Chemical Society provides “Just Accepted” as a service to the research community to expedite the dissemination of scientific material as soon as possible after acceptance. “Just Accepted” manuscripts appear in full in PDF format accompanied by an HTML abstract. “Just Accepted” manuscripts have been fully peer reviewed, but should not be considered the official version of record. They are citable by the Digital Object Identifier (DOI®). “Just Accepted” is an optional service offered to authors. Therefore, the “Just Accepted” Web site may not include all articles that will be published in the journal. After a manuscript is technically edited and formatted, it will be removed from the “Just Accepted” Web site and published as an ASAP article. Note that technical editing may introduce minor changes to the manuscript text and/or graphics which could affect content, and all legal disclaimers and ethical guidelines that apply to the journal pertain. ACS cannot be held responsible for errors or consequences arising from the use of information contained in these “Just Accepted” manuscripts.
is published by the American Chemical Society. 1155 Sixteenth Street N.W., Washington, DC 20036 Published by American Chemical Society. Copyright © American Chemical Society. However, no copyright claim is made to original U.S. Government works, or works produced by employees of any Commonwealth realm Crown government in the course of their duties.
Page 1 of 45
Environmental Science & Technology
TOC 84x47mm (300 x 300 DPI)
ACS Paragon Plus Environment
Environmental Science & Technology
1
Screening Priority Factors Determining and Predicting the Reproductive
2
Toxicity of Various Nanoparticles
3
Zhan Ban†, Qixing Zhou†, Anqi Sun†, Li Mu‡, Xiangang Hu†,*
4
†
5
Education)/Tianjin Key Laboratory of Environmental Remediation and Pollution
6
Control, College of Environmental Science and Engineering, Nankai University,
7
Tianjin 300350, China
8
‡
9
Laboratory for Environmental Factors Control of Agro-Product Quality and Safety
10
(Ministry of Agriculture), Institute of Agro-Environmental Protection, Ministry of
11
Agriculture, Tianjin 300191, China
Key Laboratory of Pollution Processes and Environmental Criteria (Ministry of
Tianjin Key Laboratory of Agro-Environment and Agro-Product Safety, Key
12 13
Corresponding author: Xiangang Hu,
[email protected] 14
Fax: 0086-022-23507800
15
Tel.: 0086-022-23507800
16 17 18 19 20 1
ACS Paragon Plus Environment
Page 2 of 45
Page 3 of 45
Environmental Science & Technology
21
ABSTRACT
22
Due to the numerous factors (e.g., nanoparticle [NP] properties and experimental
23
conditions) influencing nanotoxicity, it is difficult to identify the priority factors
24
dominating nanotoxicity. Herein, by integrating data from the literature and a random
25
forest model, the priority factors determining reproductive toxicity were successfully
26
screened from highly heterogeneous data. Among ten factors from more than eighteen
27
different NPs, the NP type and the exposure pathway were found to dominantly
28
determine NP accumulation. The reproductive toxicity of various NPs primarily
29
depended on the NP type and the toxicity indicators. Nanoparticles containing major
30
elements (e.g., Zn and Fe) tended to accumulate in rats but induced lower toxicity
31
than NPs containing noble elements. Compared with other exposure pathways, i.p.
32
injection posed significantly higher risks for NP accumulation. By combining
33
similarity network analysis and hierarchical clustering, the sources of highly
34
heterogeneous data were identified, the factor-toxicity dependencies were extracted
35
and visualized, and the prediction of nanotoxicity was then achieved based on the
36
screened priority factors. The present work provides insights for the design of animal
37
experiments and the illustration and prediction of nanotoxicity.
38 39
Keywords: nanotoxicology, carbon nanotube, graphene, Ag, TiO2, rat
40 2
ACS Paragon Plus Environment
Environmental Science & Technology
41
INTRODUCTION
42
With unique and advanced properties, engineered nanoparticles (ENPs) have been
43
widely applied in human activities (e.g., approximately 760 nano-enabled consumer
44
products in the health and fitness category, with an increasing number of
45
nano-enabled products for industry, agriculture and environmental protection).1 With
46
the rapid development of NPs, nanosafety has attracted increasing attention
47
worldwide due to environmental contamination or direct human exposure.2, 3 However,
48
the numerous physicochemical properties (e.g., size, shape and surface chemistry) of
49
ENPs and the various experimental conditions (e.g., various exposure pathways)
50
affect nanotoxicity.4 For example, suspended graphene showed toxic effects through
51
direct physical interaction with the cell membrane, generating reactive oxygen species
52
(ROS) and trapping cells via aggregation.5-9 The chemical state of graphene (e.g., GO
53
and rGO) also affected its interaction with cells.10 Based on the hypothesis that the
54
physicochemical properties of ENPs and the experimental conditions determine
55
nanotoxicology, the priority factors determining nanotoxicity were identified and
56
nanotoxicity was predicted using models in the present work. In addition, the cost of
57
animal experiments is very high, and the mechanisms of nanotoxicity are very
58
complicated, especially in experiments involving reproductive toxicity. As a result,
59
arguments concerning nanotoxicity are frequently reported.11-13 It is becoming
60
increasingly important to effectively screen priority factors determining and 3
ACS Paragon Plus Environment
Page 4 of 45
Page 5 of 45
Environmental Science & Technology
61
predicting the toxicity of various NPs using other methods to supplement animal
62
experiments, which is critical for the design of animal experiments and for the
63
illustration and prediction of nanotoxicity.
64
Multidimensional datasets (e.g., the physicochemical properties of ENPs and the
65
various experimental conditions) challenge traditional tools (e.g., quantitative
66
structure-activity relationships and qualitative classification-based models) for the
67
assessment and prediction of nanotoxicity.14-16 Machine learning is a discipline that
68
can be used to build models to explain observed data through experience.17, 18 With its
69
robust capacity to recognize meaningful patterns, machine learning has a wide range
70
of applications, including controlling robots,18 predicting toxicity19 and predicting
71
synthetic reactions.19, 20 In the above approach, numerous data points were used to
72
overcome the heterogeneous data. Thus, machine learning may solve the
73
aforementioned problems of assessing nanosafety.
74
However, publications and data regarding reproductive toxicity are relatively
75
limited due to the high cost and complexity of performing such studies. To evaluate
76
the reproductive toxicity of ENPs, the reported data from animal experiments are
77
valuable sources of toxicological information for meta-analysis, which could produce
78
general answers to questions regarding diverse data.19-23 Given the highly
79
heterogeneous data, it is difficult to reveal the relationships between the
80
multidimensional factors of ENPs and nanotoxicity using traditional models. 4
ACS Paragon Plus Environment
Environmental Science & Technology
81
Therefore, accurately extracting the hidden multidimensional factor-toxicity
82
dependencies through machine learning is critical for toxicity assessment and
83
prediction19, 24 and remains a challenge for current workflows.25
84
As a new interdisciplinary technique, machine learning can be used to build models
85
to explain observed data through experience.17, 18 Random forest (RF)26 is a powerful
86
algorithm based on the decision tree method and is suitable for dealing with big
87
data.19, 27 In the present work, RF was used to examine heterogeneous data from a
88
meta-analysis. To explore the heterogeneity of the datasets, similarity network
89
analysis was performed to visualize the similarity between data points with the
90
proximity matrix in RF. The proximity matrix, a by-product of RF, records the
91
frequency of two data points appearing in the same nodes.19 The priority factors for
92
determining reproductive toxicity were screened by RF and are critical to the
93
scientific design of experiments, as well as the accurate illustration and prediction of
94
nanotoxicity. Finally, multiple methods were integrated to predict nanotoxicity by
95
revealing factor-toxicity dependencies.
96
MATERIALS AND METHODS
97
Data extraction
98 99 100
Data regarding the female reproductive toxicity induced by NPs (e.g., those related to the uterus, ovaries and sex hormones) are highly heterogeneous.28, 29 In contrast, the indexes of male reproductive toxicity showed a high representation of fertility (e.g., 5
ACS Paragon Plus Environment
Page 6 of 45
Page 7 of 45
Environmental Science & Technology
101
the sperm count and the sperm abnormality rate).30 Therefore, the present work
102
focused on reproductive toxicity in male rodents. The tested data were obtained from
103
the ISI Web of Knowledge, PubMed and Scopus databases; the searched topics are
104
presented in the Supporting Information (search time, July 23, 2017). To obtain a
105
comprehensive understanding of the relationships between different factors (e.g., ENP
106
types) and reproductive toxicity, the present work collected data from animal
107
experiments related to nanotoxicity in the literature, as shown in Tables S1-S3. The
108
chosen NPs, including carbon nanotubes, graphene, and Ag and TiO2 NPs, have been
109
widely applied in various fields and have high potential for environmental or human
110
exposure; for example, Ag and TiO2 NPs have been detected in various surface
111
waters.31, 32 More importantly, the chosen NPs cause obvious reproductive toxicity by
112
reducing sperm parameters and accumulating in the testicles.33, 34 The present work
113
intended to study the relationships between different ENP types (e.g., carbon
114
nanotubes and Ag and Au NPs) and reproductive toxicity in the testes. All NPs
115
causing reproductive toxicity in male rats were analyzed. A summary of toxicity
116
information from each publication collected in the present work is provided in Table
117
S3 to ensure the reliability of the results of the model. Ten factors, including five
118
physicochemical properties (ENP size, shape, type, surface ligand and element) and
119
five experimental conditions (rodent age, dosage and exposure time, duration and
120
method), and two adverse biological impacts (reproductive toxicity and accumulation 6
ACS Paragon Plus Environment
Environmental Science & Technology
121
of ENPs) were extracted from each publication, as shown in Tables S1 and S2. The
122
organ tested for the accumulation of ENPs was the testis. The tested reproductive
123
toxicity indicators were sperm parameters, blood testosterone concentration and testis
124
index. Among the above three indicators, sperm parameters were the priority
125
parameters for measuring the male reproductive toxicity of ENPs. The analyzed
126
sperm parameters included the sperm count (106/ml), rate of motile sperm (%),
127
progressive motility (%), sperm abnormalities (%) and total sperm production (106/g
128
testis). Among the above sperm parameters, the sperm count (106/ml) was set as a
129
priority sperm parameter.
130
Treatment of special data
131
Approximately one hundred data samples did not include the age of the tested
132
rodents. The curve-fitting method was used to calculate the missing ages based on the
133
age-weight relationships of different species (details are presented in the
134
Supplementary Information and Table S4). In a few publications without information
135
regarding rodent age or weight, the median age of overall rodents was used to replace
136
the missing age values. For nanowires or nanorods, the widths were recorded in the
137
present analysis. According to respiratory rates in previous research,35 μg/݉ଷ or
138
mg/݉ଷ was converted to mg/݇݃ for rodents based on the formula described by
139
Nouwen.36 The unit of exposure dosage was normalized to mg/(݇݃ ∙ ݀ܽ)ݕ. To
7
ACS Paragon Plus Environment
Page 8 of 45
Page 9 of 45
Environmental Science & Technology
140
normalize the diverse toxicity indicators, the ratios of data from treated groups to data
141
from control groups were used in the present analysis.
142
RF regression and validation
143
In RF, each tree is built by a bootstrap sample from the overall data and the best
144
partitions among a subset of attributes that are randomly selected for each node. The
145
RF algorithm performs the predictions by aggregating the predictions of each tree,
146
using the majority vote for classification and the average for regression. To measure
147
the relative importance of factors, the percent increase in the RF mean square error
148
(MSE) was calculated using the R package randomForest. To find the sources of
149
heterogeneity and to identify the priority factors determining the reproductive toxicity
150
of ENPs, the original datasets were classified into subsets for further analysis
151
according to the above priority factors. Given that the type of ENP was crucial for
152
ENP toxicity and accumulation, metallic ENPs vs metal-free ENPs and carbonaceous
153
ENPs vs carbon-free ENPs were set as two important classifications. Regarding the
154
exposure method in accumulation studies, the data were divided according to injection
155
(e.g., intravenous injection, intraperitoneal injection, abdominal subcutaneous
156
injection and intratracheal injection) and noninjection (e.g., gavage and inhalation)
157
exposure methods.
158
RF, as a data-driven model, includes two important parameters: ntree, the number
159
of trees in the forest; and mtry, the number of attributes randomly selected for a subset 8
ACS Paragon Plus Environment
Environmental Science & Technology
160
at each node.26 The values of ntree and mtry for each study were adjusted to obtain
161
the best predictive accuracy. During tree construction, approximately 63% of the raw
162
data were used to build the trees in each RF bootstrap sample, and the remaining
163
out-of-bag (OOB) data (not in the bootstrap samples) were used to validate the
164
performance of the model.37 As the RF algorithm is inherently tolerant to
165
overfitting,37 the OOB predictive accuracy was selected to represent the performance
166
and predictive accuracy of the model. To measure the performance, the correlation
167
coefficient (R2) and root mean square error (RMSE) between the predictions and
168
observations were calculated as predictive accuracy metrics.24 Then, to evaluate the
169
predictive accuracy of the RF model, linear regression of quantitative factors was
170
performed to predict nanotoxicity (accumulation and toxicity of ENPs). To evaluate
171
the performance of the model with different data, training datasets (from 20% to
172
100%) were randomly selected from original datasets, with ten repetitions, and the R2
173
and RMSE were then calculated.
174
Similarity network analysis
175
The similarity network was drawn using the Kamada-Kawai layout algorithm.38
176
Similarity network nodes with higher than average values were linked in the
177
proximity matrix using the R package igraph. Subsequently, to find tight-knit
178
subgroups with intrinsic commonality and to visualize the heterogeneity of the
179
datasets, the walktrap community algorithm,39 a hierarchical clustering method, was 9
ACS Paragon Plus Environment
Page 10 of 45
Page 11 of 45
Environmental Science & Technology
180
applied to the similarity network. Then, two important measurements, the density and
181
clustering coefficient (transitivity), reflecting the kernel density estimation and the
182
probability of a vertex connecting to adjacent vertices, were applied to measure the
183
tendency of the data to cluster.40, 41 Given the heterogeneity in the datasets, the first to
184
the third quartiles and the most frequent categories were applied to represent the
185
distribution of quantitative and qualitative data in each cluster, respectively.
186
Factor-toxicity dependence analysis and nanotoxicity prediction
187
Two methods were used to analyze the complex factor-toxicity dependencies. The
188
first method was to visualize the average partial relationships between the adverse
189
impacts (accumulation and toxicity) of ENPs and factors by partial dependence plots42,
190
43
191
was then used to estimate statistically significant differences (p < 0.05) between
192
observations (p1) and predictions (p2). The above method was suitable for visualizing
193
low-dimensional (one or two) parameters. Similarity network analysis with
194
hierarchical clustering could identify tight-knit clusters with intrinsic similarity and
195
common properties. Similarity network analysis was also employed to visualize
196
high-dimensional parameters. The classification analysis of RF was used to extract
197
reliable factor-toxicity dependencies shared in clusters. The prediction analysis used
198
data with a uniform distribution over a certain range for each factor. The median
199
values or most frequent categories were used to describe nonsignificant attributes
and individual conditional expectation (ICE) plots.44 Analysis of variance (ANOVA)
10
ACS Paragon Plus Environment
Environmental Science & Technology
200
(rodent age, ENP shape, exposure duration, surface ligands for accumulation studies
201
and indicators for toxicity studies). The sperm parameters were selected for measuring
202
reproductive toxicity due to their low heterogeneity.
203
To validate the analysis of multidimensional factor-toxicity dependence, the
204
over-80% vote rate and the classification rate were calculated to measure the data
205
matching the cluster and the RF classification performance. To estimate the predictive
206
performance, the predictions (TP and CP) were compared with the observations (TO
207
and CO) of data meeting the conditions of the six important attributes (ENP type, size,
208
exposure method, exposure time, dosage, element for accumulation studies and
209
surface ligand for toxicity studies) screened by variable importance, shown in Figure
210
2, and the observations in the original cluster (TC and CC). ANOVA with Tukey’s
211
multiple comparison test was performed using IBM SPSS Statistics 20. In the RF
212
analysis, partial dependence plots and ICE plots were generated using the
213
randomForest package and the pdp package in R software version 3.3.2, respectively.
214 215
RESULTS AND DISCUSSION
216
Analysis of highly heterogeneous data
217
Mining data regarding the reproductive toxicity of ENPs followed the process
218
described in the Methods section (Tables S1 and S2). The data from eighty-two
219
publications directly related to the male reproductive toxicity of ENPs were analyzed. 11
ACS Paragon Plus Environment
Page 12 of 45
Page 13 of 45
Environmental Science & Technology
220
Regarding the accumulation and the reproductive toxicity of ENPs, 242 and 250 data
221
points were obtained, respectively, as summarized in Table 1. The quantitative
222
attributes (rodent age, ENP size, dosage, exposure time and duration) and qualitative
223
attributes (ENP type, shape, surface ligand, exposure method, and element) were
224
tested in the RF analysis (Table 1). The aforementioned ten qualitative and
225
quantitative attributes covered the main issues related to the reproductive impacts of
226
ENPs.45 The distribution of the extracted data points is presented in Figure S1. No
227
particular studies dominating the databases were observed, and further analysis was
228
performed.
229
However, the data heterogeneity was still a challenge for further analysis. Table 1
230
provides an overview of the heterogeneity of databases regarding the accumulation
231
and toxicity of ENPs. The large standard deviations for the accumulation
232
concentration (e.g., 1437 ± 6201 ng/g) and the toxicity (e.g., 0.12 ± 0.25) of ENPs
233
demonstrated the heterogeneity of the data (Table 1), leading to difficulty in drawing
234
an unbiased conclusion. In addition to the influence of the physicochemical properties
235
of ENPs and the experimental conditions, the method of exposure to NPs can lead to
236
disruption of the cellular redox status and activation of compensatory mechanisms
237
(e.g., changes in starch granules, lysosomes and autophagy), resulting heterogeneous
238
toxicological data.46 First, linear regression was used to fit the heterogeneous data and
239
explore the connections of quantitative factors with the accumulation concentration 12
ACS Paragon Plus Environment
Environmental Science & Technology
240
and toxicity of ENPs. As shown in Figure S2, the small R2 (less than 0.05 for
241
accumulation studies and less than 0.12 for toxicity studies) and the large RMSE
242
(greater than 6200 ng/g for accumulation studies and greater than 0.25 for
243
reproductive toxicity) values indicated that the traditional linear regression model had
244
no capacity to handle the database heterogeneity. To compare with the above linear
245
regression model, the R2 and RMSE were also used to evaluate the performance of the
246
RF model (Table 2 and Figure 1). Higher R2 (greater than 0.62) and smaller RMSE
247
(less than 6000 ng/g) values indicated that the RF model was better for analyzing the
248
heterogeneous data than the linear regression model. Moreover, the RMSE and R2
249
gradually decreased and increased with increasing training data, respectively (Figure
250
1), indicating that the RF model would exhibit better performance with more data.
251
These measurements ensured the reliability of the meta-analysis machine learning
252
workflow and the results of the model. Regarding the above heterogeneous data, the
253
RF model combined with similarity network and hierarchical clustering analyses was
254
used to handle and visualize data heterogeneity, to screen priority factors determining
255
ENP toxicity and to reveal factor-toxicity dependencies, as presented below.
256
Screening priority factors determining reproductive toxicity and contributing to
257
heterogeneity
258
The present work screened priority factors determining toxicity by measuring the
259
increase in factor importance with increasing MSE. According to the MSE increase 13
ACS Paragon Plus Environment
Page 14 of 45
Page 15 of 45
Environmental Science & Technology
260
illustrated in Figure 2, two top priority factors with the highest MSE increases (greater
261
than 32%) for NP accumulation were screened: the exposure method (e.g., i.p., i.v.
262
and ga.) and ENP type (e.g., MWCNT, ZnO and Ag). The above two factors should
263
be well considered in the design of animal experiments and the assessment of
264
nanomaterial accumulation. In contrast, toxicity indicators (e.g., sperm parameters,
265
testosterone level and testis index) and the ENP type (e.g., Ag, MWCNT and TiO2)
266
with the highest MSE increase (greater than 15%) were identified as the two top
267
priority factors for reproductive toxicity. Therefore, the determination of nanomaterial
268
reproductive toxicity depended on the toxicity indicators and ENP type used in the
269
experiments.
270
To find the sources of heterogeneity, the present work classified the priority factors
271
into subsets and evaluated the heterogeneity in these subsets using an RF regression
272
model. As shown in Table 2, the RF model had good predictive performance in all of
273
the subsets, and the R2 values were greater than 0.6. The differences between the
274
observations and predictions modeled by the RMSE were obvious in the accumulation
275
studies due to the presence of few data sources with large predictive errors (Figure
276
S3a). In other words, the RMSE could be used to find the sources of large predictive
277
errors and the contributors to heterogeneity. In the toxicity studies (Table 2), the small
278
RMSE (e.g., 0.091 and 0.111) reflected the high predictive accuracy in those subsets.
279
The limited data points (n = 37 and n = 56) of the subsets with carbonaceous ENPs 14
ACS Paragon Plus Environment
Environmental Science & Technology
280
and the testis index contributed to the high heterogeneity, indicated by small R2 values
281
(0.489 and 0.383, respectively), in the prediction of reproductive toxicity (Table 2).
282
However, compared with the accumulation studies (Figure S3a), the toxicity studies
283
did not contain data that dominantly affected the regression performance or reduced
284
the predictive accuracy (Figure S3b).
285
To further identify sources of heterogeneity, the present work built similarity
286
networks (Figures 3 and 4) to visualize the construction and heterogeneity of the
287
datasets based on the RF proximity matrix. To analyze the major constructions of the
288
similarity network, hierarchical clustering was performed to find tight-knit clusters,
289
which are labeled with different colors in Figure 3. Table S5 clearly lists the densities
290
and clustering coefficients, which were provided as measures of the aggregation of
291
each cluster in the similarity network for NP accumulation and reproductive toxicity.
292
With few connections, sparse clusters demonstrated the heterogeneity of accumulation
293
or toxicity study-related data in terms of factor-toxicity dependencies. For example,
294
the sparse cluster 8 (C8), with low density (0.692) and clustering coefficient (0.782)
295
values, indicates high heterogeneity in terms of the properties of ENPs and the
296
experimental conditions, consistent with the wide range (0-14191 ng/g) of ENP
297
accumulation concentrations. Additionally, tight clusters tended to present lower
298
accumulation concentrations (e.g., C1, C2 and C3 in accumulation studies) and higher
299
toxicity levels (e.g., C7, C8 and C9 in toxicity studies) than sparse clusters. The 15
ACS Paragon Plus Environment
Page 16 of 45
Page 17 of 45
Environmental Science & Technology
300
accumulation studies contained more separate clusters (e.g., C1 and C3) with specific
301
commonalities than the toxicity studies, indicating the diverse factor-accumulation
302
dependencies in the accumulation studies. To assess the heterogeneity of the priority
303
factors (i.e., ENP type and exposure method for accumulation studies; ENP type and
304
indicator for toxicity studies), Figure 4 presents similarity networks colored by the
305
categories of the priority factors. The data for some categories (e.g., Au in Figure 4a,
306
i.v. in Figure 4b and sperm parameters in Figure 4c) were separated from neighbor
307
clusters and gathered in tight clusters with great homogeneity. For example, the Au
308
NPs gathered in two tight clusters for NP accumulation. Compared with the clusters in
309
Figure 4b, the two clusters of Au NPs were separated by different exposure methods
310
(i.v. and i.p.). The data also suggested that the exposure method was the priority
311
factor that more dominantly determined ENP accumulation than the ENP type, which
312
was verified by the variable importance in Figure 2. Furthermore, in Figure 4d,
313
compared with the cluster of sperm parameters, the testosterone and testis index
314
clusters contained sparse connections, indicating the high heterogeneity of the data
315
regarding testis index-toxicity and testosterone-toxicity. Compared with Figure 4a,
316
Figure 4c, with nodes of diverse types of ENPs clustering closely together, shows the
317
high homogeneity of the relationships between ENP type and ENP toxicity. The
318
figure also confirms that the exposure methods with tight clusters play more important
319
roles in ENP toxicity than the ENP types in Figure 2. The employed similarity 16
ACS Paragon Plus Environment
Environmental Science & Technology
320
networks identified the heterogeneity and homogeneity of factor-toxicity
321
dependencies from sparse and tight clusters, respectively. The above analysis of
322
factor-toxicity dependence provided a reliable foundation from which to predict the
323
reproductive toxicity of ENPs, as described below.
324
Predicting the reproductive toxicity of ENPs
325
Visualizing multidimensional factor-toxicity dependencies is effective for
326
interpreting the relationship between factors and toxicity hidden in the RF model and
327
contributes to the prediction of nanotoxicity. The partial dependence plots shown in
328
Figure 5 provided an easy way to deeply understand the relationships between factors
329
(ENP properties and exposure conditions) and biological impacts (ENP accumulation
330
and reproductive toxicity) (more details are presented in Figure S4). As shown in
331
Figure 5a, sperm parameters exhibited higher sensitivity for indicating reproductive
332
toxicity risk than did other reproductive toxicity indicators (e.g., testosterone and the
333
testis index) (p1, p2 < 0.0001). The above result was consistent with the better
334
predictive performance of RF in the sperm parameter indicator subset than other
335
subsets of reproductive indicators (Table 2). Compared with other types of ENPs,
336
CdTe quantum dots and Si NPs were likely to induce significantly high reproductive
337
toxicity (p1 = 0.023, p2 = 0.007). Previous studies have demonstrated that CdTe
338
quantum dots can result in acute reproductive toxicity by destroying the structure of
339
testicular tissue.47 As shown in Figure 5b, Fe3O4, Fe2O3 and ZnO NPs exhibited 17
ACS Paragon Plus Environment
Page 18 of 45
Page 19 of 45
Environmental Science & Technology
340
significantly higher accumulation concentrations than the other types of ENPs. Figure
341
S4 also illustrates that the ENPs with major elements (e.g., Zn and Fe) tended to reach
342
significantly high accumulation concentrations in the testes (p1, p2 < 0.0001). As
343
demonstrated by animal experiments,48, 49 essential elements (e.g., Fe and Zn) with
344
high absorptivity in spermatogenesis could cross the blood-testis barrier and easily
345
accumulate in the testes. Interestingly, these NPs (e.g., Fe and Zn ENPs) reached
346
higher accumulation concentrations, but with less reproductive toxicity, as shown in
347
Figures 5a and 5b. Such inconsistency between the accumulation concentration and
348
reproductive toxicity of ENPs deserves attention in the assessment and control of
349
nanotoxicity.50, 51 Figure 5b shows that i.p. injection posed a significantly higher risk
350
of ENP accumulation in the testes than other exposure pathways (p1, p2 < 0.0001),
351
probably because i.p. injection could effectively transport ENPs to the testes.48 The
352
above result suggests that assessments of nanotoxicity must consider the exposure
353
pathway: i.p. injection may overestimate the nanotoxicity or pose a higher risk.
354
To reveal any multidimensional factor-toxicity dependencies, the present work
355
applied data mining technology to the similarity network (Figure 3), a platform for
356
predicting the accumulation and reproductive toxicity of ENPs under specific
357
conditions. The tight clusters in the similarity networks shared commonalities in terms
358
of ENP properties and experimental conditions. Extracting the reliable commonalities
359
shared in the clusters could visualize the multidimensional factor-toxicity 18
ACS Paragon Plus Environment
Environmental Science & Technology
360
dependencies. The present work applied RF classification analysis to achieve
361
excellent classification accuracy based on hierarchical clustering with high vote rates
362
(greater than 95%) and small error rates (less than 4%), as listed in Table S6. The high
363
classification accuracy demonstrated the good affiliation of the extracted
364
factor-dependencies (Table 3) with their original clusters.
365
The specific dependencies of significant factors (e.g., ENP type and size) on the
366
accumulation concentration and reproductive toxicity of ENPs are listed in Table 3
367
(details are presented in the Methods section). The RF classification obtained high
368
vote rates (greater than 80%) for each factor dependency, as shown in Table 3,
369
indicating the good affiliation of each extracted factor dependency with their original
370
clusters. The predictions (CP and TP) and observations (CP and TP) were compared to
371
measure the predictive performance. In the accumulation studies, the similar
372
prediction and observation ranges (e.g., C1, 4-15 ng/g vs 0.9-18.9 ng/g; C3, 177-252
373
ng/g vs 183-287 ng/g) verified the acceptable predictive performance. However, there
374
was remarkable predictive error (1562-12110 ng/g vs 200 ng/g) in C7 accumulation
375
studies, indicating the difficulty in predicting the accumulation concentrations of
376
ENPs with essential elements (e.g., Zn and Fe). In the toxicity studies, there were
377
slight differences between the predicted and observed ranges, and comparable toxicity
378
levels were achieved in the predictions and observations (e.g., C8: 0.26-0.33 vs
379
0.14-0.42). As denoted for the C9 toxicity studies, the model performed well in 19
ACS Paragon Plus Environment
Page 20 of 45
Page 21 of 45
Environmental Science & Technology
380
predicting the reproductive toxicity of three ENP types in terms of a high vote rate
381
(0.89). However, there was a slight difference between the predicted and observed
382
ranges (0.14-0.19 vs 0.01-0.15), which indicated the complicated nature of
383
reproductive toxicity, especially for carbon nanomaterials. There are some works
384
related to the complicated biocompatibility of graphene, for example, the roles of
385
surface modification in the growth, proliferation and differentiation of cells.52, 53 In
386
addition to cytotoxic materials, graphene materials also showed genotoxic effects,
387
which were dependent on size, shape and concentration.5-10 As a result, the toxicity
388
study predictions presented low predictive accuracy in terms of greater error between
389
predictions and observations due to the high heterogeneity of the toxicity studies, and
390
the accumulation study predictions exhibited higher accuracy in comparison.
391
According to the prediction results, Ag NPs could attain high accumulation
392
concentrations (177-252 ng/g) at small sizes (15-40 nm), medium-term exposure
393
times (14-28 days) and wide dosage ranges (1-100 mg/kg) by ga. or i.v. exposure. The
394
above results indicated that dosage over a certain range was not a sensitive factor for
395
the accumulation of Ag NPs in the testes. Furthermore, compared with Ag NPs in two
396
dependencies (C4 vs C7 of toxicity studies in Table 3), a citrate coating could reduce
397
the reproductive toxicity of Ag NPs (0.04-0.06 vs 0.06-0.08). This result supported
398
the idea that the surface ligands of Ag NPs should be taken into consideration for
399
controlling and reducing Ag nanotoxicity. 20
ACS Paragon Plus Environment
Environmental Science & Technology
400
Perspectives of machine learning in nanotoxicity assessment and prediction
401
The rapid development and application of ENPs in various fields have attracted
402
much attention to nanosafety worldwide.2, 54 There are two challenges to the scientific
403
assessment of nanosafety: the numerous attributes (various ENP properties and
404
experimental conditions) related to nanosafety and the high heterogeneity of the data.
405
There are some special biological end points and mechanisms for the toxicological
406
study of nanomaterials compared with common pollutants or xenobiotics, such as
407
cellular uptake, cell membrane and cell wall damage by sharp edges and the release of
408
monomers and ions with stronger toxicity than the parent nanomaterials.55, 56 The
409
above issues contributed the high heterogeneity of the data. With the robust capacity
410
to recognize meaningful patterns in large and heterogeneous datasets, machine
411
learning can be used in a wide range of applications, such as cellular toxicity
412
prediction,19 synthetic reaction prediction24 and personality judgment.57 Thus,
413
machine learning may solve the above two problems in nanosafety assessment and
414
prediction. The present work applied a workflow combining machine learning (RF)
415
with similarity network analysis to provide a roadmap to robustly handle highly
416
heterogeneous data, reveal multidimensional factor-toxicity dependencies and predict
417
the nanotoxicity of various ENPs (more than eighteen types). However, the data
418
quality influences the performance of data-driven RF models.19 As stated above, the
419
lack of ENP characterization and the scarcity of data for some ENPs in the analyzed 21
ACS Paragon Plus Environment
Page 22 of 45
Page 23 of 45
Environmental Science & Technology
420
references have influenced the performance of the model. Integrating RF with
421
similarity network analysis is an option for analyzing heterogeneous data from the
422
literature (Table 3 and Figure 3) for other nanotoxicological studies (including those
423
of female reproductive toxicity). Machine learning also reduces the use of animals in
424
experiments and will be a powerful tool in the assessment and prediction of complex
425
nanosafety issues. In addition, it is well known that reproductive toxicity evaluations
426
are very complicated and require detailed mechanistic studies. The method applied in
427
this research revealed the complex relationships between each piece of data in the RF
428
model and allowed the visualization of hidden complex factor-toxicity dependencies
429
via combination with a similarity network model. Animal experiments provide data
430
and information to support such analyses; thus, animal experiments, especially those
431
investigating toxicological mechanisms, are irreplaceable in assessing the
432
reproductive toxicity of ENPs. Notably, the data or databases are the most important
433
issue in machine learning. The intentional establishment of big data or databases
434
determines the quality of machine learning applications. In the present work, the
435
priority factors determining reproductive toxicity were successfully screened from
436
highly heterogeneous data by integrating data mined from the literature and an RF
437
model. By combining similarity network analysis and hierarchical clustering, the
438
sources of highly heterogeneous data were identified. Then, the factor-toxicity
439
dependencies were extracted and visualized. Subsequently, the prediction of 22
ACS Paragon Plus Environment
Environmental Science & Technology
Page 24 of 45
440
nanotoxicity was achieved based on the screened priority factors. The above
441
workflow and used models are not specific to male toxicity; they are also suitable for
442
examine female reproductive toxicity, among others.
443 444
ASSOCIATED CONTENT
445
Supporting Information Available
446
Methods of data extraction, data treatment and ENP characterization and related
447
results are provided in Tables S1-S6 and Figures S1-S5.
448 449
AUTHOR INFORMATION
450
Corresponding author
451
*Email:
452
86-22-23507800.
453
The authors declare no competing financial interests.
[email protected] (X.H.).
Tel.:
86-22-23507800;
fax:
454 455
ACKNOWLEDGMENTS
456
This work was financially supported by the National Natural Science Foundation of
457
China (grant nos. 21577070 and 21307061).
458 459 23
ACS Paragon Plus Environment
Page 25 of 45
Environmental Science & Technology
460
REFERENCES
461
1.
462
D.; Hull, M. S., Nanotechnology in the real world: Redeveloping the nanomaterial
463
consumer products inventory. Beilstein J. Nanotech. 2015, 6, 1769-80.
464
2.
465
113 (5), 3815-3835.
466
3.
467
Maurizi, L.; Jochum, W.; Krug, H. F.; Buerkithurnherr, T.; Mandach, U. V.,
468
Bidirectional Transfer Study of Polystyrene Nanoparticles across the Placental Barrier
469
in an ex Vivo Human Placental Perfusion Model. Environ. Health Perspect. 2015,
470
123 (12), 1280-6.
471
4.
472
Perez-Potti, A.; Cookman, J.; Garry, D.; P. Silveira, C.; Boselli, L.; Pelaz, B.; Serchi,
473
T.; Cambier, S.; Gutleb, A. C.; Feliu, N.; Yan, Y.; Salmona, M.; Parak, W. J.;
474
Dawson, K. A.; Bigini, P., Influence of Size and Shape on the Anatomical
475
Distribution of Endotoxin-Free Gold Nanoparticles. ACS Nano 2017, 11 (6),
476
5519-5529.
477
5.
478
A., Cyto and genotoxicities of graphene oxide and reduced graphene oxide sheets on
479
spermatozoa. RSC Adv. 2014, 4 (52), 27213-27223.
Vance, M. E.; Kuiken, T.; Vejerano, E. P.; Mcginnis, S. P.; Jr, M. F. H.; Rejeski,
Hu, X.; Zhou, Q., Health and Ecosystem Risks of Graphene. Chem. Rev. 2013,
Grafmueller, S.; Manser, P.; Diener, L.; Diener, P. A.; Maederalthaus, X.;
Talamini, L.; Violatto, M. B.; Cai, Q.; Monopoli, M. P.; Kantner, K.; Krpetić, Ž.;
Hashemi, E.; Akhavan, O.; Shamsara, M.; Rahighi, R.; Esfandiar, A.; Tayefeh,
24
ACS Paragon Plus Environment
Environmental Science & Technology
480
6.
Kaner, E. F.; Lock, C. A.; Mcavoy, B. R.; Heather, N.; Gilvarry, Wrapping
481
bacteria by graphene nanosheets for isolation from environment, reactivation by
482
sonication, and inactivation by near-infrared irradiation. J. Phys. Chem. B 2011, 115
483
(19), 6279-6288.
484
7.
485
Protein corona-mediated mitigation of cytotoxicity of graphene oxide. ACS Nano
486
2011, 5 (5), 3693-3700.
487
8.
488
Cytotoxicity effects of graphene and single-wall carbon nanotubes in neural
489
phaeochromocytoma-derived PC12 cells. ACS Nano 2010, 4 (6), 3181-3186.
490
9.
491
against bacteria. ACS Nano 2010, 4 (10), 5731-5736.
492
10. Akhavan, O.; Ghaderi, E., Escherichia coli bacteria reduce graphene oxide to
493
bactericidal graphene in a self-limiting manner. Carbon 2012, 50 (5), 1853-1860.
494
11. Semmler-Behnke, M.; Lipka, J.; Wenk, A.; Hirn, S.; Schäffler, M.; Tian, F.;
495
Schmid, G.; Oberdörster, G.; Kreyling, W. G., Size dependent translocation and fetal
496
accumulation of gold nanoparticles from maternal blood in the rat. Part. Fibre
497
Toxicol. 2014, 11 (1) 33.
498
12. Yang, X.; Gondikas, A. P.; Marinakos, S. M.; Auffan, M.; Liu, J.; Hsu-Kim, H.;
499
Meyer, J. N., Mechanism of Silver Nanoparticle Toxicity Is Dependent on Dissolved
Hu, W.; Peng, C.; Lv, M.; Li, X.; Zhang, Y.; Chen, N.; Fan, C.; Huang, Q.,
Zhang, Y.; Ali, S. F.; Dervishi, E.; Xu, Y.; Li, Z.; Casciano, D.; Biris, A. S.,
Akhavan, O.; Ghaderi, E., Toxicity of graphene and graphene oxide nanowalls
25
ACS Paragon Plus Environment
Page 26 of 45
Page 27 of 45
Environmental Science & Technology
500
Silver and Surface Coating in Caenorhabditis elegans. Environ. Sci. Technol. 2012, 46
501
(2), 1119-1127.
502
13. Kataoka, C.; Nakahara, K.; Shimizu, K.; Kowase, S.; Nagasaka, S.; Ifuku, S.;
503
Kashiwada, S., Salinity-dependent toxicity of water-dispersible, single-walled carbon
504
nanotubes to Japanese medaka embryos. J. Appl. Toxicol. 2017, 37 (4), 408-416.
505
14. Gajewicz, A., How to judge whether QSAR/read-across predictions can be
506
trusted: a novel approach for establishing a model's applicability domain. Environ.
507
Sci-Nano 2018, 5 (2), 408-421.
508
15. Wang, D.; Shi, J.; Xiong, Y.; Hu, J.; Lin, Z.; Qiu, Y.; Cheng, J., A QSAR-based
509
mechanistic study on the combined toxicity of antibiotics and quorum sensing
510
inhibitors against Escherichia coli. J. Hazard. Mater. 2018, 341, 438-447.
511
16. Tropsha, A., Best Practices for QSAR Model Development, Validation, and
512
Exploitation. Qsar Comb. Sci. 2010, 29 (6-7), 476-488.
513
17. Ghahramani, Z., Probabilistic machine learning and artificial intelligence. Nature
514
2015, 521 (7553), 452.
515
18. Jordan, M. I.; Mitchell, T. M., Machine learning: Trends, perspectives, and
516
prospects. Science 2015, 349 (6245), 255-260.
517
19. Oh, E.; Liu, R.; Nel, A.; Gemill, K. B.; Bilal, M.; Cohen, Y.; Medintz, I. L.,
518
Meta-analysis of cellular toxicity for cadmium-containing quantum dots. Nat.
519
Nanotechnol. 2016, 11 (5), 479-486. 26
ACS Paragon Plus Environment
Environmental Science & Technology
520
20. Gernand, J. M.; Casman, E. A., A meta-analysis of carbon nanotube pulmonary
521
toxicity studies--how physical dimensions and impurities affect the toxicity of carbon
522
nanotubes. Risk Analysis 2014, 34 (3), 583-597.
523
21. Wang, X.; Zou, C.; Gao, X.; Guan, X.; Zhang, W.; Zhang, Y.; Shi, X.; Chen, X.,
524
Nitrous oxide emissions in Chinese vegetable systems: A meta-analysis. Environ.
525
Pollut. 2018, 239, 375-383.
526
22. Stringhini, S.; Carmeli, C.; Jokela, M.; Avendano, M.; Muennig, P.; Guida, F.;
527
Ricceri, F.; D'Errico, A.; Barros, H.; Bochud, M., Socioeconomic status and the 25 x
528
25 risk factors as determinants of premature mortality: a multicohort study and
529
meta-analysis of 1.7 million men and women. Lancet 2017, 389(10075), 1229-1237.
530
23. Ettehad, D.; Emdin, C. A.; Kiran, A.; Anderson, S. G.; Callender, T.; Emberson,
531
J.; Chalmers, J.; Rodgers, A.; Rahimi, K., Blood pressure lowering for prevention of
532
cardiovascular disease and death: a systematic review and meta-analysis. Lancet
533
2015, 387 (10022), 957–967.
534
24. Ahneman, D. T.; Estrada, J. G.; Lin, S.; Dreher, S. D.; Doyle, A. G., Predicting
535
reaction performance in C-N cross-coupling using machine learning. Science 2018,
536
360 (6385), 186-190.
537
25. Rauber, P. E.; Fadel, S. G.; Falcao, A. X.; Telea, A. C., Visualizing the Hidden
538
Activity of Artificial Neural Networks. IEEE T. Vis. Comput. Gr. 2017, 23 (1),
539
101-110. 27
ACS Paragon Plus Environment
Page 28 of 45
Page 29 of 45
Environmental Science & Technology
540
26. Breiman, L., Random Forest. Mach. Learn. 2001, 45, 5-32.
541
27. Eze, I. C.; Hemkens, L. G.; Bucher, H. C.; Hoffmann, B.; Schindler, C.; Kuenzli,
542
N.; Schikowski, T.; Probst-Hensch, N. M., Association between Ambient Air
543
Pollution and Diabetes Mellitus in Europe and North America: Systematic Review
544
and Meta-Analysis. Environ. Health Perspect. 2015, 123 (5), 381-389.
545
28. Ema, M.; Hougaard, K. S.; Kishimoto, A.; Honda, K., Reproductive and
546
developmental toxicity of carbon-based nanomaterials: A literature review.
547
Nanotoxicology 2016, 10 (4), 391-412.
548
29. Tian, X.; Zhu, M.; Du, L.; Wang, J.; Fan, Z.; Liu, J.; Zhao, Y.; Nie, G.,
549
Intrauterine inflammation increases materno-fetal transfer of gold nanoparticles in a
550
size-dependent manner in murine pregnancy. Small 2013, 9 (14), 2432-2439.
551
30. Hong, F.; Zhao, X.; Si, W.; Ze, Y.; Wang, L.; Zhou, Y.; Hong, J.; Yu, X.; Sheng,
552
L.; Liu, D.; Xu, B.; Zhang, J., Decreased spermatogenesis led to alterations of
553
testis-specific gene expression in male mice following nano-TiO2 exposure. J.
554
Hazard. Mater. 2015, 300, 718-728.
555
31. Schaumann, G. E.; Baumann, T.; Lang, F.; Metreveli, G.; Vogel, H. J.,
556
Engineered nanoparticles in soils and waters. Sci. Total Environ. 2015, 535, 1-2.
557
32. Loosli, F.; Le, C. P.; Stoll, S., Effect of electrolyte valency, alginate concentration
558
and pH on engineered TiO₂ nanoparticle stability in aqueous solution. Sci. Total
559
Environ. 2015, 535, 28-34. 28
ACS Paragon Plus Environment
Environmental Science & Technology
560
33. Balasubramanian, S. K.; Jittiwat, J.; Manikandan, J.; Ong, C. N.; Yu, L. E.; Ong,
561
W. Y., Biodistribution of gold nanoparticles and gene expression changes in the liver
562
and spleen after intravenous administration in rats. Biomaterials 2010, 31 (8),
563
2034-2042.
564
34. Khorsandi, L.; Orazizadeh, M.; Moradi-Gharibvand, N.; Hemadi, M.; Mansouri,
565
E., Beneficial effects of quercetin on titanium dioxide nanoparticles induced
566
spermatogenesis defects in mice. Environ Sci Pollut Res Int 2016.
567
35. Creutzenberg, O.; Kock, H.; Schaudien, D., Translocation and biokinetic
568
behavior of nanoscaled europium oxide particles within 5 days following an acute
569
inhalation in rats. J. Appl. Toxicol. 2016, 36 (3), 474-478.
570
36. Nouwen, J.; Cornelis, C.; De, F. R.; Wevers, M.; Viaene, P.; Mensink, C.; Patyn,
571
J.; Verschaeve, L.; Hooghe, R.; Maes, A., Health risk assessment of dioxin emissions
572
from municipal waste incinerators: the Neerlandquarter (Wilrijk, Belgium).
573
Chemosphere 2001, 43 (4), 909-923.
574
37. Bylander, T., Estimating Generalization Error on Two-Class Datasets Using
575
Out-of-Bag Estimates. Mach. Learn. 2002, 48 (1-3), 287-297.
576
38. Kamada, T.; Kawai, S., An algorithm for drawing general undirected graphs.
577
Inform. Process. Lett. 1989, 31 (1), 7-15.
578
39. Ilany, A.; Akçay, E., Social inheritance can explain the structure of animal social
579
networks. Nat. Commun. 2016, 7, 12084. 29
ACS Paragon Plus Environment
Page 30 of 45
Page 31 of 45
Environmental Science & Technology
580
40. Newman, M. E. J., The Structure and Function of Complex Networks. Siam
581
Review 2003, 45 (2), 167-256.
582
41. Opsahl, T.; Panzarasa, P., Clustering in weighted networks. Social Networks
583
2009, 31 (2), 155-163.
584
42. Cutler, D. R.; Edwards, T. C., Jr.; Beard, K. H.; Cutler, A.; Hess, K. T., Random
585
forests for classification in ecology. Ecology 2007, 88 (11), 2783-2792.
586
43. Allen, T.; Murray, K. A.; Zambrana-Torrelio, C.; Morse, S. S.; Rondinini, C.; Di
587
Marco, M.; Breit, N.; Olival, K. J.; Daszak, P., Global hotspots and correlates of
588
emerging zoonotic diseases. Nat. Commun. 2017, 8 (1), 1124.
589
44. Goldstein, A.; Kapelner, A.; Bleich, J.; Pitkin, E., Peeking Inside the Black Box:
590
Visualizing Statistical Learning With Plots of Individual Conditional Expectation. J.
591
Comput. Graph. Stat. 2013, 24 (1), 44-65.
592
45. Ema, M.; Kobayashi, N.; Naya, M.; Hanai, S.; Nakanishi, J., Reproductive and
593
developmental toxicity studies of manufactured nanomaterials. Reprod. Toxicol. 2010,
594
30 (3), 343-352.
595
46. Hu, X.; Li, D.; Gao, Y.; Mu, L.; Zhou, Q., Knowledge gaps between
596
nanotoxicological research and nanomaterial safety. Environ. Int. 2016, 94, 8-23.
597
47. Li, X.; Yang, X.; Yuwen, L.; Yang, W.; Weng, L.; Teng, Z.; Wang, L.,
598
Evaluation of toxic effects of CdTe quantum dots on the reproductive system in adult
599
male mice. Biomaterials 2016, 96, 24-32. 30
ACS Paragon Plus Environment
Environmental Science & Technology
600
48. Li, C. H.; Shen, C. C.; Cheng, Y. W.; Huang, S. H.; Wu, C. C.; Kao, C. C.; Liao,
601
J. W.; Kang, J. J., Organ biodistribution, clearance, and genotoxicity of orally
602
administered zinc oxide nanoparticles in mice. Nanotoxicology 2012, 6 (7), 746-756.
603
49. Sundarraj, K.; Manickam, V.; Raghunath, A.; Periyasamy, M.; Viswanathan, M.
604
P.; Perumal, E., Repeated exposure to iron oxide nanoparticles causes testicular
605
toxicity in mice. Environ. Toxicol. 2017, 32 (2), 594-608.
606
50. Choi, J.; Kim, H.; Kim, P.; Jo, E.; Kim, H. M.; Lee, M. Y.; Jin, S. M.; Park, K.,
607
Toxicity of zinc oxide nanoparticles in rats treated by two different routes: Single
608
intravenous injection and single oral administration. J. Toxicol. Environ. Health. A
609
2015, 78 (4), 226-243.
610
51. Wang, X.; Zhang, J.; Yang, X.; Tang, Z.; Hu, Y.; Chen, B.; Tang, J., In vivo
611
assessment of hepatotoxicity, nephrotoxicity and biodistribution using
612
3-aminopropyltriethoxysilane-coated magnetic nanoparticles (APTS-MNPs) in ICR
613
mice. Chin. Sci. Bull. 2014, 59 (16), 1800-1808.
614
52. Akhavan, O.; Ghaderi, E.; Emamy, H.; Akhavan, F., Genotoxicity of graphene
615
nanoribbons in human mesenchymal stem cells. Carbon 2013, 54 (2), 419-431.
616
53. Akhavan, O.; Ghaderi, E.; Akhavan, A., Size-dependent genotoxicity of graphene
617
nanoplatelets in human stem cells. Biomaterials 2012, 33 (32), 8017-8025.
31
ACS Paragon Plus Environment
Page 32 of 45
Page 33 of 45
Environmental Science & Technology
618
54. Mu, Q.; Jiang, G.; Chen, L.; Zhou, H.; Fourches, D.; Tropsha, A.; Yan, B.,
619
Chemical Basis of Interactions Between Engineered Nanoparticles and Biological
620
Systems. Chem. Rev. 2014, 114 (15), 7740-7781.
621
55. Li, M.; Yue, G.; Hu, X., l -Cysteine: A biocompatible, breathable and beneficial
622
coating for graphene oxide. Biomaterials 2015, 52 (1), 301-311.
623
56. Laycock, A.; Diezortiz, M.; Larner, F.; Dybowska, A.; Spurgeon, D.;
624
Valsamijones, E.; Rehkämper, M.; Svendsen, C., Earthworm Uptake Routes and
625
Rates of Ionic Zn and ZnO Nanoparticles at Realistic Concentrations, Traced Using
626
Stable Isotope Labeling. Environ. Sci. Technol. 2015, 50 (1), 412-419.
627
57. Youyou, W.; Kosinski, M.; Stillwell, D., Computer-based personality judgments
628
are more accurate than those made by humans. P. Natl. Acad. Sci. USA 2015, 112 (4),
629
1036-1040.
630 631 632 633 634 635 636 637
32
ACS Paragon Plus Environment
Environmental Science & Technology
Page 34 of 45
638
Table 1. Data analysis of ENP accumulation (242 data points) and reproductive
639
toxicity (250 data points). Five qualitative factors (rodent age, ENP size, and exposure
640
time, duration and dosage) and five quantitative factors (ENP type, surface ligand,
641
shape, exposure method, and element) were analyzed for each study.
642
“#”, the elements of the ENPs are major elements in organisms. “★”, the types of
643
ENPs include CeO2, Fe2O3, Fe3O4, VO, nano-alloy particles (Au and Cu), CoCr,
644
Eu2O3, MWCNTs and PbS NPs. “⚐”, special surface ligands, including
645
3-aminopropyltriethoxysilane (ATPS), carboxyl, amino, hydrophilic polymer, citrate,
646
polyvinyl pyrrolidone (PVP) and peptide. “⸙ ”, special ENP shapes, including colloid,
647
cluster, crystalline, rod, pentacle and line. “☼”, ENP types, including Se, CdTe,
648
CdSe/ZnS, Fe2O3, Si, PbS, Mn2O3, Ni and CoCr NPs. “⸙ ”, the surface ligands
649
polyethylene glycol-amino (PEG-NH2), polyethylene glycol and PVP. “♫”, ENP
650
shapes (line, rod and cube). MWCNTs, multiwalled carbon nanotubes; i.v.,
651
intravenous injection; i.p., intraperitoneal injection; i.t., intratracheal injection; ga.,
652
gavage; i.h., inhalation; sc., abdominal subcutaneous injection. Factor
Age (weeks) Mean (SD) Range Type of ENP (%) Au Ag TiO2 ZnO CdTe
Accumulation of ENPs Factor (n = 242) Age (weeks) 7.4 (2.3) Mean (SD) 1.1 to 13.0 Range Type of ENP (%) 32.2 Ag 21.9 TiO2 10.7 Au 7.0 MWCNT 6.6 ZnO 33
ACS Paragon Plus Environment
Toxicity of ENPs (n = 250)
8.0 (3.9) 1.1 to 19.3 30.8 18.8 8.4 7.6 7.2
Page 35 of 45
Environmental Science & Technology
CdS …★ Surface ligand (%) None Glutathione Polyethylene glycol …⚐ Shape (%) Sphere …⸙ Exposure method (%) i.v. ga. i.p. i.h. i.t. Elements (%)#
3.3 18.1 62.4 8.3 8.3
21.1 77.7 22.3
50 26.9 14.5 7.0 1.7
Yes
28.9
No
71.1
Size (nm) Mean (SD) Range
GO …☼ Surface ligand (%) None Citrate Hydrophilic Polymer …₂ Shape (%) Sphere …♫ Exposure method (%) ga. i.v. i.p. i.h. sc. Indicator (%) Sperm Parameters Testosterone Testis index Size (nm) Mean (SD) Range Exposure time (days) Mean (SD) Range Duration time (days) Mean (SD) Range Dosage (mg/kg) Mean (SD) Range
37.7 (51.2) 1.9 to 323
Exposure time (days)
Mean (SD) Range
15.1 (34.0) 0 to 224
Duration time (days) Mean (SD) Range Dosage (mg/kg) Mean (SD) Range Accumulation Concentration Mean (SD) Range
17.4 (33.2) 0 to 180 233 (862) 0 to 5000
6.8 19.6 81.6 6.8 4.8
6.4 91.1 8.9
43.6 29.2 20.8 3.6 2.4 66.8 10.4 22.4 41.9 (48.5) 1.4 to 245
32.3 (38.9) 1 to 224 12.0 (37.7) 0 to 365 112 (519) 0 to 5000
Toxicity 1437 (6201) -3280 to 69856
Mean (SD) Range
653 654
34
ACS Paragon Plus Environment
0.12 (0.25) -1.3 to 0.95
Environmental Science & Technology
Page 36 of 45
655
Table 2. Assessments of the performance of the random forest (RF) model using the
656
R2 correlation coefficient and root mean square error (RMSE). The subset of
657
metal-free engineering nanoparticles (ENPs) in NP accumulation studies was not
658
analyzed due to the limited number of samples (n = 3). For other subsets, the number
659
of tested samples was not less than 37. a, Number of data points in the subset. b,
660
Injection exposure methods, including intramuscular, intravenous, intraperitoneal,
661
intra-articular and abdominal subcutaneous injection. c, Noninjection exposure
662
methods, including inhalation and gavage. Accumulation of ENPs Priority Factor Overall Type of ENP Exposure method Priority Factor Overall Type of ENP Toxicity indicator
Nd a
R2
242 135 104 156 86
0.744 0.634 0.757 0.769 0.758
RMSE (ng/g) 4218 392 6177 5033 851
Subset
Nd a
R2
RMSE
Metallic Metal-free Carbonaceous Carbon-free Sperm parameters Testosterone Testis index
250 194 56 37 213 168 26 56
0.624 0.605 0.553 0.485 0.619 0.543 0.471 0.383
0.198 0.215 0.131 0.091 0.212 0.181 0.424 0.111
Subset Single-metallic element Multiple-metallic element Injectionb Noninjectionc Toxicity of ENPs
663
35
ACS Paragon Plus Environment
Page 37 of 45
Environmental Science & Technology
664
Table 3. Predictions of nanomaterial accumulation and reproductive toxicity. The factors are normalized for each study as follows: rodent age, 8
665
weeks; ENP shape, particles; duration, 14 days; ENP surface ligands, none; toxicity indicator, sperm parameters. ⚐, cluster number; *, duration,
666
0 to 4 weeks; $, lowest vote rates of data points. Cp and Tp represent the predictions of the extracted data points in the RF model for the
667
accumulation and toxicity studies, respectively. Co and To represent the observations of actual data points meeting the conditions of the six
668
important attributes (ENP type, size, exposure method, exposure time, dosage, and element for accumulation studies and surface ligand for
669
toxicity studies) for the accumulation and reproductive toxicity of ENPs, respectively. Nd and Np represent the number of actual data points and
670
publications meeting the conditions of the six attributes, respectively. Cc and Tc represent the accumulation concentration and toxicity
671
distribution of each cluster, respectively. The first to third quartiles were used as the main range.
C⚐
ENP
Element
Size Exposure (nm) Method
1 3 3
Au TiO2 Ag ZnO, Fe2O3
N Y N
10-40 i.v., i.h. 5-100 ga., i.v. 15-40 ga., i.v.
Y
20-40
7*
ga.
Predictions for Accumulation Studies Exposure Dosage Time Cp (ng/g) Nd (mg/kg) (days) 1 0.01-1 4-15 31 1-90 10-200 405-1123 5 14-28 1-100 177-252 6 10-40
30-200
1562-12110
36
ACS Paragon Plus Environment
2
Np
Co (ng/g)
4 3 3
0.9-18.9 230-3860 183-287
Vote Rate$ (%) 96.3 80.0 87.0
2
200
82.5
Cc (ng/g)
2-18 3-174 3-174 130-3080
Environmental Science & Technology
⚐
C
4 7 8
ENP
Ag Ag TiO2 MWCNT, 9 GO, Ag
Predictions for Toxicity Studies Exposure Dosage Surface Size Exposure Time Tp (mg/kg) Ligand (nm) Method (days) citrate 5-40 ga. 90-365 10-150 0.04-0.06 n 10-15 i.v. 10-15 0.1-5 0.06-0.08 n 20-70 ga. 30-60 20-200 0.26-0.33 n
3-20
i.p.
1-15
0.5-70
0.14-0.19
672
37
ACS Paragon Plus Environment
Page 38 of 45
Nd
Np
To
2 3 6
1 1 4
0.00-0.01 0.00-0.03 0.14-0.42
Vote Rate$ (%) 80.0 86.0 87.0
24
3
0.01-0.15
89.0
Tc
-0.03-0.04 0.00-0.18 0.04-0.35 0.08-0.36
Page 39 of 45
Environmental Science & Technology
673
Figure Captions
674
Figure 1. Performance of the RF model. The training sets were randomly selected
675
from original datasets, and each training set was repeated ten times. a and b,
676
accumulation of ENPs; b and d, reproductive toxicity of ENPs.
677 678
Figure 2. Importance of attributes to the reproductive toxicity and accumulation
679
concentration of ENPs analyzed by the RF model. The increase in the MSE was used
680
to measure the importance of the attributes.
681 682
Figure 3. Similarity network analysis based on the proximity matrix of the RF model.
683
The present work connected adjacent nodes with higher than average values in the
684
proximity matrix, which measures the probability of two data points occurring in the
685
same node of a tree in the RF model, and vice versa. The tight connections in the
686
similarity network indicate the high homogeneity of the clusters. The clusters with
687
different colors were classified by hierarchical clustering.
688 689
Figure 4. Similarity network analysis of priority factors based on the proximity
690
matrix of the RF model. The nodes were colored according to the categories of the
691
priority factors.
38
ACS Paragon Plus Environment
Environmental Science & Technology
692
Figure 5. Bivariate partial dependence plots of ENP types, exposure methods and
693
toxicity indicators. a, Reproductive toxicity of ENPs; b, accumulation concentration
694
of ENPs.
695 696 697 698 699 700 701 702 703 704 705 706
39
ACS Paragon Plus Environment
Page 40 of 45
Page 41 of 45
Environmental Science & Technology
707 708
Figure 1.
709
40
ACS Paragon Plus Environment
Environmental Science & Technology
710 711
Figure 2.
712 713 714 715 716 717 718 719 41
ACS Paragon Plus Environment
Page 42 of 45
Page 43 of 45
Environmental Science & Technology
720 721
Figure 3. 42
ACS Paragon Plus Environment
Environmental Science & Technology
722 723
724 725
Figure 4.
43
ACS Paragon Plus Environment
Page 44 of 45
Page 45 of 45
Environmental Science & Technology
726 727
Figure 5.
728
44
ACS Paragon Plus Environment