Chemical Isotope Labeling Exposome (CIL-EXPOSOME): One High

Apr 3, 2019 - Chemical Isotope Labeling Exposome (CIL-EXPOSOME): One High-throughput Platform for Human Urinary Global Exposome Characterization...
0 downloads 0 Views 2MB Size
Subscriber access provided by UNIV OF LOUISIANA

Ecotoxicology and Human Environmental Health

Chemical Isotope Labeling Exposome (CIL-EXPOSOME): One Highthroughput Platform for Human Urinary Global Exposome Characterization Shenglan Jia, Tengfei Xu, Tao Huan, Maria Chong, Min Liu, Wenjuan Fang, and Mingliang Fang Environ. Sci. Technol., Just Accepted Manuscript • DOI: 10.1021/acs.est.9b00285 • Publication Date (Web): 03 Apr 2019 Downloaded from http://pubs.acs.org on April 3, 2019

Just Accepted “Just Accepted” manuscripts have been peer-reviewed and accepted for publication. They are posted online prior to technical editing, formatting for publication and author proofing. The American Chemical Society provides “Just Accepted” as a service to the research community to expedite the dissemination of scientific material as soon as possible after acceptance. “Just Accepted” manuscripts appear in full in PDF format accompanied by an HTML abstract. “Just Accepted” manuscripts have been fully peer reviewed, but should not be considered the official version of record. They are citable by the Digital Object Identifier (DOI®). “Just Accepted” is an optional service offered to authors. Therefore, the “Just Accepted” Web site may not include all articles that will be published in the journal. After a manuscript is technically edited and formatted, it will be removed from the “Just Accepted” Web site and published as an ASAP article. Note that technical editing may introduce minor changes to the manuscript text and/or graphics which could affect content, and all legal disclaimers and ethical guidelines that apply to the journal pertain. ACS cannot be held responsible for errors or consequences arising from the use of information contained in these “Just Accepted” manuscripts.

is published by the American Chemical Society. 1155 Sixteenth Street N.W., Washington, DC 20036 Published by American Chemical Society. Copyright © American Chemical Society. However, no copyright claim is made to original U.S. Government works, or works produced by employees of any Commonwealth realm Crown government in the course of their duties.

Page 1 of 27

Environmental Science & Technology

1

Chemical Isotope Labeling Exposome (CIL-EXPOSOME): One High-throughput

2

Platform for Human Urinary Global Exposome Characterization

3

Shenglan Jia1, 2, 3; Tengfei Xu1, 3; Tao Huan4; Maria Chong1; Min Liu1, 2; Wenjuan Fang2;

4

Mingliang Fang1, 3, 5*

5 6

1

7

Nanyang Avenue, Singapore 639798

8

2

9

Institute, Nanyang Technological University, 1 Cleantech Loop, CleanTech One, Singapore

School of Civil and Environmental Engineering, Nanyang Technological University, 50

Residues and Resource Reclamation Centre, Nanyang Environment and Water Research

10

637141

11

3

12

Institute, Nanyang Technological University, Singapore 637141

13

4

14

Mall, Vancouver, BC Canada V6T 1Z1

15

5

16

Technological University, 1 Cleantech Loop, CleanTech One, Singapore 637141

Advanced Environmental Biotechnology Centre, Nanyang Environment & Water Research

Department of Chemistry, University of British Columbia, Vancouver Campus, 2036 Main

Analytics Cluster, Nanyang Environment and Water Research Institute, Nanyang

17 18

Address for Correspondence

19

Dr. Mingliang FANG, School of Civil and Environmental Engineering, Nanyang

20

Technological University, 50 Nanyang Avenue, Singapore 639798.

21

TEL: +65 6790 5331;

22

Email: [email protected]

1 ACS Paragon Plus Environment

Environmental Science & Technology

23

TOC

24

2 ACS Paragon Plus Environment

Page 2 of 27

Page 3 of 27

Environmental Science & Technology

25

ABSTRACT: Human exposure to hundreds of chemicals, a primary component of the

26

exposome, has been associated with many diseases. Urinary biomarkers of these chemicals are

27

commonly monitored to quantify their exposure. However, due to low concentrations and the

28

great variability in physicochemical properties of exposure biomarkers, exposome research has

29

been limited by low-throughput and costly methods. Here, we developed a sensitive and high-

30

throughput exposome analytical platform (CIL-EXPOSOME) by isotopically-labeling urinary

31

biomarkers with common functional groups (hydroxyl/carboxyl/primary amine). After a

32

simple cleanup, we used mass spectrometry to perform a screening for both targeted and

33

untargeted biomarkers, which was further processed by an automatic computational pipeline

34

method for qualification and quantification. This platform has effectively introduced an isotope

35

tag for the absolute quantification of biomarkers and has improved sensitivity of 2-1,184 fold

36

compared to existing methods. For putative identification, we built a database of 818 urinary

37

biomarkers with MS/MS fragmentation information from either standards or in silico

38

predictions. Using this platform, we have found 671 urinary exposure biomarker candidates

39

from a 2 mL pooled urine sample. The exposome data acquisition and analysis time has also

40

been greatly shortened. The results showed that CIL-EXPOSOME is a useful tool for global

41

exposome analysis of complex samples.

3 ACS Paragon Plus Environment

Environmental Science & Technology

42

INTRODUCTION

43

The concept of the “exposome”, which is a measure of the effects of life-long environmental

44

exposure on health, has been a hot topic since its introduction in 2005.1 Recent initiatives in

45

environmental exposome research have attempted to map the complex relationships between

46

biomarkers and disease risks.2-4 Therefore, measurable and characteristic changes of all

47

biomarkers taken together form the key foundation of exposome research.5 Exposure

48

biomarkers possess enormous structural diversity and have a broad concentration ranges (from

49

10-7 μM to 1 mM for environmental pollutants).6, 7 Meanwhile, we are constantly exposed to

50

an array of ever-increasing chemicals, both endogenous and exogenous, from our diet, the

51

environment and our behavior.8 As such, profiling a large number of exposure biomarkers

52

requires powerful analytical techniques with broad coverage and high sensitivity.

53

Current exposure analytical methods include both traditional targeted measurement and

54

untargeted discovery of chemicals with biological importance (mainly endogenous

55

metabolites).9-11 For targeted appraoches, high sensitivity, wide dynamic range, and good

56

reproducibility are achieved with complex cleanup steps and expensive chemical isotope-

57

labeled (CIL) analogues.12 Moreover, the targeted method can only measure several biomarkers

58

at one time and requires large volumes of sample. Untargeted approaches have primarily

59

evolved from global metabolomics methods, by combining advanced analytical and

60

computational tools.13 Untargeted metabolomics retain a broad coverage of analytes by

61

sacrificing sensitivity, making the approach inefficient at measuring low concentration

62

environmental chemicals.5,

63

throughput platform, capable of capturing both endogenous and low level exogenous exposure

64

biomarkers.

14

Therefore, there is a great demand for a sensitive and high-

65

Among all biological samples, urine samples have a great potential to be used in long-term

66

exposome characterization due to their non-invasive and easy collection.15 Most environmental 4 ACS Paragon Plus Environment

Page 4 of 27

Page 5 of 27

Environmental Science & Technology

67

chemical exposure biomarkers are excreted and can be found in the urine following oxidation,

68

reduction, or hydrolysis and further phase II conjugations.16 However, high concentrations of

69

salts and the diversity of chemical compounds present, make the analysis of urinary exposure

70

biomarkers across a wide range of concentrations challenging.

71

To address this analytical challenge and establish a sensitive, high-throughput exposome

72

platform, we applied a chemical-derivatization method for exposome profiling in urinary

73

samples. Chemical isotope labeling combined with liquid chromatography-mass spectrometry

74

(CIL-LC-MS) analysis was used to enhance the detection of both endogenous and exogenous

75

compounds, simplify both data acquisition and processing during exposome analysis. CIL-LC-

76

MS could significantly improve chemical detection and quantification of endogenous

77

metabolites, with sensitivity increases of 10-1,000-fold observed in previous studies.17-19

78

Furthermore, isotope forms of the derivatization reagents can be used as an economical internal

79

standard for absolute quantification.20,

80

subgroups based on their functional moieties (i.e., carboxyl, carbonyl, and amine etc.), different

81

labelling chemistries can be used to profile different sub-metabolomes in order to provide better

82

coverage of the entire compound range of interest.22, 23 For most exogenous biomarkers in the

83

urinary exposome, either the parent compound or metabolites commonly contain hydroxyl or

84

carboxylic acid functional groups to enhance their aqueous solubility, making the CIL method

85

promising for this application.24

21

After categorizing the metabolites into several

86

Here, we present a urinary exposome profiling workflow that employs CIL and automatic

87

data processing to examine hydroxyl/primary amine and carboxyl biomarkers originating from

88

both endogenous and exogenous chemicals. This approach provides a solution to several

89

challenges currently limiting exposure biomarker profiling. First, CIL methods were used to

90

increase sensitivity and greatly improve the coverage of exposure biomarkers. Second, an

91

automatic data processing was built up to confirm potential exposure biomarkers, enabling fast

5 ACS Paragon Plus Environment

Environmental Science & Technology

Page 6 of 27

92

screening and compound identification. Finally, the method’s performance was validated in a

93

pooled human urine sample, demonstrating that the CIL-EXPOSOME workflow is a promising

94

tool for future global exposome analysis.

95

MATERIALS AND METHODS

96

Chemicals and Stock Preparation. All chemicals and reagents were purchased from

97

Sigma-Aldrich Singapore (Singapore) except those otherwise noted. The labelling reagent, p-

98

dimethylaminophenacyl bromide (DmPA), was purchased from Apollo Scientific Ltd

99

(Stockport, UK). The isotope reagents,

13C

2-danysl

chloride (DNS) and

13C -p2 13C

100

dimethylaminophenacyl bromide, were purchased from TMIC (Edmonton, Canada).

101

BPA,

102

Isotope Laboratories (Andover, MA, USA). HPLC grade water, ethyl acetate, and acetonitrile

103

(ACN) were purchased from Thermo Fisher Scientific (Singapore).

13C

6-monobenzyl

phthalate and

13C

4-methyl

12-

paraben were purchased from Cambridge

104

Standard Solution Preparation. Twenty-two hydroxyl- or carboxyl- containing standards

105

were chosen to optimize the analytical method. Additional information is provided in Table

106

S1. Each compound dissolved in ACN at a concentration of 10 mg/mL. Seven-level mixed

107

standard working solutions (10, 1, 0.1, 0.01, 0.001, 0.0001, 0.00001 µg/mL), also serving as

108

calibration standards, were prepared with ACN through serial dilution of stock solutions for

109

method validation. In addition, three-level quality control (QC) working solutions were

110

prepared in a similar manner to provide low (0.0001 µg/mL), medium (0.01 µg/mL), and high

111

(1 µg/mL) level controls. All stock and working solutions were stocked at −40°C, and diluted

112

with ACN to the desired level before use.

113

Urine Sample Collection and Preparation. A pooled human urine sample was prepared

114

by mixing first morning voids of 35 volunteers (healthy, age: 22 - 37) which were immediately

115

frozen at −20°C after sampling. Informed written consent was obtained from all participants.

116

Institutional Review Board approval for human specimen analysis were obtained from 6 ACS Paragon Plus Environment

Page 7 of 27

Environmental Science & Technology

117

Singapore (IRB-2017-02-023). The pooled urine sample was buffered to pH 5.5 and treated

118

with an enzyme mixture (β-glucuronidase/sulfatase) for 12 h at 37°C. After centrifugation (10

119

min, 16000 g, 4°C), the urine was processed with labelling reagents for method development.

120

Spiking experiments were performed by adding a mixed working solution of the selected

121

compounds or surrogate standards to the pooled samples before sample preparation.

122

CIL-EXPOSOME Procedure.Work flow. In this study, two CIL methods were used to profile

123

hydroxyl/primary amine/ carboxyl endogenous and exogenous biomarkers. Figure 1 illustrates

124

the overall workflow of the method and data processing design with following steps: (1)

125

dansylation or p-dimethylaminophenacyl bromide (DmPA) labeling of the de-conjugated urine

126

sample; (2) equal amounts of 12C-/13C- dansylation/DmPA were mixed with labeled samples;

127

(3) liquid-liquid extraction was performed to enrich the labeled biomarkers from above

128

complex mixtures; (4) raw LC-qTOF analysis data were imported into IsoMS (Test S1) to get

129

12C-/13C-

130

database under mass accuracy tolerance (e.g., < 20 ppm); (6) the resulting information

131

(biomarker candidate name, mass tolerance, and m/z) were exported; and (7) the exported

132

biomarker candidates were processed and putatively identified with standard or in silico

133

predicted MS/MS fragmentation pattern.

peak pairs; (5) the exported paired suspect biomarkers were mass matched to the

134

7 ACS Paragon Plus Environment

Environmental Science & Technology

135 136

Figure 1. Workflow for the determination and quantitation of hydroxyl/primary amine and

137

carboxyl exposure biomarkers in human urine by the CIL-EXPOSOME platform. Following

138

derivatization and sample processing, the LC-qTOF raw data were imported into the IsoMS,

139

and suspect exposure biomarkers were searched against the compounds in the CIL-

140

EXPOSOME database under different mass accuracy tolerances. The resulting information

141

(name, mass tolerance, accurate mass and m/z light) was exported. The exposure biomarker

142

candidates were further confirmed with MS/MS (i.e., MS2) fragmentation.

143

Reaction method optimization. To optimize the derivatization conditions, three-level quality

144

control (QC) mixed standard solutions reflecting human relevant levels were utilized. The

145

mixtures were allowed to react under different concentrations of derivatization reagent,

146

preparation times of derivatization reagent, ACN/H2O percentages, reaction temperatures and

147

incubation times individually. A detailed description of the labelling reaction can be found in

148

Text S2.

149

LC-qTOF Profiling and MS/MS fragmentation acquisition. Standard and urine analyses

150

were performed using a high-performance liquid chromatography (HPLC) system (1200 series,

8 ACS Paragon Plus Environment

Page 8 of 27

Page 9 of 27

Environmental Science & Technology

151

Agilent Technologies) coupled to an Agilent 6550 Quadrupole Time-of-flight (qTOF) mass

152

spectrometer (Agilent, Singapore). Samples were injected for reversed-phase analysis in

153

electrospray ionization (ESI) positive and negative modes, as detailed in Text S3. The

154

instrument was set to acquire over a mass to charge ratio (m/z) range from 50 to 1700 with the

155

MS acquisition rate of 2 Hz. For the acquisition of MS/MS spectra of selected precursors, the

156

default isolation width was set to 1.3 Da with MS and MS/MS acquisition rates of 4 Hz. The

157

collision energy was set to 20 - 40 eV. Data were processed by using the CIL-EXPOSOME

158

Data Processing approach as described in the results.

159

CIL-EXPOSOME Method Validation.

160

Linearity and Sensitivity. Calibration was conducted based on the analysis of derivatized 7-

161

level labelled mix standard working solutions. Calibration curves were established by using

162

linear regression. The LOD and LOQ were estimated as 3 and 10 times of the standard

163

deviation calculated from 6 replicates at the lowest spiking level, respectively.

164

Precision and Accuracy. Three-level derivatization standard quality control (DQC) working

165

solutions were prepared to provide low, medium, and high scenarios of human biomarker

166

concentration. Intra- and inter-day precision and accuracy were assessed by using relative

167

standard deviation (RSD) within a day (n = 6) and on three consecutive days (n = 9);

168

respectively.

169

Stability and Cleanup Efficiency. Sample preparation stability of all derivatives was

170

evaluated by analysis of the DQC solutions at 48 h (28°C) and −40°C at 30-day intervals.

171

Pooled urine samples spiked with DQC solutions were prepared to evaluate the cleanup

172

efficiency, which was defined as the response ratio of extracted concentrations from a spiked

173

sample to the same concentrations of DQC. Each measurement was performed in triplicate.

174

Matrix Effect. In the absence of a biomarker-free urine matrix, a pooled urine sample was

175

processed as described in Urine Sample Collection and Preparation, and divided into two parts 9 ACS Paragon Plus Environment

Environmental Science & Technology

Page 10 of 27

176

(A and B). Part A was dried to prepare the urine matrix, and an equal volume of DQC solutions

177

were added to estimate the influence of the matrix effect. Part B was analyzed directly to

178

determine the background concentration of each analyte. The matrix factor of each compound

179

was calculated by difference between Part A and DQC relative to the response of DQC in pure

180

solvent. To further distinguish the matrix effect from derivatization efficiency,

181

monobenzyl phthalate and 13C4-methyl paraben were added before derivatization to examine

182

the derivatization efficiencies on the two types of labeling. Labelled 13C12-BPA was dansylated

183

and added before injection as a recovery standard to assess the matrix effect.

13C

6-

184

Statistical Analyses and Quality Assurance and Quality Control (QA/QC). One-way

185

ANOVA and Tukey's post-hoc test were used to compare the response difference during

186

optimization. Procedure MiliQ-water blanks were prepared to evaluate contamination arising

187

from laboratory materials and solvents. An ACN blank was injected after 6 samples, and a

188

calibration check standard was injected after every 12 samples.

189

RESULTS AND DISCUSSION

190

CIL-EXPOSOME Chemical Derivatization Method Development

191

Derivatization reagent selection. One of the important considerations in developing labeling

192

chemistry is how to tag the urinary biomarkers with differential isotopic groups in robust way.

193

In this study, we evaluated the functional groups of urinary biomarkers from a recent published

194

comprehensive database (Exposome Explorer).25 Together, biomarkers with hydroxyl, primary

195

amine and carboxylic functional groups consist of > 70% of the total database. Hydroxyl

196

compounds and primary amines are preferentially labeled with dansylation methods.18, 26, 27

197

The derivatization of carboxylic acids can be conducted with a variety of chemical reactions

198

for analytical applications and phenacyl bromide has been used to label the acids for improved

199

HPLC and UV detection.28 However, a new reagent, DmPA, was designed that allows for the

200

introduction of a mass tag and concurrent improvement during LC-MS analysis.19 Here, we 10 ACS Paragon Plus Environment

Page 11 of 27

Environmental Science & Technology

201

used dansyl chloride (DnsCl) and DmPA to label hydroxyl/primary amine and carboxyl urinary

202

biomarkers, respectively.

203

Reaction Condition Optimization. Although the stable isotope labeling method has been

204

previously optimized for the endogenous metabolites,29-31 the derivatization efficiency of high

205

concentration endogenous (µM to mM level) and low concentration xenobiotic biomarkers

206

(nM or pM level) may vary from each other. Furthermore, compared to endogenous metabolites,

207

the chemical structure of xenobiotics and their biotransformation products are more diverse.

208

To optimize the labeling conditions for urinary biomarkers, MiliQ water and urine spiked with

209

three-level QC working solutions were labeled under different reaction conditions. Figure 2 (a,

210

b) shows the effects of reaction solvent, time, temperature, DnsCl/compound mole ratio, and

211

DnsCl preparation time (from left to right) on the efficiency of standard dansylation. For the

212

pure chemical optimization, results indicate that the effect of water in the incubation solvent

213

on the labeling efficiency is compound dependent. After the optimization of all the tested

214

compounds, ACN/H2O (1:1, v/v) was selected as the reaction solvent for dansylation. The

215

reaction temperature and time also had an effect on the labeling efficiency. We found that the

216

optimal conditions were 40 min at 40°C, which differs from the previous method for

217

endogenous metabolites.18 The optimal mole ratio of compound to DnsCl was 1:105. The

218

optimization results for urine were the same as those for the MiliQ water, except that the mole

219

ratio of compound to DnsCl increased to 1:106.

220

Under both MiliQ water and urine background, 80% ACN/H2O was selected as the reaction

221

solvent for DmPA reaction, and the optimized reaction temperature and time was found to be

222

90°C for 60 min (Figure S1 (a, b)). The mole ratio of compound to DmPA was optimal at 1:104

223

for MiliQ water and 1:105 for urine.

11 ACS Paragon Plus Environment

Environmental Science & Technology

224 225

Figure 2. Comparison of labeling efficiencies for 15 hydroxyl standards (a) spiked in MiliQ

226

water and (b) spiked in urine under different labeling reaction conditions. From left to right:

227

reaction time, reaction temperature, incubation solvent, DnsCl preparation time, and

228

compound:DnsCl mole ratio; (c) Comparison of cleanup method optimization for standards

229

under different cleanup procedures. All data in (a) (b) (c) are presented as the mean from three

230

separate experiments (n = 3) of three levels of QC working solution (0.1, 10, and 1000 ppb);

231

Extracted ion chromatograms of labelled standard mixture (d) after derivatization and (e)

232

before derivatization (10 ppb each). The compounds are shown as follows: 1, Methylparaben;

233

2, Ethylparaben; 3, 4-Bromophenol; 4, 1-Naphthol; 5, Benzylparaben; 6, 2-Hydroxyfluorene; 12 ACS Paragon Plus Environment

Page 12 of 27

Page 13 of 27

Environmental Science & Technology

234

7, Butylparaben; 8, 9-Phenanthrol; 9, Bisphenol S; 10, Benzoresorcinol; 11, 1-Hydroxypyrene;

235

12, Triclosan; 13, Bisphenol A; 14, Genistein; 15, 3,5-DHPPA.

236

Cleanup Method Optimization. A proper cleanup method is crucial to minimizing matrix

237

effects. In the present study, we compared the recoveries of a few pre-treatment methods: (1)

238

extraction of the free biomarkers from MiliQ water followed by derivatization (LLE-Label);

239

(2) biomarker derivatization in MiliQ water followed by extraction (Label-LLE, Label-SPE);

240

(3) extraction of free biomarkers from urine followed by derivatization (urine LLE-Label) and

241

(4) biomarker derivatization in urine followed by extraction (urine Label-LLE). As shown in

242

Figure 2 (c), derivatized standards were more efficiently extracted by the liquid-liquid

243

extraction (LLE) strategy than free standards in both MiliQ water and urine. This is probably

244

because the derivatization step increases the hydrophobicity of hydroxyl, amines and

245

carboxylic acids in the urine sample. As a result, the chemicals are more likely to partition to

246

the organic phase.17 Sample processing was also streamlined by combining DnsCl- and DmPA-

247

derivatized samples prior to extraction. Finally, the optimized cleanup in this study was

248

simplified with a single liquid-liquid solvent extraction, which was enabled by the use of the

249

non-polar derivatization reagent. Details are as follows:

250

DnsCl/DmPA labeled urine samples were combined to profile hydroxyl/primary amine and

251

carboxyl biomarkers; (2) saturated NaCl solution (v/v, 1:5) was added to the mixture, then

252

extracted with ethyl acetate twice (v/v, 1:3); (3) the extracted solutions were combined and

253

dried, followed by reconstitution in ACN prior to LC-qTOF injection.

(1) equal amounts

12C-/13C

2-

254

Method Performance Validation.

255

Sensitivity and Separation Improvement. After derivatization, both the sensitivity and

256

retention of all the tested compounds were greatly improved compared with their non-

257

derivatized counterparts (Figure 2 d, e). A similar finding was observed for the urine samples

258

(Figure S2). The results showed that the detection sensitivities of these standard compounds 13 ACS Paragon Plus Environment

Environmental Science & Technology

259

generally increased by 2-1,184 fold upon chemical labeling (Table S2). Before derivatization,

260

only three of the free form hydroxyl compounds (Bisphenol S, Benzoresorcinol and Genistein)

261

were detected at the concentration of 10 ppb. Chemical labeling will not only efficiently

262

improve the detection sensitivities, but will also lead to the discovery of more low-abundance

263

biomarkers. The chromatographic separation also showed significant improvement after

264

derivatization. Due to the diverse chemical structures and physicochemical properties among

265

biomarkers, separation of these compounds in a complex urine sample is not readily

266

accomplished with a single LC method. Derivatization can overcome this difficulty by

267

introducing a hydrophobic functional group to the analytes. Following derivatization, sufficient

268

separation can be accomplished by using simple reverse phase chromatography. By

269

chromatographic optimization, the derivatization methods greatly reduced the analysis time in

270

the traditional targeted analysis, especially for polar biomarkers that are not retained by regular

271

reverse-phase chromatography.

272

Analytical figures of merit. We estimated a series of attributes of the proposed quantification

273

method, such as calibration range, linearity, sensitivity, precision, accuracy, stability,

274

extraction efficiency, and matrix effect. The linearities of calibration curves for all analytes

275

were excellent among the validated concentration range of 100 ppt to 1 ppm, with

276

determination coefficients (R2) greater than 0.99. Due to the diversity of biomarker

277

concentrations, we considered two injection volumes (1 and 10 µL) to avoid response

278

saturation. As shown in Table S2, all analytes had an LOQ lower than 1 ppb. The analytical

279

method was shown to be highly reproducible, with all of the relative standard deviations (RSDs)

280

less than 20% for both intra-day and inter-day variations (Table S3). The stability of all

281

derivatized standards had coefficients of variation (CVs) that were ≤ 18% and ≤ 16% for

282

storage at 28°C for 48 h or at −20°C for 30 days, respectively (Table S4). Extraction

283

efficiencies of three-level DQC using the above optimized cleanup method were approximately

14 ACS Paragon Plus Environment

Page 14 of 27

Page 15 of 27

Environmental Science & Technology

284

73 - 92% (Table S5). The interference of the matrix with the analytes was within 40% (Table

285

S5).

286

CIL-EXPOSOME Biomarker Database. To further facilitate the identification of urinary

287

biomarkers, we established a urinary CIL-EXPOSOME Biomarker Database. Each entry in the

288

exposure database included the chemical name, accurate molecular weight, exposure source,

289

chemical class, predicted exact m/z after being labelled, MS/MS fragmentation, and a detailed

290

information link (HMDB, KEGG, or PUBCHEM). To create a broad exposome database, the

291

CIL-EXPOSOME Biomarker Database was complied from a few existing data sources:

292

Exposome-Explorer database,25 Human Metabolome Database (HMDB),32 the U.S. National

293

Health and Nutrition Examination Survey (NHANES),33 and MyCompoundID database.34 The

294

MS/MS fragmentation of urinary biomarkers was generated either using authentic standards or

295

predicted using CFM-ID (see below). As a result, the CIL-EXPOSOME Biomarker Database

296

that was established (Table S6) included 818 reported hydroxyl/primary amine and carboxyl

297

biomarkers from food, drugs, endogenous metabolites, plasticizers, personal care products

298

(PPCPs), polybrominated diphenyl ethers (PBDEs), polycyclic aromatic hydrocarbons (PAHs),

299

polychlorinated biphenyls (PCBs), phthalates, pesticides, and flame retardants (excluding

300

PBDEs) (Figure 3).

15 ACS Paragon Plus Environment

Environmental Science & Technology

301 302

Figure 3. Composition of CIL-EXPOSOME Biomarker Database: number of compounds (n =

303

818) by chemical classes.

304

CIL-EXPOSOME MS/MS fragmentation pattern and database development. MS/MS

305

fragmentation provides an important criteria for compound identification. In the present study,

306

we created a MS/MS fragmentation library of ~100 xenobiotics and their biotransformation

307

products by conducting dissociation at different collision energies (0, 10, 20, 30 eV). Together

308

with ~ 250 MS/MS spectra of endogenous metabolites from mycompound ID,35 we have

309

generated a MS/MS database comprised of ~350 authentic standards. For the rest of compounds

310

in the database, in silico MS/MS information was predicted using a machine-learning method

311

(CFM-ID).36 The database is available to the public upon request via Google Drive.

312

We further investigated the MS/MS fragmentation behavior of compounds labeled by two

313

reagents (DnsCl and DmPA). As shown in Figure 4, the expected precursor ions were m/z

314

378.1/380.1 for DNS/13C2-DNS-labeled naphthol and m/z 418.1/420.1 for DmPA/13C2-DmPA-

315

labeled mono-benzyl phthalate. The DNS/13C2-DNS-labeled hydroxyl compounds generated

316

two characteristic product ions, 171.1/173.1 and 235.0/237.0, by neutral loss from the precursor

317

ion, [M + H]+. For naphthol, another characteristic product ion was formed, [M + H]+ 207.0,

16 ACS Paragon Plus Environment

Page 16 of 27

Page 17 of 27

Environmental Science & Technology

318

which is the parent ion of [M + H]+ 171.1/173.1 (Figure 4a and 4b). The DmPA/13C2-DmPA-

319

labeled carboxylic acids typically lose the common derivatization group (m/z 149.0/151.0).

320

The resulting product ion [M + H]+ −270.1 for mono-benzyl phthalate was formed (Figure 4c

321

and 4d). Overall, we have observed m/z 171.1 and 235.0 as two typical fragments for

322

dansylation, and m/z 149.0 as a typical fragment for the DmPA label. The molecular MS

323

fragment feature of the targeted compounds can also be observed in other MS/MS

324

fragmentation patterns. The use of the selected paired-peaks after chemical labeling as the

325

targeted precursor ions facilitates subsequent MS/MS analysis. In addition, the characteristic

326

fragmentation of metabolites after chemical labeling make identification feasible. This strategy

327

can narrow down the candidate compounds from fragmentation ions. [M+H]+ - 207.0

( a( 100

100 N

O O S O

m/z 171.1

75

171.1 [M+H]+

Intensity abundance (%)

Intensity abundance (%)

[M+H]+ - 207.0

( b(

N

m/z 378.1

50 O O S O

25

207.0 0

O S

N

378.1 [M+H]+

O

m/z 235.0 235.0 [M+H]+

200

C13 N N C13

O S O

75

m/z 173.1

50 O O S O

25

207.0 0

300

O O S

173.1 [M+H]+ C13 N 13 C

O S

380.1 [M+H]+

m/z 237.0 237.0 [M+H]+ 300

m/z [M+H]+ - 150.0

[M+H]+ - 148.0

( c(

( d(

O

100

O

O

O O

O O

100

N O

50 O

O

m/z 149.0

O

310.1

149.0 [M+H]+ 100

200

300

O

O O

25

O O O

N

N

0

328

270.1 [M+H]+

75

Intensity abundance (%)

Intensity abundance (%)

O

m/z 270.1

91.0

m/z 380.1

O

200

m/z

C13 N N 13 C

O O S O

m/z 418.1 418.1 [M+H]+

270.1 [M+H]+

75

C13 N 13 C O

O

O

25

O O O

C13 N 13 C

50

0

400

m/z 270.1

91.0

m/z 151.0

m/z 420.1

151.0 [M+H]+

420.1 [M+H]+

100

m/z

200

300

400

m/z

329

Figure 4. Fragmentation pattern of chemically labeled products. (a) DNS-labeled hydroxyl

330

compound; (b) 13C2-DNS-labeled hydroxyl compound; (c) DmPA-labeled carboxylic acid; (d)

17 ACS Paragon Plus Environment

Environmental Science & Technology

331

13C

332

labelled m/z, respectively.

2-DmPA-labeled

carboxylic acid. Highlighted in blue and green are the theoretical and

333

CIL-EXPOSOME Automatic Profiling Workflow. In addition to working as an internal

334

standard, 13C reagent labelled biomarkers can play a unique role in untargeted profiling and

335

identification. Here, the obtained full scan (MS) raw data was processed by IsoMS to extract

336

12C/13C

337

difference (2.0067 Da for one light/heavy labeled metabolites), tolerance window (e.g., < 20

338

ppm), and intensities ratios (0.80 to 1.20) were extracted from the spectra. Information on

339

detected compounds, including accurate m/z, retention time (t), and peak intensity, was

340

generated for each exposure biomarker candidate (Table S7).

peak pairs from mass spectra. The paired-peaks that were with a defined mass

341

We further designed a MATLAB program to automatically search the exposure biomarker

342

candidate peak pairs against the CIL-EXPOSOME Biomarker Database; the detailed manual

343

can be found in Google Drive. Based on the IsoMS preselected mass list, the prospective

344

candidates of each exposure biomarker were output with mass tolerance, name, database ID,

345

accurate mass and labeled accurate mass (Figure S3). Users can select different values to use

346

for mass tolerance. For the identification, targeted MS/MS fragmentation of the potential

347

biomarker was generated with a qTOF mass spectrometer. QTOF parameters included and

348

width of 1.3 m/z units and a collision energy of 20 V. The obtained MS/MS fragmentation

349

spectra was then further matched with prospective candidates in the established MS/MS library.

350

Additionally, compounds were excluded that did not meet the following specifications: (1)

351

targeted MS/MS of the potential biomarker included 171.1/173.1 or 149.0/151.0 in the native

352

and isotope labelled compounds; and/or (2) the MS/MS fragment matched with the precursor

353

compounds.

18 ACS Paragon Plus Environment

Page 18 of 27

Page 19 of 27

Environmental Science & Technology

7

8.0x105

8 9

( a( 12C

75

Intensity

4.0x10

3

labeled

2

1

6

5

4

7

0.0

10

7

-4.0x105

13C

12

11

13

15 14

labeled

7

-8.0x105

0

5

10

15

Retention time (min) 4-Ethylphenol (peak 11)

7575

( b( 12C

13C

labeled

labeled

2525

75

0

340

360

m/z 469.2

m/z 465.2

5050

12C

13C

labeled

m/z

00

380

440

460

m/z

m/z 13

N

171.1

0 100 100 3

173.1

100

6

25

Intensity (%)

235.0

050 50

100

2525

200 356.1

122.1

00

100

25

200

300

237.0

300 122.1

m/z

0 400

500 100

N

C

400

149.0

N

358.1

m/z

700 300

0 400

m/z

270.1

100500

1.2x104 1.0x104

6.0x107 ( d(

C

C N O

O

100

200600

O

m/z

O75

50

465.2

300700

O

O

O

25

600 200

500 13

O

75

50

480

13

O

9 O S O O

50

13

N

100

time (min) O S Retention O 75 O

7575

C

Intensity (%)

50

labeled

2525

m/z

1x1060 320

( c(

7575

Intensity (%)

Intensity (%)

m/z 358.1

m/z 356.1

6 50 2x1050

Intensity (%)

O O

O

151.0

N

270.1

469.2

25

0 500

100 600

200 700

300 800

500

600

700

m/z

( e(

2.0x103

Con. (ng/mL)

2.0x107

0.0 0.0

0.0

4.0x103

8.0x103

1.2x104

1.6x104

A or B P M es o S et hy rcin o lp ar l ab e n 4- Tri Br c l o om s a op n he N nol ap h 3, 5- tho D H l PP G en A is te in

4.0x10

7

BP

Intensity abundance (%) Intensity

100

Abundance (CIL-EXPOSOME)

Trans, trans-Muconic Acid (peak 10) 100 100

Intensity (%)

100 100

nz

Abundance (LC-QQQ)

Be

354 355

Figure 5. Chemical isotope-labeled urine sample analyzed by the CIL-EXPOSOME platform.

356

Extracted ion chromatograms of (a) exposure biomarker examples from ~671 potential

357

hydroxyl/primary amine and carboxylic acid metabolites from combined DNS/13C2-DNS and

19 ACS Paragon Plus Environment

800

Environmental Science & Technology

358

DmPA/13C2-DmPA labeled urine sample. Fifteen exposure biomarkers have been confirmed

359

with authentic standards, namely, 1, Methylparaben; 2, 4-Bromophenol; 3, Naphthol; 4, BPS;

360

5 Benzoresorcinol; 6, Triclosan; 7, BPA; 8, Genistein; 9, 3,5-DHPPA; 10, Trans, trans-

361

Muconic Acid; 11, 4-Ethylphenol; 12, 2,4-Di-tert-butylphenol; 13, 2-Methylphenol; 14, 2-

362

Isopropoxyphenol; and 15, 4-Propoxyphenol; an example of MS/MS from the newly identified

363

(b) hydroxyl compound 4-ethylphenol and (c) carboxylic acid Trans, trans-muconic acid; (d)

364

correlation of BPA abundance between LC-QQQ and the CIL-EXPOSOME platform (R2=0.98,

365

p