Subscriber access provided by READING UNIV
Article
Reconstructing Biosynthetic Pathway of the Plant-Derived Cancer Chemopreventive-Precursor Glucoraphanin in Escherichia coli Han Yang, Feixia Liu, Yin Li, and Bo Yu ACS Synth. Biol., Just Accepted Manuscript • DOI: 10.1021/acssynbio.7b00256 • Publication Date (Web): 17 Nov 2017 Downloaded from http://pubs.acs.org on November 19, 2017
Just Accepted “Just Accepted” manuscripts have been peer-reviewed and accepted for publication. They are posted online prior to technical editing, formatting for publication and author proofing. The American Chemical Society provides “Just Accepted” as a free service to the research community to expedite the dissemination of scientific material as soon as possible after acceptance. “Just Accepted” manuscripts appear in full in PDF format accompanied by an HTML abstract. “Just Accepted” manuscripts have been fully peer reviewed, but should not be considered the official version of record. They are accessible to all readers and citable by the Digital Object Identifier (DOI®). “Just Accepted” is an optional service offered to authors. Therefore, the “Just Accepted” Web site may not include all articles that will be published in the journal. After a manuscript is technically edited and formatted, it will be removed from the “Just Accepted” Web site and published as an ASAP article. Note that technical editing may introduce minor changes to the manuscript text and/or graphics which could affect content, and all legal disclaimers and ethical guidelines that apply to the journal pertain. ACS cannot be held responsible for errors or consequences arising from the use of information contained in these “Just Accepted” manuscripts.
ACS Synthetic Biology is published by the American Chemical Society. 1155 Sixteenth Street N.W., Washington, DC 20036 Published by American Chemical Society. Copyright © American Chemical Society. However, no copyright claim is made to original U.S. Government works, or works produced by employees of any Commonwealth realm Crown government in the course of their duties.
Page 1 of 42
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
1
Revision of sb-2017-00256v.R2
2 3 4
Reconstructing Biosynthetic Pathway of the Plant-Derived Cancer
5
Chemopreventive-Precursor Glucoraphanin in Escherichia coli
6 7
Han Yanga,b E-mail:
[email protected] 8
Feixia Liua E-mail:
[email protected] 9
Yin Lia,* E-mail:
[email protected] 10
Bo Yua,* E-mail:
[email protected] 11 12
Affiliations:
13
a
14
of Microbiology, Chinese Academy of Sciences, Beijing 100101, China
15
b
CAS Key Laboratory of Microbial Physiological and Metabolic Engineering, Institute
University of Chinese Academy of Sciences, Beijing 100049, China
16 17 18 19
* Corresponding authors.
20
Phone/Fax: +86-10-64806132
1
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
21
Abstract
22
Epidemiological data confirmed a strong correlation between regular consumption of
23
cruciferous vegetables and lower cancer risk. This cancer preventive property is mainly
24
attributed to the glucosinolate products, such as glucoraphanin found in broccoli that is
25
derived from methionine. Here we report the first successful reconstruction of the
26
complete biosynthetic pathway of glucoraphanin from methionine in Escherichia coli
27
via gene selection, pathway design, and protein engineering. We used branched-chain
28
amino transferase 3 to catalyze two transamination steps to ensure the purity of
29
precursor molecules and used cysteine as a sulfur donor to simplify the synthesis
30
pathway. Two chimeric cytochrome P450 enzymes were engineered and expressed in
31
E. coli functionally. The original plant C-S lyase was replaced by the Neurospora
32
crassa hercynylcysteine sulfoxide lyase. Other pathway enzymes were successfully
33
mined from Arabidopsis thaliana, Brassica rapa, and Brassica oleracea. Biosynthesis
34
of glucoraphanin upon co-expression of the optimized enzymes in vivo was confirmed
35
by liquid chromatography-tandem mass spectrometry analysis. No other glucosinolate
36
analogs (except for glucoiberin) were identified that could facilitate the downstream
37
purification processes. Production of glucoraphanin in this study laid the foundation for
38
microbial production of such health-beneficial glucosinolates in a large-scale.
39 40
Keywords: glucoraphanin, pathway engineering, protein engineering, microbial
41
synthesis
42
2
ACS Paragon Plus Environment
Page 2 of 42
Page 3 of 42
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
43
Abstract Graphic
44 45
3
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
46
Plants can produce a wide range of secondary metabolites, many of which are
47
valuable pharmaceutical and nutraceutical compounds.1 Chemical synthesis of these
48
secondary metabolites have been a challenge due to the complexity of their structures.
49
Consequently, it is more favorable to obtain them by isolation from plants or
50
semisynthesis.2,3 However, only small quantities of such compounds are generated in
51
their host plants at particular time points during development, which in turn may make
52
it difficult for large-scale production. Over the past two decades, it has become
53
technically feasible to introduce the natural synthetic pathways from plants to
54
microorganisms for production purposes, thus bypassing the above limitations. Some
55
notable examples include the antimalarial compound artemisinic acid, the antioxidant
56
resveratrol, the flavor component vanillin, the universal sesquiterpene precursor,
57
farnesyl diphosphate, and taxadiene (a precursor of the anticancer agent taxol).1
58
Glucosinolates are naturally produced by members of cruciferous vegetables and
59
contribute to the health-beneficial effects.4 In particular, glucoraphanin (GRA), a
60
glucosinolate common among broccoli and Arabidopsis thaliana is associated with
61
reduced risks for cardiovascular diseases and cancer.5 Conversion of GRA upon
62
consumption could be mediated by myrosinases from endogenous plant or gut
63
microorganisms, resulting in the formation of sulforaphane (SFN).6 Studies performed
64
with animals as well as cellular models have indicated that SFN possesses
65
cancer-preventive properties, such as antioxidation, anti-inflammation, induction of
66
apoptosis and cell-cycle arrest.7 Very recent research has confirmed that SFN can also
67
improve glucose control in type-II diabetes patients.8 These findings have drawn
68
considerable attention to the optimization of GRA production for pharmaceutical
69
applications, as well as for use as a dietary supplement.
70
Glucosinolate biosynthesis, which has been extensively studied in Arabidopsis, 4
ACS Paragon Plus Environment
Page 4 of 42
Page 5 of 42
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
71
comprises up to three stages, namely (i) side chain elongation, (ii) core structure
72
formation, and (iii) secondary modifications.1 The three independent stages of the
73
GRA-biosynthesis pathway in plants are shown in Figure 1. First, methionine
74
undergoes a string of reactions that mediate one or more rounds of chain elongation,
75
reminiscent of the conversion from valine to leucine. Elongation of methionine with
76
two methyl groups leads to the formation of dihomo-methionine (DHM), the precursor
77
of GRA. The initial cytosolic transamination process is accomplished by
78
branched-chain amino transferase 4 (BCAT4), resulting in the α-keto acid (αKA).9 The
79
αKA
80
methylthioalkylmalate
81
2-(2-methylthio) ethylmalate derivative, which subsequently undergoes isomerization
82
and decarboxylation. To be specific, the ethylmalate derivative is catalyzed by an
83
isopropylmalate isomerase (IPMI), as well as an isopropylmalate dehydrogenase
84
(IPMDH) to form a chain-elongated αKA.10-12 Two cycles of chain-elongated αKA
85
(2-oxo-6-methylthiohexanoic acid) is further transaminated by BCAT3 to form DHM.
86
Later, DHM enters the second biosynthetic stage where the corresponding
87
glucosinolates were formed. The pathway responsible for core structure formation
88
involves the activities of five cytosolic enzymes. They include cytochrome P450
89
enzymes from the CYP79 and CYP83 families that mediate two monooxygenation
90
reactions of forming corresponding oximes, and then to reactive compounds.4
91
S-alkylthiohydroxamates are formed in a sulfurization step, which may involve either a
92
non-enzymatic step or an enzyme similar to glutathione-S-transferase.13 The other three
93
cytosolic enzymes are responsible for the last three enzymatic steps: (i) a C-S lyase,
94
converting S-alkylthiohydroxamates to thiohydroxamates,14 (ii) a glucosyltransferase,
95
transferring glucose to thiohydroxamates to form desulfoglucosinolates,15 and (iii) a
is
transported
into
chloroplasts
synthase
(MAM)
where with
it
is
condensed
acetyl-CoA
5
ACS Paragon Plus Environment
to
by
produce
the a
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
96
sulfotransferase, adding a sulfate group to produce glucosinolates.16,17 Finally, the
97
product is modified by a flavin monooxygenase to generate GRA.18
98
Although it is feasible to transfer the biosynthetic pathways from plants to
99
microorganisms for production, exactly how to express the proteins functionally is the
100
first challenge to overcome. Some enzymes derived from plants are difficult to express
101
functionally in prokaryotes because they undergo different post-translational
102
modifications and adopt structures distinct from that generated in eukaryotic cells.
103
Previous studies have shown that many efforts are needed to optimize the heterologous
104
production of eukaryotic proteins in microorganisms. For example, expression of the
105
hydrophobic membrane protein P450s in E. coli is especially challenging due to its
106
tendency to form insoluble inclusion bodies.19 Moreover, some enzymes, such as the
107
C-S lyase SUR1 in Arabidopsis, whose function is not redundant, have been found to
108
be very unstable in E. coli.14 Thus, exploring isoenzymes from different sources is
109
needed. In addition, plants produce over 100 glucosinolates and these compounds are
110
usually mixed together owing to the multifunctional and redundant nature of the
111
enzymes, making it necessary to select relatively efficient enzymes and design a
112
pathway to reduce by-product assembly in microorganisms. For example, in the first
113
stage of GRA production, methionine is converted to DHM often with the by-products
114
of homo-methionine (HM), trihomo-methionine, and other derivatives. Establishing
115
how to shift the flux towards maximum DHM production is also highly desirable.
116
In this study, we first selected appropriate enzymes and functionally expressed all
117
necessary enzymes for GRA production in E. coli. Then, we engineered the
118
biosynthetic pathway from methionine to GRA, which was transferred from plants to
119
microbes. After introducing 10 enzymes from different sources, the final product GRA
120
was confirmed by LC-MS/MS analysis. Our results suggest that heterologous 6
ACS Paragon Plus Environment
Page 6 of 42
Page 7 of 42
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
121
production of glucoraphanin in E. coli is a promising way to generate this valuable
122
compound.
123 124
Results and Discussion
125
Selection and expression of enzymes in the side chain-elongation pathway. The
126
side chain-elongation pathway involves several transamination steps.20 The initial
127
transamination step is catalyzed by BCAT4,9 while BCAT3 mediates the terminal
128
chain-elongation steps and can also partially replace the function of BCAT4.21
129
Moreover, BCAT3 only participates in the extension of methionine by adding one or
130
two methyl groups. The enzyme converts 5-methylthio-2-oxopentanoate to HM and
131
6-methylthio-2-oxohexanoate to DHM, whereas less enzymatic activity is measured for
132
the conversion of 4-methylthio-2-oxohexanoate to trihomo-methionine.21 To reduce the
133
formation of byproducts and to simplify the pathway, we selected one branched amino
134
transferase (BCAT3) for the side chain-elongation pathway since only DHM is a
135
precursor for glucoraphanin production. By deleting the potential chloroplast-targeting
136
signal (60 amino acids), the truncated BCAT3 was solubly expressed in E. coli
137
(Supplementary file, Fig. S1a).
138
Glucosinolates in Arabidopsis normally contain a variable side chain with additional
139
one to six methyl groups as a result of chain elongation of methionine derivatives.22
140
GSL-ELONG is one of the critical loci responsible for such variation. It directs the
141
degree of side chain elongation of methionine-derived glucosinolates.23 Previous
142
cloning of the GS-ELONG QTL gene has led to the identification of the MAM1,
143
MAM2, and MAM3 genes.24,25 MAM2 plays an important role in producing aliphatic
144
glucosinolates by adding one methyl group, while MAM1 mediates the production of
145
glucosinolates up to two elongation cycles,24-26 and MAM3 is involved in all aliphatic 7
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
146
glucosinolates production but prefers to add one, five and six methyl groups,
147
respectively.26,27 After transferring the DHM biosynthetic pathway into E. coli with
148
MAM1, the byproduct HM still represented a substantial portion of the total output,28
149
which is not favorable for GRA production. Aliphatic glucosinolates derived from HM
150
are termed “3C” glucosinolates, whereas those derived from DHM are called “4C”
151
glucosinolates.24 In Brassica oleracea, Li et al. confirmed that the 3C and 4C
152
glucosinolates are independently regulated by the GSL-PRO and GSL-ELONG genes.29
153
Therefore, it is possible to channel the glucosinolate pathway with increased content of
154
glucoraphanin by only optimizing the GSL-ELONG expression. Here, we selected
155
GSL-ELONG (encoding methylthioalkylmalate synthase) from B. oleracea to control
156
the condensation reaction for reduced byproducts. The truncated GSL-ELONG gene,
157
without the signal sequence was successfully expressed in E. coli (Supplementary file,
158
Fig. S1b).
159
IPMI is a heterodimer with a large subunit (LSU) and a small subunit (SSU). In
160
Arabidopsis, there are one LSU gene (IPMI LSU1) and three genes for SSU (IPMI
161
SSU1, SSU2, and SSU3).9,30 While the large subunit is responsible for both leucine
162
biosynthesis and methionine chain elongation,31 the three small subunits may specialize
163
in different functions. IPMI SSU2 and IPMI SSU3 function in methionine chain
164
elongation, while IPMI SSU1 is important in leucine biosynthesis.31,32 In Arabidopsis,
165
three IPMDHs are crucial for methionine chain-elongation. IPMDH1 is mainly
166
participated in glucosinolate biosynthesis, whereas both IPMDH2 and IPMDH3 also
167
contribute to leucine biosynthesis.33-35 Mirza et al. introduced IPMI (LSU1 and SSU3
168
subunits) and IPMDH1 into E. coli to successfully construct the DHM-biosynthesis
169
pathway.28 Therefore, we also selected IPMI (LSU1 and SSU3) and IPMDH1 of
170
Arabidopsis for this study. The N-terminal sequences predicted to be responsible for 8
ACS Paragon Plus Environment
Page 8 of 42
Page 9 of 42
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
171
chloroplast localization of the three genes were deleted as previously described.32,33
172
The soluble expression of the modified IPMI (LSU1 and SSU3) and IPMDH1 was
173
confirmed (Supplementary file, Fig. S1c).
174
As illustrated by literatures, several isozymes are involved in each step of the side
175
chain-elongation pathway. These isozymes catalyze similar reactions with subtle
176
difference for preferences in substrates. This phenomenon creates redundant expression,
177
which may be necessary for plants, but it would have led to a heavy metabolic load for
178
microorganisms if we introduce all relevant genes for DHM biosynthesis. Therefore, to
179
reduce the number of by-products formed and lower the burden to microbial strains, we
180
chose BCAT3 (derived from Brassica rapa), GSL-ELONG (derived from B. oleracea),
181
IPMI (LSU1 and SSU3), and IPMDH1 (derived from Arabidopsis thaliana) to catalyze
182
the first steps of glucoraphanin production from methionine to DHM.
183 184
Selection and expression of enzymes in the core structure-formation pathway.
185
CYP79 family cytochrome P450s mediate the conversion of amino acids to oximes
186
during glucosinolate biosynthesis, whereas CYP83-family members are involved in
187
subsequent oxime metabolism. When functioning together, the CYP79F1/F2 and
188
CYP83A1 enzymes result in the production of aliphatic glucosinolates, while the
189
CYP79A2/B2/B3 and CYP83B1 enzymes generate indolic/benzenic glucosinolates.
190
Compared to CYP79F2, which yields exclusively long-chain aliphatic glucosinolates,
191
the CYP79F1 enzyme can catalyze mono- to hexahomo-methionine.20 Considering the
192
specificity of the enzymes, we chose CYP79F1 (B. oleracea) and CYP83A1 (B. rapa)
193
to metabolize short-chain aliphatic amino acids and oximes.
194
Application of the plant P450 enzymes in E. coli is restricted by two major factors,
195
with the first one being the lack of cytochrome P450 reductases (CPRs), which are 9
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
196
responsible for electron transfer in eukaryotes.36 The second one is the absence of
197
compatible membrane-bound sequences.37,38 Modification at the N-terminal of P450s
198
have resulted in successfully expression in prokaryotic bacteria.39 In this study, the
199
modification procedures used for functionally expressing CYP79F1 and CYP83A1 in E.
200
coli were performed as described in our previous report for expressing CYP79A2 and
201
CYP83B1.40 Briefly, the first seven amino acids of CYP79F1 were replaced with a
202
synthetic mammalian N-terminal peptide ε to create CYP79F1[ε:8-540].37 Then, the 3′
203
terminus of the modified CYP79F1[ε:8-540] enzyme was joined to the 5′ terminus of
204
ATR2 [73-711] using an artificial linker (λ) to create a fusion protein (Fig. 2a). The
205
used λ fragment was chosen as GSTSSGSG.41 Sodium dodecyl sulfate-polyacrylamide
206
gel electrophoresis (SDS-PAGE) analysis indicated the successful expression of the
207
chimeric protein in E. coli (Fig. 2b).
208
Cytochrome P450 enzyme CYP83A1 is the second enzyme in the core
209
structure-biosynthesis pathway, converting aldoxime to an aci-nitro compound. We
210
modified CYP83A1 in the same manner as done for CYP79F1. CYP83A1 contains two
211
predicted transmembrane domains, one of which is in the N-terminal region and the
212
other is in the central region of the primary sequence. These putative transmembrane
213
domains increase the risk of failure for protein expression and proper folding.
214
Fortunately, we successfully expressed the chimera enzyme by only modifying the
215
N-terminal transmembrane sequence to generate a fusion protein comprised of
216
CYP83A1[ε:20-501] with a truncated ATR2[73-711] domain (Fig. 2b).
217
Our previous investigation has demonstrated that the product catalyzed by
218
CYP83B1, nitrile oxide, can interact with cysteine spontaneously to generate
219
S-alkylthiohydroxamates for downstream reaction catalyzed by the C-S lyase.40 Thus,
220
in this study, we also used cysteine as sulfur donor in vivo to omit two enzymes 10
ACS Paragon Plus Environment
Page 10 of 42
Page 11 of 42
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
221
(glutathione-S-transferase GSTF and γ-glutamyl peptidase GGP1). The three
222
subsequent steps of glucoraphanin synthesis are mediated by the so-called
223
post-aldoxime enzymes, which are low selective for the side chain but highly selective
224
for the core group.42 SUR1 has been characterized as a C-S lyase in glucosinolate
225
biosynthesis and the function of SUR1 is not redundant in plants.14 Since the
226
recombinant SUR1 protein is quite unstable in heterologous host14, we tested enzymes
227
with similar functions from other sources. In another sulfur-containing molecule in the
228
ergothioneine-biosynthetic pathway of the fungus Neurospora crassa, a novel C-S
229
lyase hercynylcysteine sulfoxide lyase (EGT2) was identified and in vitro
230
reconstitution revealed that it catalyzes a similar reaction with that of SUR1 (Fig. 3).43
231
Thus, this enzyme attracted our attention and the corresponding gene was synthesized
232
after codon optimization and successfully expressed in E. coli in its active form (Fig.
233
3).
234
The next-to last step in the core structure-formation pathway is glucosylation, which
235
catalyzed by a UDP-glucose-thiohydroximate glucosytransferase. UGT74B1 is the
236
major enzyme catalyzing this reaction in vivo.15 The codon optimized UGT74B1 was
237
easily expressed in E. coli, as previously described40 and is shown in Figure 3.
238
Desulfoglucosinolate: PAPS sulfotransferase catalyzes the sulfurization of
239
desulfoglucosinolates during the final step of core structure formation in glucosinolate
240
biosynthesis. Three sulfotransferases with different substrate preferences are present in
241
plants, classified as ST5a, ST5b, and ST5c. ST5a prefers tryptophan- and
242
phenylalanine-derived desulfoglucosinolates, while aliphatic desulfoglucosinolates
243
serve as the preferred substrates of ST5b and ST5c.16,17 Moldrup et al. transformed the
244
genes involved in benzyl glucosinolate synthesis in tobacco plants and successfully
245
enabled them to synthesize benzyl glucosinolate.44 This finding showed that the supply 11
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Page 12 of 42
246
of the co-substrate PAPS was a bottleneck in synthesis, rather than the sulfotransferases.
247
Because the binding affinity of the S donor PAPS is higher for ST5c,17 we chose ST5c
248
as the candidate enzyme in this study and the codon-optimized ST5c gene was
249
successfully expressed in E. coli without other attempts (Fig. 3).
250 251
Selection and expression of enzymes in secondary modification. Hanse et al. have
252
identified the flavin monooxygenase (FMO) enzyme in Arabidopsis, FMOGS-OX1,
253
which converts methylthioalkyl glucosinolates to methylsulfinylalkyl glucosinolates.18
254
FMOGS-OX1 efficiently converted 4-methylthiobutyl glucosinolate (derived from
255
dihomo-methionine) to glucoraphanin, but only about 60% of 3-methylthiopropyl
256
glucosinolate
257
3-methylsulfinylpropyl glucosinolate.6 Thus, we chose FMOGS-OX1 from Arabidopsis
258
as a suitable FMO to modify the side chain in this study. We directly expressed the gene
259
amplified from the complementary DNA (cDNA) of Arabidopsis without any changes
260
in the gene sequence (Fig. 3).
(derived
from
homo-methionine)
was
converted
to
261 262
Functional verification of core formation and side chain-modification pathways
263
in vitro. After successfully expressing all the pathway enzymes in E. coli, their
264
enzymatic activities needed to be verified. Because most intermediates were not
265
commercially available and were difficult to detect, it was challenging to measure the
266
activity of each enzyme. Given these limitations, we opted to use a combination of pure
267
enzymes to detect product formation and verify their enzymatic activities. The first part
268
(side chain elongation) of glucoraphanin synthesis in E. coli was already achieved
269
previously, although different sources of enzymes were selected for this study.28 In
270
addition, the decomposition product of glucoraphanin (sulforaphane) was readily 12
ACS Paragon Plus Environment
Page 13 of 42
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
271
detected at low levels by GC-MS. Thus, we further added Brevicoryne brassicae
272
myrosinase to the reaction system containing only the enzymes required for the core
273
formation and side chain-modification pathways, in order to test the activities.40
274
Reaction was performed by incubating all the enzymes, substrates and cofactors
275
involved at 25°C for 1 h. The sulfur donor GSH was replaced by cysteine for the
276
reaction.40
277
GC-MS analysis showed that 3-butenyl isothiocyanate was detected in the reaction
278
system (Fig. 4). The presence of 3-butenyl isothiocyanate has been proposed as the
279
result of thermal degradation of sulforaphane during sample injection in GC or GC-MS
280
analysis.45 Blazevic et al. also reported that some 3-butenyl isothiocyanate could
281
originate from sulforaphane degradation. In our experiments, direct detection of the
282
authentic standard sulforaphane by GC-MS also revealed the presence of 3-butenyl
283
isothiocyanate (Fig. 4c).46 In addition, since DHM was not commercially available, we
284
used HM as the substrate. LC-MS analysis of the substrate HM was also accompanied
285
by small amount of DHM, which could have originated as an impurity of fmoc
286
L-homo-methionine. We anticipated that iberin (derived from HM) would be the main
287
product with small amount of sulforaphane (derived from DHM), whereas only
288
sulforaphane was identified in our studies. This outcome might have been due to the
289
enzyme specificities, although the speculation needs to be verified. Nevertheless,
290
identification of 3-butenyl isothiocyanate indicated that sulforaphane was formed in the
291
reaction and most importantly, confirmed the activities of all enzymes expressed in E.
292
coli, which encouraged us to assemble the entire pathway in vivo.
293 294
Assembling the entire glucoraphanin-biosynthesis pathway in E. coli.
295
Sulforaphane is known to be unstable upon exposure to heat and could be degraded into 13
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Page 14 of 42
296
several volatile compounds.47,48 In addition, sulforaphane also has antimicrobial
297
activity. To achieve stable product formation by microbial fermentation, we introduced
298
the entire glucoraphanin-synthesis pathway in E. coli, starting from methionine. The
299
whole pathway (including the enzymes engineered in the current study), in comparison
300
with the natural pathway in Arabidopsis thaliana, is shown in red in Figure 5.
301
The enzymes mediating side chain elongation, core structure formation, as well as
302
side chain modification were co-expressed together. The synthetic operons were all
303
under the control of the T7 promoter (Supplementary file, Fig. S2). The first operon
304
consisted of the first two genes involved in the side chain-elongation biosynthetic
305
pathway, while the second operon, which was driven by the T7 promoter in a different
306
plasmid (pET-28a), comprised the remaining genes of the DHM-biosynthetic pathway
307
in
308
BCAT3-GSL-ELONG-IPMDH1-IPMI-LSU1-IPMI-SSU3. It is widely accepted that
309
higher expression is observed from the upstream genes in the operon than the genes put
310
at downstream.49 Therefore, we positioned IPMDH1 upstream to IPMI (LSU1 and
311
SSU3) in the second operon to increase the activity of IPMDH1. The four genes in the
312
transcriptional order of EGT2-UGT74B1-ST5c-FMOGS-OX1 were inserted into multiple
313
cloning site 1 (MCS1) of pACYCDuet-1, and the forth operon was composed of
314
CYP79F1-CYP83A1-ATR2 and inserted into MCS2. The optimum proportion of P450
315
enzyme to CPR was ∼15:1 in natural plant systems.50 This trend demonstrates the
316
chimera of P450 and CPR with a ratio of 1:1 was not the most effective scheme. High
317
expression of P450s consumed a huge amount of resource and resulted in heavy
318
metabolic burden, which in turn depressed productivity.51 Therefore, P450s catalyzed
319
two consecutive steps, which should be the rate-limiting steps, and then we fused the
320
modified CYP79F1 and CYP83A1 genes with one ATR2 gene in the final position of
the
transcriptional
14
ACS Paragon Plus Environment
order
of
Page 15 of 42
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
321
the operon to maintain relatively low expression.
322 323
In vivo production of GRA in E. coli harboring the modified synthesis pathway.
324
The strain with two plasmids harboring four operons was constructed and analyzed for
325
synthesis of the final product glucoraphanin, derived from methionine. Because so
326
many transgenes were co-expressed in single cells, we first conducted RT-PCR
327
analysis to confirm that all the genes were successfully assembled in E. coli and that the
328
added artificial ribosome binding sequences in front of each gene were functional. All
329
genes in the glucoraphanin biosynthesis pathway were detectable by RT-PCR analysis
330
(Supplementary file, Figure S3), indicating that they were all transcribed correctly.
331
Next, we further investigated the product from methionine transformation in
332
isopropyl β-D-thiogalactopyranoside (IPTG)-induced cells. LC-MS/MS analysis
333
results showed that a trace amount of glucoraphanin was produced in the reaction
334
system (Fig. 6). As expected, glucoiberin was also detected as the only byproduct.
335
Glucoraphanin was detected both in the cells and medium, while glucoiberin was only
336
detected in the cells. The ratio of glucoraphanin/glucoiberin in the cells was about 3: 2.
337
The lack of detectable glucoiberin in the medium may have resulted from a low total
338
yield of glucosinolates and a negative matrix effect. Nonetheless, this finding indicated
339
that the reconstructed pathway in E. coli could not efficiently distinguish between “4C”
340
and “3C” glucosinolates and this result may have been due to low GSL-EONG
341
expression, which suggested a further target for optimizing biosynthesis at the pathway
342
level. The result was also in accordance with previous observations following transient
343
expression of glucoraphanin in tobacco plants,6 in which both glucoraphanin and
344
glucoiberin were detected. Although the native CYP79F1 enzyme purified from
345
Brassica can metabolize mono- methionine to hexahomo-methionine,20 glucosinolate 15
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
346
derivatives more than “4C” were not detected in our experiments. This could be due to
347
the selection of BCAT3 in the first stage pathway to avoid other byproducts, where the
348
substrate specificity of the enzyme only facilitated catalysis of two rounds of
349
condensation reactions at most. Furthermore, CYP79F1 exclusively catalyzes aliphatic
350
amino acid monooxygenase reactions, which could explain why no other
351
amino-acid-derived products were detected in this study. Thus, the enzyme-selection
352
strategy ensured the purity of the final product, which will reduce the number of
353
downstream purification steps required and the processing costs. The absence of
354
pathway intermediates identified in the spectrum could have been due to the relatively
355
low concentrations, where the intermediates were quickly transformed to the
356
downstream compounds. Alternatively, they simply could not be identified due to the
357
lack of authorized standards.
358
In summary, enzyme candidates from different sources were selected by reviewing
359
the literature. The N-terminal domains of two membrane P450 proteins were modified
360
for expression on the membrane of E. coli cells to facilitate protein expression. The
361
P450 proteins were also modified as fusion protein with the corresponding P450
362
reductases to achieve electron transfer. In addition, some enzymes, which could not be
363
expressed in soluble form or were not stable in E. coli cells, were substituted with
364
isoenzymes from different sources. By using cysteine as the sulfur donor in a
365
non-enzymatic step and one BCAT enzyme (BCAT3) to participate in both the
366
beginning and the final transamination reactions, we simplified the synthesis pathway,
367
avoided impurities, and lightened the metabolic burden to the cells. Our study
368
represents the first case where glucoraphanin was synthesized from methionine by
369
microbial cells. At present stage, the glucoraphanin levels in the broth were rather low
370
and the concentrations were roughly estimated to be 2-3 µg/L, despite of the 16
ACS Paragon Plus Environment
Page 16 of 42
Page 17 of 42
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
371
complicated sample-processing procedure limited the accuracy of such measurements.
372
Although some technical obstacles remain that may impede the process of large-scale
373
production of glucoraphanin by E. coli strains at present, the achievements in this study
374
provided a start for further improving the production performance. Importantly, the
375
protein-expression and gene-mining processes revealed by this study could also
376
provide reference information for demonstrating the biosynthesis of other secondary
377
plant products by microorganisms. More engineering strategies are required to increase
378
the efficiency, including further optimization of P450s expression. The changes such as
379
reduction of secondary mRNA structure, bacterial codon usage, the use of molecular
380
chaperones, as well as varying external growth conditions, also appear to influence
381
P450s expression in prokaryotic organisms.52 We are currently evaluating systems
382
metabolic engineering to increase the supplies of co-substrates, to construct ATP and
383
NADPH cofactor-regeneration cycles, and to balance the protein expression at the
384
pathway level to strengthen the production capacity of glucoraphanin.
385 386
Methods
387
Functional expression of individual enzymes in E. coli. The sequences of primers
388
used in this study for recombinant protein expression are shown in Table 1. The
389
BCAT3 gene from B. rapa and the GSL-ELONG gene from B. oleracea (encoding
390
methylthioalkylmalate synthase) were codon-optimized and synthesized by GENEWIZ
391
Co., Ltd. (Suzhou, China). IPMDH1, IPMI-LSU1, and IPMI-SSU3 were directly
392
amplified from Arabidopsis cDNA. The cytochrome P450 genes CYP79F1 from B.
393
oleracea and CYP83A1 from B. rapa, EGT2 (encoding a C-S lyase) from N. crassa,
394
UGT74B1 (encoding a glucosyltransferase) from B. rapa, and ST5c (encoding a
395
sulfotransferase) from B. rapa were codon-optimized and synthesized by GENEWIZ. 17
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
396
FMO-GSOX1 (encoding a flavin monooxygenase) was directly amplified from
397
Arabidopsis cDNA. All gene sequences used in this study can be accessed in the
398
associated online supplementary file.
399
The pET-28a and pETDuet-1 vectors were used for recombinant protein expression.
400
The chain-elongation genes BCAT3, GSL-ELONG, IPMI (IPMI-LSU1 and IPMI-SSU3
401
fusion genes), and IPMDH1 were constructed in pET-28a after truncating their
402
respective membrane-positioning sequences. For the core structure-formation pathway,
403
the CYP79F1, CYP83A1, EGT2, UGT74B1, and ST5c genes were constructed in the
404
pET-28a vector. CYP79F1 and CYP83A1 were each fused with a truncated variant of
405
the CPR gene ATR2 and expressed as chimeras after substituting their predicted
406
transmembrane domain segments with that of bovine 17α-hydroxylase, as described
407
previously with minor changes.40 The gene encoding the secondary-modification
408
enzyme FMO-GSOX1 was inserted into pETDuet-1. E. coli DH5α cells were used for
409
plasmid propagation, and BL21(DE3) cells were used for protein expression. The
410
molecular chaperone pGro7 was used for FMO-GSOX1 expression to enhance the
411
purification efficiency. Protein expression and purification were performed in
412
accordance with our previous work.40
413 414
Reconstitution of the core structure formation and side-chain modification
415
pathways in vitro. The individual genes were expressed, purified and added to the
416
reaction system for biosynthesizing sulforaphane, except for CYP79F1 and CYP83A1
417
since it is hard to get pure enzymes and only the respective cell lysates were used. Since
418
no commercial source of DHM was available, we used commercial fmoc
419
L-homo-methionine to produce HM after removing the fmoc-group.53 Synthesis of the
420
resultant HM molecule was verified by LC-MS analysis. The reaction buffer for 18
ACS Paragon Plus Environment
Page 18 of 42
Page 19 of 42
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
421
sulforaphane biosynthesis contained 5 mM homo-methionine, 5 mM cysteine, 2 mM
422
UDP-glucose, 2 mM pyridoxal phosphate (PLP), 10 mM NADPH, and 2 mM
423
3'-phosphoadenosine-5'-phosphosulfate lithium salt (PAPS). The reaction was
424
performed at 25°C for 1.5 h. Next, an equal volume of dichloromethane was used to
425
extract the reaction mixture twice at room temperature. The collected dichloromethane
426
portion was evaporated and re-dissolved in 100 µl dichloromethane for subsequent
427
GC-MS analysis.
428 429
Integration of the glucoraphanin-synthesis pathway into E. coli. The pET-28a
430
and pACYCDuet-1 vectors were used for co-expressing all genes in the
431
glucoraphanin-synthesis pathway in clonal E. coli cells. The primer sequences used for
432
pathway reconstruction are shown in Table 2. The T7 promoter was used for sequential
433
expression of glucoraphanin-biosynthesis genes. The plasmid pET-28a harbored genes
434
driving the first step of glucoraphanin biosynthesis (chain elongation), including
435
BACT3, GSL-ELONG, IPMI (LSU1 and SSU3), and IPMDH1. The plasmid
436
pACYCDuet-1 was used to express genes driving core structure formation (CYP79F1
437
and CYP83A1 fused with ATR2, EGT2, UGT74B1, and ST5c) and modification
438
(FMOGS-OX1). The respective ribosome-binding site was put in front of each gene to
439
make the proper expression. The pET-28a and pACYCDuet-1 co-transformants were
440
picked up from LB agar plates containing 50 µg/ml ampicillin and 12.5 µg/ml
441
chloramphenicol, respectively.
442 443
Verification of gene expression by RT-PCR analysis. To check the expression of
444
genes encoding the whole glucoraphanin-synthesis pathway, total RNA isolation
445
followed by RT-PCR analysis was performed. Aliquots of 4-mL overnight-grown 19
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
446
cultures were subsequently centrifuged for 1 min at 13,000 × g. RNA was isolated using
447
the E.Z.N.A.TM Bacterial RNA Kit (Omega Bio-Tek, USA). Total RNA concentrations
448
were determined based on the absorbance at 260 nm (NanoVue, GE Healthcare).
449
cDNAs were prepared with the FastQuant RT Kit (with gDNase) (Tiangen, China),
450
using appropriate gene-specific primers.
451 452
Transforming methionine to GRA by resting cell reactions. A single
453
transformant was inoculated into 10 mL LB media with 50 µg/ml ampicillin and 12.5
454
µg/ml chloramphenicol and grown overnight. The cultures were diluted at 1:50 in 200
455
mL TB media containing antibiotics and additional 75 µg/ml δ-aminolevulinic acid was
456
added when the cells grew to an OD600 of ~0.2. When the OD600 further reached 0.6-0.8,
457
0.5 mM IPTG was added for an additional growth for 12 h at 37 ºC to promote the
458
protein expression. The cells were collected and re-suspended in M9 medium
459
supplemented with 5 mM methionine, 5 mM cysteine, 0.2 mM PLP, 0.2 mM PAPS,
460
and 0.1 mM Fe(II) at an OD600 of ~30. After cultivation at 30 ºC for 48 h, the
461
supernatant was collected after centrifugation and freeze-dried. Pure methanol was
462
added to dissolve the lyophilized powder, the resulting solution was put at room
463
temperature for 4 h, and the supernatant was collected and filtered through a 0.22-µm
464
polyvinylidene fluoride filter. Then, the supernatant was further evaporated and
465
re-dissolved in 100 µl deionized water for LC-MS/MS analysis.
466 467
Analytical methods. GC-MS was performed on a Shimadzu QP2010 Ultra gas
468
chromatograph coupled to a Shimadzu mass spectrometer. A DB-5 column (30 m ×
469
0.25 mm × 0.25-µm) was used (head pressure, 100-kPa, splitless injection). For
470
sulforaphane, the analysis conditions used were as follows: injection port 250 °C; the 20
ACS Paragon Plus Environment
Page 20 of 42
Page 21 of 42
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
471
oven temperature program was held at 50 °C for 2 min, increased at 10 °C min-1 to
472
250 °C and held for 5 min; the ion source was run in EI mode (70 ev) at 200 °C.
473
Separation of glucosinolates were achieved by high-performance liquid
474
chromatography (HPLC) coupled with a Zorbax Eclipse Plus C18 column (250 mm ×
475
4.6 mm × 5 mm, Agilent Technologies, Germany). Formic acid (0.05%) in water and
476
methanol were employed as mobile phases A and B respectively. The injection volume
477
was 20 µL. The gradient elution program was applied at a flow rate of 0.5 mL/min as
478
follows: 0-30 min, 5% B in A; 30-32 min, 5-100% B in A; 32-40 min 100% B, 40-41
479
min 100-5% B in A and 41-50 min 5% B. The column temperature was maintained at
480
25 °C. The Agilent 1260 HPLC system was coupled to an AB SCIEX QTRAP 4500
481
mass spectrometer (Foster, CA, USA) equipped with an electrospray ion source
482
operated in negative ionization mode. The operating parameters were as follows:
483
collision gas: 20.0; collision gas: medium; ion spray voltage: -4,500 V; Temperature:
484
500 °C; ion source gas 1: 60.0; ion source gas 2: 60.0; declustering potential: -82.0;
485
entrance potential: -8.0; collision energy: -40.0; collision cell exit potential: -18.0.
486
Glucoraphanin ion detection was performed by multiple reaction monitoring (MRM)
487
mode at m/z 436-96.8, 436-177.8, 436-259.0 and 436-371.9. The same operating
488
parameters were used to detect glucoiberin, except that the collision energy was -26.0.
489
Ion detection was performed in MRM mode at m/z 422-96.9, 422-195.8, 422-258.9 and
490
422-357.9, according to previous literature.9
491
492
Supporting Information
493
Soluble enzyme expression data for BCAT3, GSL-ELONG, IPMI (LSU1 and SSU3)
494
and IPMDH1; a schematic diagram showing the assembly of all genes involved in
21
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
495
glucoraphanin biosynthesis in vivo; RT-PCR analyses of the gene expression in the
496
glucoraphanin biosynthesis pathway in E. coli; and the sequences of all genes used in
497
this study supplied as Supporting Information.
498
499 500 501
Author Contributions B.Y. and Y. L. designed the research. H.Y. and F.L. performed the experiments. H.Y., Y. L. and B.Y. wrote the article.
502 503 504
Notes The authors declare no competing financial interest.
505 506 507 508
Acknowledgements The work was partially supported by a grant from the Key International Cooperation Project of Chinese Academy of Sciences (155112KYSB20150024).
509 510
References
511
(1) Mikkelsen, M. D., Buron, L. D., Salomonsen, B., Olsen, C. E., Hansen, B. G.,
512
Mortensen, U. H., and Halkier, B. A. (2012) Microbial production of
513
indolylglucosinolate through engineering of a multi-gene pathway in a versatile
514
yeast expression platform. Metab. Eng. 14, 104-111.
515
(2) Fossati, E., Ekins, A., Narcross, L., Zhu, Y., Falgueyret, J. P., Beaudoin, G. A.,
516
Facchini, P. J., and Martin, V. J. (2014) Reconstitution of a 10-gene pathway for
517
synthesis of the plant alkaloid dihydrosanguinarine in Saccharomy cescerevisiae. 22
ACS Paragon Plus Environment
Page 22 of 42
Page 23 of 42
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
518 519 520 521 522
Nat. Commun. 5, 3283. (3) Sonderby, I. E., Geu-Flores, F., and Halkier, B. A. (2010) Biosynthesis of glucosinolates - gene discovery and beyond. Trends Plant Sci. 15, 283-290. (4) Halkier, B. A., and Gershenzon, J. (2006) Biology and biochemistry of glucosinolates. Annu. Rev. Plant Biol. 57, 303-333.
523
(5) Armah, C. N., Traka, M. H., Dainty, J. R., Defernez, M., Janssens, A., Leung, W.,
524
Doleman, J. F., Potter, J. F., and Mithen, R. F. (2013) A diet rich in
525
high-glucoraphanin broccoli interacts with genotype to reduce discordance in
526
plasma metabolite profiles by modulating mitochondrial function. Am. J. Clin.
527
Nutr. 98, 712-722.
528 529
(6) Mikkelsen, M. D., Olsen, C. E., and Halkier, B. A. (2010) Production of the cancer-preventive glucoraphanin in tobacco. Mol. Plant. 3, 751-759.
530
(7) Juge, N., Mithen, R. F., and Traka, M. (2007) Molecular basis for chemoprevention
531
by sulforaphane: a comprehensive review. Cell Mol. Life Sci. 64, 1105-1127.
532
(8) Axelsson, A. S., Tubbs, E., Mecham, B., Chacko, S., Nenonen, H. A., Tang. Y.,
533
Fahey, J. W., Derry, J. M., Wollheim, C. B., Wierup, N., Haymond, M. W., Friend,
534
S. H., Mulder, H., and Rosengren, A. H. (2017) Sulforaphane reduces hepatic
535
glucose production and improves glucose control in patients with type 2 diabetes.
536
Sci. Transl. Med. 9, eaah4477.
537
(9) Schuster, J., Knill, T., Reichelt, M., Gershenzon, J., and and Binder, S. (2006)
538
Branched-chain aminotransferase4 is part of the chain elongation pathway in the
539
biosynthesis of methionine derived glucosinolates in Arabidopsis. Plant Cell. 18,
540
2664-2679.
541 542
(10) Chapple, C.C.S., Decicco, C., and Ellis, B. E. (1988) Biosynthesis of 2-(2’-methylthio)
ethylmalatein
Brassica
carinata.
23
ACS Paragon Plus Environment
Phytochemistry.
27,
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
543
3461-3463.
544
(11) Chisholm, M. D., and Wetter, L. R. (1964) Biosynthesis of mustard oil glucosides.
545
IV. The administration of methionine-C14 and related compounds to horse radish.
546
Can. J. Biochem. 42, 1033-1040.
547
(12) Graser, G., Schneider, B., Oldham, N. J., and Gershenzon, J. (2000) The
548
methionine chain elongation pathway in the biosynthesis of glucosinolates in
549
Eruca sativa (Brassicaceae). Arch. Biochem. Biophys. 378, 411-419.
550
(13) Hansen, C. H., Du, L., Naur, P., Olsen, C.E., Axelsen, K. B., Hick, A. J., Pickett, J.
551
A., and Halkier, B. A. (2001) CYP83B1 is the oxime-metabolizing enzyme in the
552
glucosinolate pathway in Arabidopsis. J. Biol. Chem. 276, 24790-24796.
553
(14) Mikkelsen, M. D., Naur, P., and Halkier, B. A. (2004) Arabidopsis mutants in the
554
C-S lyase of glucosinolate biosynthesis establish a critical role for
555
indole-3-acetaldoxime in auxin homeostasis. Plant J. 37, 770-777.
556
(15) Grubb, C. D., Zipp, B. J., Ludwig-Muller, J., Masuno, M. N., Molinski, T. F., and
557
Abel, S. (2004) Arabidopsis glucosyltransferase UGT74B1 functions in
558
glucosinolate biosynthesis and auxin homeostasis. Plant J. 40, 893-908.
559
(16) Piotrowski, M., Schemenewitz, A., Lopukhina, A., Muller, A., Janowitz, T.,
560
Weiler, E. W., and Oecking, C. (2004) Desulfoglucosinolate sulfotransferases
561
from Arabidopsis thaliana catalyze the final step in the biosynthesis of the
562
glucosinolate core structure. J. Biol. Chem. 279, 50717-50725.
563
(17) Klein, M., Reichelt, M., Gershenzon, J., and Papenbrock, J. (2006) The three
564
desulfoglucosinolate sulfotransferase proteins in Arabidopsis have different
565
substrate specificities and are differentially expressed. FEBS J. 273, 122-136.
566
(18) Hansen, B. G., Kliebenstein, D. J., and Halkier, B.A. (2007) Identification of a
567
flavin-monooxygenase as the S-oxygenating enzyme in aliphatic glucosinolate 24
ACS Paragon Plus Environment
Page 24 of 42
Page 25 of 42
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
568
biosynthesis in Arabidopsis. Plant J. 50, 902-910.
569
(19) Schlegel, S., Klepsch, M., Gialama, D., Wickstrom, D., Slotboom, D. J., and de
570
Gier, J. W. (2010) Revolutionizing membrane protein overexpression in bacteria.
571
Microb Biotechnol. 3, 403-411.
572 573
(20) Wittstock, U., and Halkier, B. A. (2002) Glucosinolate research in the Arabidopsis era. Trends Plant Sci. 7, 263-270.
574
(21) Knill, T., Schuster, J., Reichelt, M., Gershenzon, J., and Binder, S. (2008)
575
Arabidopsis branched-chain aminotransferase 3 functions in both amino acid and
576
glucosinolate biosynthesis. Plant Physiol. 146, 1028-1039.
577
(22) Hogge, L. R., Reed, D. W., Underhill, E. W., and Haughn, G. W. (1988) HPLC
578
separation of glucosinolates from leaves and seeds of Arabidopsis thaliana and
579
their identification using thermospray liquid chromatography-mass spectrometry. J.
580
Chromatogr. Sci. 26, 551-556.
581
(23) Magrath, R., Bano, F., Morgner, M., Parkin, I., Sharpe, A., Lister, C., Dean, C.,
582
Turner, J., Lydiate, D., and Mithen, R. (1994) Genetics of aliphatic glucosinolates:
583
I. Side chain elongation in Brassica napus and Arabidopsis thaliana. Heredity. 72,
584
290-299.
585
(24) Kroymann, J., Textor, S., Tokuhisa, J. G., Falk, K. L., Bartram, S., Gershenzon, J.,
586
Mitchell-Olds, T. (2001) A gene controlling variation in Arabidopsis glucosinolate
587
composition is part of the methionine chain elongation pathway. Plant Physiol. 127,
588
1077-1088.
589
(25) Kroymann, J., Donnerhacke, S., Schnabelrauch, D., and Mitchell-Olds, T. (2003)
590
Evolutionary dynamics of an Arabidopsis insect resistance quantitative trait locus.
591
Proc. Natl. Acad. Sci. 100, 14587-14592.
592
(26) Textor, S., de Kraker, J. W., Hause, B., Gershenzon, J., and Tokuhisa, J. G. (2007) 25
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
593
MAM3 catalyzes the formation of all aliphatic glucosinolate chain lengths in
594
Arabidopsis. Plant Physiol. 144, 60-71.
595
(27) Field, B., Cardon, G., Traka, M., Botterman, J., Vancanneyt, G., and Mithen, R.
596
(2004) Glucosinolate and amino acid biosynthesis in Arabidopsis. Plant Physiol.
597
135, 828-839.
598
(28) Mirza, N., Crocoll, C., Olsem, C. E., and Halkier, B. A. (2016) Engineering of
599
methionine chain elongation part of glucoraphanin pathway in E. coli. Metab. Eng.
600
35, 31-37.
601
(29) Li, G., Riaz, A., Goyal, S., Abel, S., and Quiros, C. F. (2001) Inheritance of three
602
major genes involved in the synthesis of aliphatic glucosinolates in Brassica
603
oleracea. J. Amer. Soc. Hort. Sci. 126, 427-431.
604 605
(30) Binder, S., Knill, T., and Schuster, J. (2007) Branched-chain amino acid metabolism in higher plants. Physiol. Plant. 129, 68-78.
606
(31) Knill, T., Reichelt, H., Paetz, C., Gershenzon, J., and Binder, S. (2009) Arabidopsis
607
thaliana encodes a bacterial-type heterodimeric isopropylmalate isomerase
608
involved in both Leu biosynthesis and the Met chain elongation pathway of
609
glucosinolate formation. Plant Mol. Biol. 71, 227-239.
610
(32) He, Y., Chen, B., Pang, Q., Strul, J. M., Chen, S. (2010) Functional specification of
611
Arabidopsis isopropylmalate isomerases in glucosinolate and leucine biosynthesis.
612
Plant Cell Physiol. 51, 1480-1487.
613
(33) He, Y., Mawhinney, T. P., Preuss, M. L., Schroeder, A. C., Chen, B., Abraham, L.,
614
Jez, J. M., and Chen, S. (2009) A redox-active isopropylmalate dehydrogenase
615
functions in the biosynthesis of glucosinolates and leucine in Arabidopsis. Plant J.
616
60, 679-690.
617
(34) He, Y., Chen, L., Zhou, Y., Mawhinney, T. P., and Chen, B., Kang, B. H., Hauser, 26
ACS Paragon Plus Environment
Page 26 of 42
Page 27 of 42
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
618
B. A., and Chen, S. (2011) Functional characterization of Arabidopsis thaliana
619
isopropylmalate dehydrogenases reveals their important roles in gametophyte
620
development. New Phytol. 189, 160-175.
621
(35) He, Y., Galant, A., Pang, Q., Strul, J. M., Balogun, S. F., Jez, J. M., and Chen, S.
622
(2011) Structural and functional evolution of isopropylmalate dehydrogenases in
623
the leucine and glucosinolate pathways of Arabidopsis thaliana. J. Biol. Chem. 286,
624
28794-28801.
625
(36) Sevrioukova, I. F., Li, H., Zhang, H., Peterson, J. A., and Poulos, T. L. (1999)
626
Structure of a cytochrome P450-redox partner electron-transfer complex. Proc.
627
Natl. Acad. Sci. 96, 1863-1868.
628
(37) Barnes, H. J., Arlotto, M. P., and Waterman, M. R. (1991) Expression and
629
enzymatic activity of recombinant cytochrome P450 17 alpha-hydroxylase in
630
Escherichia coli. Proc. Natl. Acad. Sci. 88, 5597-5601.
631
(38) Williams, P. A., Cosme, J., Sridhar, V., Johnson, E. F., and McRee, D. E. (2000)
632
Microsomal cytochrome P450 2C5: comparison to microbial P450s and unique
633
features. J. Inorg. Biochem. 81, 183-190.
634
(39) Leonard, E., Yan, Y., and Koffas, M. A. (2006) Functional expression of a P450
635
flavonoid hydroxylase for the biosynthesis of plant-specific hydroxylated
636
flavonols in Escherichia coli. Metab. Eng. 8, 172-181.
637
(40) Liu, F., Yang, H., Wang, L., and Yu, B. (2016) Biosynthesis of the high-value plant
638
secondary product benzyl isothiocyanate via functional expression of multiple
639
heterologous enzymes in Escherichia coli. ACS Syn. Biol. 5, 1557-1565.
640
(41) Leonard, E., and Koffas, M. A. (2007) Engineering of artificial plant cytochrome
641
P450 enzymes for synthesis of isoflavones by Escherichia coli. Appl. Environ.
642
Microb. 73, 7246-7251. 27
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
643
(42) Mikkelsen, M. D., Petersen, B. L., Olsen, C. E., and Halkier, B. A. (2002)
644
Biosynthesis and metabolic engineering of glucosinolates. Amino Acids. 22,
645
279-295.
646
(43) Hu, W., Song, H., Her, S. A., Bak, D. W., Naowarojna, N., Elliott, S. J., Qin, L.,
647
Chen, X., and Liu, P. (2014) Bioinformatic and biochemical characterizations of
648
C-S bond formation and cleavage enzymes in the fungus Neurospora crassa
649
ergothioneine biosynthetic pathway. Org. Lett. 16, 5382-5385.
650
(44) Moldrup, M. E., Geu-Flores, F., Olsen, C. E., and Halkier, B. A. (2011) Modulation
651
of sulfur metabolism enables efficient glucosinolate engineering. BMC Biotechnol.
652
11, 12.
653
(45) Chiang, W. C., Pusateri, D. J., and Leitz, R. E. (1998) Gas chromatography/mass
654
spectrometry method for the determination of sulforaphane and sulforaphane
655
nitrile in broccoli. J. Agric. Food Chem. 46, 1018-1021.
656
(46) Blazevic, I., Radomic, A., Skocibusic, M., de Nicola, G. R., Montaut, S., Iori, R.,
657
Rollin, P., Mastelic, J., Zekic, M., and Maravic, A. (2011) Glucosinolate profiling
658
and antimicrobial screening of Aurinia leucadea (Brassicaceae). Chem. Biodivers.
659
8, 2310-2321.
660
(47) Liang, H., and Yuan, Q. (2012) Natural sulforaphane as a functional
661
chemopreventive agent: including a review of isolation, purification and analysis
662
methods. Crit. Rev. Biotechnol. 32, 218-234.
663
(48) Tian, G., Li, Y., Cheng, L., Yuan, Q., Tang, P., Kuang, P., and Hu, J. (2016) The
664
mechanism of sulforaphene degradation to different water contents. Food. Chem.
665
194, 1022-1027.
666
(49) Lim, H. N., Lee, Y., and Hussein, R., (2011) Fundamental relationship between
667
operon organization and gene expression. Proc. Natl. Acad. Sci. 108, 10626-10631. 28
ACS Paragon Plus Environment
Page 28 of 42
Page 29 of 42
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
668 669
(50) Jensen, K., and Moller, B. L. (2010) Plant NADPH-cytochrome P450 oxidoreductases. Phytochemistry. 71, 132-141.
670
(51) Biggs, B. W., Lim, C. G., Sagliani, K., Shankar, S., Stephanopoulos, G., Mey, M.
671
D., and Ajikumar, P. K. (2016) Overcoming heterologous protein interdependency
672
to optimize P450-mediated taxol precursor synthesis in Escherichia coli. Proc.
673
Natl. Acad. Sci. 113, 3209-3214.
674
(52) Zelasko, S., Palaria, A., and Das, A., (2013) Optimizations to achieve high-level
675
expression of cytochrome P450 proteins using Escherichia coli expression systems.
676
Protein Expr. Purif. 92, 77-87.
677 678
(53) He, H., Gong, D., Li, X., and Wei, P. (2005) Synthesis of L-threoninol. Chin. J. Pharmaceuticals. 36, 326-327.
29
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48
Page 30 of 42
Table 1 Primers used in protein expression in this study. Primers
Sequences (5’-3’)*
Function
BCAT3-F
CCCAAGCTTATGAATGCCGTGCTGAGCAATAGCAGC
Expression of truncated
BCAT3-R
CCGCTCGAGTGTTAACACGGTATACAG
BCAT3
GSL-ELONGF
CGGAATTCATGCCGCCGCAGAAGATCGAAATTGCCC
Expression of truncated
GSL-ELONGR
CCGCTCGAGCACCACGCTGCTGATCTG
GSL-ELONG
IPMI-LSU1-F
CGGAATTCATGACAATGACGGAGAAGATTCTAG
Expression of truncated
IPMI-LSU1-R
ACCGCTACCGCTGCTGGTGCTACCCTGCAAGAACTCCCTTGGGTC
IPMI (LSU1&SSU3)
IPMI-SSU3-F
GGTAGCACCAGCAGCGGTAGCGGTATAACCAGAGAGACTTTCCAC
IPMI-SSU3-R
CCGCTCGAGTCAAGCAGAAGGAATCATGCCGGC
IPMDH-F
CGGAATTCATGGCTTCACCTGGGAAAAAACGG
Expression of truncated
IPMDH-R
CCGCTCGAGTTAAACAGTAGCTGGAACTTTGG
IPMDH1
79F1-F
ACGCGTCGACATGGCTCTGTTATTAGCAGTTTTTACCACCAGCCTGCCGTACCC
Expression of modified
79F1-R
ACCGCTACCGCTGCTGGTGCTACCCGGGCAAAACTTCGGATACAG
CYP79F1 fusing with
ATR2-F
GGTAGCACCAGCAGCGGTAGCGGTAGGAGATCCGGTTCTGGGAATTC
truncated ATR2
ATR2-R
CCGCTCGAGTTACCATACATCTCTAAGATATC
30
ACS Paragon Plus Environment
Page 31 of 42
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48
ACS Synthetic Biology
83A1-F
ACGCGTCGACATGGCTCTGTTATTAGCAGTTTTTAGCCAGAAAAGCAAAACCAAAC
Expression of
83A1-R
ACCGCTACCGCTGCTGGTGCTACCTTTGCTAACTTTTTCCGGAACC
CYP83A1
EGT2-F
CCGGAATTCATGGTGGCAACCACCGTTGAAC
Expression of EGT2
EFT2-R
CCGCTCGAGTTATGCGCTTTCTTTGTAC
UGT74B1-F
CGCGGATCCATGGCCGAAACTACAACAAC
Expression of
UGT74B1-R
CCGCTCGAGTTAGTGTTTTTTGCCCAGAC
UGT74B1
ST5c-F
CGCGGATCCATGGAGAGCAAAAGCGAGAATG
Expression of ST5c
ST5c -R
CCGCTCGAGTTACGGGCTGCTTGCCAGAAAAC
FMO-F
CGAGCTCATGGCACCAACTCAAAACACAATC
Expression of
FMO-R
CCGCTCGAGTCATGATTCGAGGAAATAAGAAG
FMO-GSOX1
BMYR-F
CCGGAATTCATGGATTACAAATTTCCGAAAG
Expression of
BMYR-R
CCGCTCGAGTTACGGTTTGCCGGTGCTC
myrosinase
*
The restriction sites are undelined.
*
The linker sequences are bold.
31
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48
Page 32 of 42
Table 2 Primers used in the reconstitution of glucoraphanin biosynthetic pathway in E. coli. Sequences (5’-3’)*
Primers
Function
BCAT3-F
CATGCCATGGACATGAATGCCGTGCTGAGCAATAG
BCAT3-R
ATGTATATCCTCCTTATTATGTTAACACGGTATACAG
GSL-ELONGF
TAAGGAGGATATACATATGCCGCCGCAGAAGATCGAAATTG
GSL-ELONGR
CTAGCTAGCCACCACGCTGCTGATCTGCGGGCTC
IPMDH-F1
GGAATTGTGAGCGGATAACAATTCCTAAGGAGGATATACATATGGCTTCACCTGGGAAAA
Expression of BCAT3
Expression of GSL-ELONG
Expression of IPMDH1
AAC IPMDH-F2
CTAGCTAGCTAATACGACTCACTATAGGGGAATTGTGAGCGGATAACAATTCC
IPMDH-R
ACGCGTCGACTTAAACAGTAGCTGGAACTTTG
IPMI-LSU1-F
ACGCGTCGACTAAGGAGGATATACATATGACAATGACGGAGAAGATTCTAG
Expression of IPMI
IPMI-LSU1-R
ATGTATATCCTCCTTACTACTGCAAGAACTCCCTTGGGTC
(LSU1&SSU3)
IPMI-SSU3-F
TAAGGAGGATATACATATGATAACCAGAGAGACTTTCCAC
32
ACS Paragon Plus Environment
Page 33 of 42
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48
ACS Synthetic Biology
IPMI-SSU3-R
CCGCTCGAGTCAAGCAGAAGGAATCATGCCGGC
79F1-F
CATGGCCGGCCATGGCTCTGTTATTAGCAGTTTTTACCAC
79F1-R
ATGTATATCCTCCTTATTACGGGCAAAACTTCGGATACAG
83A1-F
TAAGGAGGATATACATATGGCTCTGTTATTAGCAGTTTTTAG
83A1-R
ACATGCATGCTTATTTGCTAACTTTTTCCGGAACCAGTTTC
ATR2-F
ACATGCATGCTAAGGAGGATATACATATGGCTCTGTTATTAGCAGTTTTTAG
ATR2-R
CCTTAATTAATTACCATACATCTCTAAGATATCTTC
EGT2-F
CGAGCTCTAAGGAGGATATACATATGGTGGC
EFT2-R
TTTCGGCCATATGTATATCCTCCTTATTATGCGCTTTCTTTGTACTCGCC
UGT74B1-F
AAGCGCATAATAAGGAGGATATACATATGGCCGAAACTACAACAACAACC
UGT74B1-R
TCCCCCCGGGTTAGTGTTTTTTGCCCAGAC
ST5c-F
TCCCCCCGGGTAAGGAGGATATACATATGGAGAGCAAAAGCGAG
ST5c -R
TTGGTGCCATATGTATATCCTCCTTATTACGGGCTGCTTGCCAG
FMO3-F
CAGCCCGTAATAAGGAGGATATACATATGGCACCAACTCAAAAC
FMO3-R
ACGCGTCGACTCATGATTCGAGGAAATAAG
*
The restriction sites are undelined.
*
The RBS sequences are bold.
Expression of CYP79F1
Expression of CYP83A1
Expression of ATR2
Expression of EGT2
33
ACS Paragon Plus Environment
Expression of UGT74B1
Expression of ST5c
Expression of FMO3
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
1
Figure Captions:
2
Fig. 1 Glucoraphanin biosynthesis pathway from methionine in plant.
3
The enzymes catalyze each step reaction are listed on the left column and full names for the enzymes
4
could be referred in the main text.
5 6
Fig. 2 Functional expression of modified cytochrome P450s in E. coli.
7
(a) Schematic diagram of constructing gene ensembles encoding the cytochrome CYP79F1 [8-540]
8
and CYP83A1 [20-501] with ATR2 [73-711], respectively. Numbers correspond to the amino acids
9
encoded by the first and last codons of the gene sequences. Synthetic sequences ε were used to
10
replace the deleted N-terminal domain of the native P450s and the artificial linker λ was used for
11
fusion of P450s with ATR2.
12
(b) Functional expression of modified cytochrome P450 enzymes CYP79F1 and CYP83A1 fused
13
with reductase ATR2 in E. coli. The protein bands were indicated by the red arrows, respectively.
14 15
Fig. 3 Soluble expression of selected proteins involved in the core structure formation and secondary
16
modification pathway of glucoraphanin in E. coli.
17
The left is the diagram showing that EGT2 catalyzes the similar reaction with SUR1. The right is the
18
profile of showing the protein expression and the respective protein bands were indicated by the red
19
arrows.
20 21
Fig. 4 Detection of the reaction product from the core formation and side-chain modification pathway
22
in vitro.
23
(a) GC chromatogram of authentic standard sulforaphane.
24
(b) GC chromatogram of product from the enzyme mixture and the possible product was indicated
25
with an arrow. 34
ACS Paragon Plus Environment
Page 34 of 42
Page 35 of 42
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
26
(c) Ion mass spectra of authentic standard sulforaphane and the structure was identified as 3-butenyl
27
isothiocyanate.
28
(d) Ion mass spectra of the product peak and the structure was identified accordingly.
29
(e) Thermal degradation of sulforaphane to 3-butenyl isothiocyanate caused by the high temperature
30
of the injection ports of GC and GC/MS.
31 32
Fig. 5 The biosynthesis pathway of glucoraphanin in Arabidopsis thaliana vs assembly in E. coli.
33
(a) Side-chain elongation of amino acid.
34
(b) Formation of core glucosinolate functional group.
35
(c) Secondary modification of side chain and breakdown of glucoraphane.
36 37
Fig. 6 Glucoraphanin production in E. coli by expression of the modified synthetic pathway.
38
(a) LC-MS/MS of authentic standard glucoraphanin. (b) Reaction product from the resting cell.
39
Multiple reaction monitoring (MRM) was used to monitor analyze parent ion production transitions.
40
MRMs were chosen as follow: glucoraphanin (m/z 436-96.8, 436-177.8, 436-259.0, 436-371.9).
35
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Figure 1 Glucoraphanin biosynthesis pathway from methionine in plant 212x282mm (300 x 300 DPI)
ACS Paragon Plus Environment
Page 36 of 42
Page 37 of 42
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
Figure 2 Functional expression of modified cytochrome P450s in E. coli 80x61mm (300 x 300 DPI)
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Figure 3 Soluble expression of selected proteins involved in the core structure formation and secondary modification pathway of glucoraphanin in E. coli 140x69mm (300 x 300 DPI)
ACS Paragon Plus Environment
Page 38 of 42
Page 39 of 42
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
Figure 4 Detection of the reaction product from the core formation and side-chain modification pathway in vitro 140x108mm (300 x 300 DPI)
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Figure 5 The biosynthesis pathway of glucoraphanin in Arabidopsis thaliana vs assembly in E. coli 80x74mm (300 x 300 DPI)
ACS Paragon Plus Environment
Page 40 of 42
Page 41 of 42
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
Figure 6 Glucoraphanin production in E. coli by expression of the modified synthetic pathway 80x109mm (300 x 300 DPI)
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Abstract Graphic 69x39mm (300 x 300 DPI)
ACS Paragon Plus Environment
Page 42 of 42