Subscriber access provided by UNIV OF CALIFORNIA SAN DIEGO LIBRARIES
Article
Combinatorial Evolution of Enzymes and Synthetic Pathways Using One-Step PCR Peng Jin, Zhen Kang, Junli Zhang, Linpei Zhang, Guocheng Du, and Jian Chen ACS Synth. Biol., Just Accepted Manuscript • DOI: 10.1021/acssynbio.5b00240 • Publication Date (Web): 11 Jan 2016 Downloaded from http://pubs.acs.org on January 16, 2016
Just Accepted “Just Accepted” manuscripts have been peer-reviewed and accepted for publication. They are posted online prior to technical editing, formatting for publication and author proofing. The American Chemical Society provides “Just Accepted” as a free service to the research community to expedite the dissemination of scientific material as soon as possible after acceptance. “Just Accepted” manuscripts appear in full in PDF format accompanied by an HTML abstract. “Just Accepted” manuscripts have been fully peer reviewed, but should not be considered the official version of record. They are accessible to all readers and citable by the Digital Object Identifier (DOI®). “Just Accepted” is an optional service offered to authors. Therefore, the “Just Accepted” Web site may not include all articles that will be published in the journal. After a manuscript is technically edited and formatted, it will be removed from the “Just Accepted” Web site and published as an ASAP article. Note that technical editing may introduce minor changes to the manuscript text and/or graphics which could affect content, and all legal disclaimers and ethical guidelines that apply to the journal pertain. ACS cannot be held responsible for errors or consequences arising from the use of information contained in these “Just Accepted” manuscripts.
ACS Synthetic Biology is published by the American Chemical Society. 1155 Sixteenth Street N.W., Washington, DC 20036 Published by American Chemical Society. Copyright © American Chemical Society. However, no copyright claim is made to original U.S. Government works, or works produced by employees of any Commonwealth realm Crown government in the course of their duties.
Page 1 of 37
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
Figure 1. Schematic diagram of the RECODE method with two-step (a) or one-step (b) PCR. Phosphorylated mutagenic oligonucleotides, UAP and DAP were annealed to template DNA after denaturation. All gaps between the primers were filled and ligated with thermostable DNA polymerase and ligation. For two-step PCR, single-stranded mutant products of the 1st step PCR were purified, and then used as the template to synthesize double-stranded variants in the 2nd step PCR reaction with AP primer which specifically matches DAP. For one-step PCR, the AP primer was added in the reaction to selectively synthesize the complementary chains of the mutant strand. The concentration of AP was set with 0.2 µM which was 2 times of that of the DAP primer. 352x139mm (300 x 300 DPI)
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Figure 2. Optimization and application of RECODE system. (a) Termination codons were introduced into galactosidase (lacZ) and esterase (estC23) with substitution or deletion of the native codons by RECODE. (b) The phenotype of the recombinants wild-type galactosidase (blue colonies) and esterase (clear halos). LB agars containing tributyrin (esterase substrate) and x-gal (galactosidase substrate) were used. (c) and (d) showed the phenotypes of variants by RECODE with two-step and one-step PCR systems, respectively. (e) Combinatorial engineering of SIR fragment of the rpoS gene. Four mutant spacers between -35 and -10 boxes of SIR segments were inserted upstream of the gfp gene to control the expression of GFP. (f) The transcriptional strength of the SIR variants with one-round evolution by RECODE (with GFP protein as a reporter). (g) Alignment of the nucleic acid sequences of the SIR variants. 160x175mm (300 x 300 DPI)
ACS Paragon Plus Environment
Page 2 of 37
Page 3 of 37
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
Figure 3. Combinatorial evolution of the leech hyaluronidase by RECODE. (a) Illustration of the evolution and screening process. The activities of the wild-type LHAase and its variants towards heparin (top) and hyaluronan (bottom) were determined in 96-well microtiter plates. (b) and (c) are the enzyme activities of the variants (culture supernatants) towards hyaluronan and the non-natural substrate heparin, respectively. (d) Diverse distribution of the mutant amino acids (red) in the constructed variants. All error bars indicate ± SD, n = 3. 203x182mm (300 x 300 DPI)
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Figure 4. Combinatorial engineering the regulatory elements and pathway enzymes to optimize synthetic pathway towards ALA. (a) The heme biosynthesis pathway via ALA in E. coli. The arrows in red represent a positive relationship between ALA accumulation and overexpression of the enzymes HemA, HemL and HemF. (b) Illustration of combinatorial engineering of the promoters, ribosome binding sites and pathway genes. (c) Evaluation of the variants in 96-well microtiter plates with the parent E. coli LAF strain as control (red arrow). (d) ALA accumulation of the variants in shake flasks. All error bars indicate ± SD, n = 3. 264x188mm (300 x 300 DPI)
ACS Paragon Plus Environment
Page 4 of 37
Page 5 of 37
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
Figure 5. RECODE generalized workflow for rapid construction of novel regulatory elements and enzymes, and engineering of optimized synthetic pathways to facilitate the development of in vivo and in vitro synthetic biology. 306x206mm (300 x 300 DPI)
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
1
Combinatorial Evolution of Enzymes and Synthetic Pathways Using
2
One-Step PCR
3
Peng Jin†,‡, Zhen Kang†,‡,§,*, Junli Zhang†,‡, Linpei Zhang†,‡, Guocheng Du†,‡,§ and
4
Jian Chen†,‡
5
†
6
Biotechnology, Jiangnan University, Wuxi 214122, China
7
‡
8
Wuxi, Jiangsu 214122, China
9
§
The Key Laboratory of Industrial Biotechnology, Ministry of Education, School of
Synergetic Innovation Center of Food Safety and Nutrition, Jiangnan University,
The Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of
10
Education, School of Biotechnology, Jiangnan University, Wuxi 214122, China
11
S Supporting Information ○
12 13
ABSTRACT: DNA engineering is the fundamental motive driving the rapid
14
development of modern biotechnology. Here, we present a versatile evolution method
15
termed “Rapidly Efficient Combinatorial Oligonucleotides for Directed Evolution”
16
(RECODE) for rapidly introducing multiple combinatorial mutations to the target
17
DNA by combined action of a thermostable high-fidelity DNA polymerase and a
18
thermostable DNA Ligase in one reaction system. By applying this method, we
19
rapidly constructed a variant library of the rpoS promoters (with activity of 8% to
20
460%), generated a novel heparinase from the highly specific leech hyaluronidase
21
(with more than 30 mutant residues) and optimized the heme biosynthetic pathway by 1
ACS Paragon Plus Environment
Page 6 of 37
Page 7 of 37
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
22
combinatorial evolution of regulatory elements and pathway enzymes (2500±120 mg
23
L-1 with 20-fold increase). The simple RECODE method enabled researchers the
24
unparalleled ability to efficiently create diverse mutant libraries for rapid evolution
25
and optimization of enzymes and synthetic pathways.
26 27
KEYWORDS: Combinatorial mutagenesis; single-stranded DNA oligonucleotides;
28
directed evolution; pathway optimization; synthetic biology
29 30 31
Enzymes have attracted much attention and study in the fields of metabolic
32
engineering and synthetic biology because of their unique properties of accepting
33
complex substrates and high selectivity and specificity. In particular, enzymes have
34
been widely used for synthesis of fine chemicals, complex pharmaceuticals,
35
agrochemicals, bulk chemicals and biofuels.1 However, due to limited sources and
36
performance of natural enzymes (for instance stability, substrate specificity, or
37
enantioselectivity), construction of novel robust enzymes with desired properties2-4 2
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
38
has always been a subject of interest. However, it is not straightforward to artificially
39
design and synthesize new interchangeable parts, modules or systems with specific
40
phenotypes owing to our limited understanding of the relationship between sequence
41
and function.5,6 Thus, many mutagenesis protocols including PCR based (such as
42
DNA shuffling,7 StEP,8 RPR,9 RACHITT10 and Synthetic shuffling11) and non-PCR
43
based (such as ITCHY12 and SHIPREC13) have been developed and applied.
44
Especially in recent years, application DNA assembly technologies have greatly
45
advanced significant progress in enzyme directed evolution.14-17
46
Several single-stranded DNA (ssDNA) mutagenesis technologies have been
47
developed to for in vivo genome engineering and in vitro enzyme evolution. An
48
ssDNA mediated genome editing approach has been regarded as one of the most
49
potential biotechnologies,18,19 but restricted host strains and time-consuming repetitive
50
electroporation make it inapplicable to rapidly construct mutant libraries for in vivo
51
enzyme evolution. In contrast, although mutagenic ssDNA oligonucleotides have
52
been introduced for improving in vitro DNA shuffling methods17,20-22 and
53
PCR-mediated site-mutagenesis approaches,23-27 the operation process was still
54
comparatively complicated (for instance, the requirement of preparation of substrate
55
and digestion with DNase I) and the generation efficiency of positive mutants was
56
still not satisfied.28 Thus, to enhance mutation efficiency and to simplify the
57
mutagenesis process, several ssDNA mediated PCR methods have also been
58
developed for multiple site-directed mutagenesis.29-31 However, the requirement of 3
ACS Paragon Plus Environment
Page 8 of 37
Page 9 of 37
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
59
preparation of linear or circular ssDNAs as templates restricts its practical
60
applications. Moreover, the employment of the room-temperature T4 DNA
61
polymerase and ligase (which merely allows one-time of annealing, extension and
62
ligation) resulted in rare products and blocked its potential applications in mutant
63
library construction.29,30 Recently, a user-defined mutagenesis method PFunkel32 that
64
based on the Kunkel mutagenesis protocols33,34 has been developed and applied for
65
repeated introduction of designed site-directed mutations by recruiting the
66
thermostable Pfu DNA polymerase and Taq DNA ligase, while the dependence of
67
uracil-containing DNA plasmid template still confines its potential applications. Thus,
68
how to fully utilize the advantages of synthetic oligonucleotides to efficiently
69
introduce diversity at specified positions while controlling mutation frequency17 in
70
small, multi-combinatorial and high-quality mutant libraries for high-throughput
71
screening is still a key issue to be solved.1,4,5,35
72
With the rapid development of synthetic biology,36 a wide range of artificial
73
regulatory elements and biomolecular systems have been engineered for various
74
useful tasks.5,37 In the past few decades, many sound strategies and tools including
75
promoter library engineering,38 ribosome binding site engineering,39 modular protein
76
scaffolds,40 tunable intergenic regions,41 dynamic sensor-regulators,42 multivariate
77
modular pathway engineering,43,44 combinatorial transcriptional engineering45 and
78
regulation with small RNAs46,47 and systems metabolic engineering48,49 have been
79
developed for optimizing synthetic metabolic pathways. However, it remains a 4
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
80
challenge to construct balanced synthetic pathways by primarily depending on
81
regulation at transcriptional, post-transcriptional or translational levels,50-52 since these
82
approaches often ignore the inherent shortcoming of pathway enzymes (for instance,
83
stability, fitness, and inhibition by allosteric control with metabolites).2 In fact, tuning
84
and optimization of enzymes are both important concepts for improving the overall
85
synthetic pathway performance.50
86
Here, inspired by the scarless DNA assembly method53 via ligase cycling reaction
87
and the ssDNA-based mutagenesis studies,29,30,32,54 we developed a simple, rapid,
88
efficient, and powerful in vitro evolution method termed “Rapidly Efficient
89
Combinatorial Oligonucleotides for Directed Evolution” (RECODE) (Figure 1). By
90
applying this RECODE method with one- or two-step PCR, the strength of the
91
regulatory element of the rpoS gene55 was increased 5-fold while the leech
92
hyaluronidase LHAase56 was endowed with both activities towards hyaluronan and
93
heparin. Concurrently, a novel strategy of combinatorial engineering of regulatory
94
elements and enzymes for optimizing the synthetic pathway was developed based on
95
this RECODE method. As a result, the heme biosynthetic pathway was optimized
96
with a significant increase in the production of 5-aminolevulinic acid (ALA, 20-fold).
97
■ RESULTS AND DISCUSSION
98
The Basic Principle and Feature of the RECODE Method. The RECODE method
99
is dependent on using thermostable DNA polymerase and ligase to achieve
100
combinatorial mutations at multiple sites (Figure 1). The single-stranded 5
ACS Paragon Plus Environment
Page 10 of 37
Page 11 of 37
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
101
oligonucleotides containing various desirable mutations were designed according to
102
the target fragments. After repeated PCR-based thermal cycling, the phosphorylated
103
single-stranded oligonucleotides annealed to the parent template DNA and extended
104
to the adjacent oligonucleotide position by DNA polymerase. Then the adjacent 5'
105
phosphate and 3' hydroxyl termini were ligated by Taq DNA ligase to form
106
completely
107
denaturation-annealing-extension-ligation temperature cycles, the DNA fragments
108
with combinatorial mutations were generated in one-round evolution. Specifically, the
109
upstream anchor primer (UAP), the downstream anchor primer (DAP) and the
110
antisense primer (AP) with specific structures29 were designed and exclusively
111
applied to amplify the variant DNA, which would be in favor of avoiding digestion
112
with DpnI endonuclease and facilitating subsequent assembly (Figure 1).
matched
duplex
DNA
structures.
After
25
113
Optimization of the Constructed RECODE Reaction System for Multiple
114
Site-directed Mutagenesis. To construct, evaluate and optimize the RECODE
115
method, we first constructed an indicator system (Figure 2a) by co-overexpressing a
116
galactosidase (encoded by lacZ) and an esterase (encoded by estC23)57. After
117
induction with isopropyl-β-D-thiogalactopyranoside (IPTG), both activities of
118
esterase (clear halos) and galactosidase (blue colonies) could be observed on
119
Luria-Bertani (LB) agarose plate with tributyrin and X-gal (Figure 2b). In
120
consideration of fidelity and avoiding the 5'-3'exonuclease (which may introduce
121
non-specific mutations and remove the downstream oligonucleotides annealed on a 6
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
122
template from 5' terminus),58 the Phusion® high fidelity DNA polymerase without the
123
5'-3' exonuclease activity was selected for extension reactions. Simultaneously, the
124
commonly used thermostable DNA ligases were comparatively investigated and the
125
results showed that only the Ampligase thermostable DNA Ligase generated small
126
amounts of mutant products (Supplementary Figure S1), which should be ascribed to
127
its similar optimal pH (8.3) compared with the high fidelity Phusion® DNA
128
polymerase (9.3). Due to the incompatible working concentration of Mg2+ for the
129
Phusion® DNA polymerase and Ampligase thermostable DNA Ligase (2 mM and 10
130
mM, respectively), the Mg2+ concentration was further investigated and optimized to
131
reconcile the extension and ligation reactions. When Mg2+ was above 7 mM or below
132
3 mM, only small amounts of mutant products were produced (Supplementary Figure
133
S2a), suggesting high concentration of Mg2+ inhibits the extension activity of
134
Phusion® DNA polymerase while low concentration of Mg2+ hinders the ligation
135
activity of Ampligase thermostable DNA Ligase. Accordingly, 5 mM Mg2+ was
136
utilized for extension and ligation in the optimized RECODE reaction buffer system
137
(Supplementary methods). In addition, the melting temperature (Tm) of the mutagenic
138
oligonucleotides, which depends on the flanking homology region of targets, showed
139
a wide range (40−66 °C). Specifically, the Tm value was optimized at 50 °C for high
140
recombination frequency (data not shown) by applying a consistent design of
141
mutagenic oligonucleotides (Supplementary Table S1).
142
Through systematic comparison and optimization of the components and reaction 7
ACS Paragon Plus Environment
Page 12 of 37
Page 13 of 37
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
143
cycle conditions (Supplementary Figure S1-3), the RECODE method with two-step
144
PCR was established (Figure 1a). On this basis, a color indicator system with
145
three-locus lethal mutations (inactivation of galactosidase in one site and esterase in
146
two sites) (Figure 2a) was designed and applied to visually examine the mutation
147
efficiency of this method. Remarkably, it could be found that all the recombinant
148
colonies on LB plates with X-gal and tributyrin substrates showed no parental
149
phenotype (blue color and transparent zone) (Figure 2c) and about 66.5% of the
150
colonies
151
(Supplementary Table S2). DNA sequencing results showed that 93 of the 116
152
variants (blue colonies) were simultaneously introduced mutations at two sites of the
153
estC23 gene while 109 of the 230 variants (white colonies) were simultaneously
154
introduced mutations at three sites of the designed lacZ-estC23 operon (47.5%).
lost
both
lipolytic
and
galactosidase
activities
(white
colonies)
155
To further simplify the operation process, a more simple RECODE method with
156
one-step PCR in one reaction system was designed and established (Figure 1b). To
157
guarantee the matching efficiency to the template, the DAP (60 nt) was designed with
158
a 40 nt complementary sequence to the template and 20 nt to the AP segment (Figure
159
1b). Meanwhile, a two-fold molar excess of APs over other primers were added with a
160
final concentration of 0.2 µM to guarantee extension and synthesis of all the mutant
161
single-strand
162
(Supplementary Figure S4) gave rise to fewer recombinants, the proportion of the
163
white colonies (about 61.6%, Figure 2d) was similar with that of the two-step PCR
chains.
Although
comparatively
less
8
ACS Paragon Plus Environment
target
PCR
products
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
164
systems. Furthermore, DNA sequencing results showed that 26 of the 45 mutants with
165
color colonies were simultaneously introduced mutations at three sites of the designed
166
lacZ-estC23 operon (57.8%, Figure 2d) which was higher than that of the two-step
167
PCR systems. Taken together, all the results demonstrate the high efficiency of the
168
RECODE method with one-step or two-step PCR system.
169
Application of RECODE to Rapidly Construct a Diverse Mutant Library of
170
Regulatory Elements. Promoter engineering is a powerful strategy in metabolic
171
engineering and synthetic biology.59 The transcriptional sigma factors and promoters
172
have been engineered for pathway construction and optimization.60 Here, four spacer
173
regions between the -35 and -10 elements from the 5′-untranslated region of rpoS
174
gene (encoding the sigma factor 38 for stress resistance in Escherichia coli) was
175
cloned and engineered by the RECODE method (Figure 2e) to generate a mutant
176
library with different transcriptional strengths. Specifically, the green fluorescent
177
protein that encoded by gfp was chosen as a reporter for visualized analysis of the
178
mutant library. After one-round evolution, five hundred colonies with different visual
179
green fluorescence intensities were randomly selected and cultured in 96-well plates
180
for characterization. The variants that showed significant differences compared to the
181
wild-type counterpart were then further confirmed in shake flask cultures. As shown
182
in Figure 2f, 74 variants exhibited a broad span of fluorescence intensity (8–460% of
183
the wild type). DNA sequencing results of the randomly picked variants showed that
184
the introduced multiple-cassette mutations covered the four spacer regions (Figure 2g), 9
ACS Paragon Plus Environment
Page 14 of 37
Page 15 of 37
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
185
demonstrating its high efficiency in combinatorial editing of DNA segments.
186
Collectively, these data suggest that this simple RECODE method should have wide
187
potential applications in rapid enzyme evolution.
188
Application of RECODE to Rapidly Broaden the Substrate Specificity of the
189
Leech Hyaluronidase. As an application example, a novel leech hyaluronidase
190
(LHAase, without activity to heparin) secreted by a recombinant Pichia pastoris56 was
191
chosen as a candidate for rapid evolution by RECODE (Figure 3a). The objective is to
192
rapidly create variants with improved LHAase activities and/or broadened substrate
193
specificity to heparin. According to the phylogenetic relationship and conserved
194
domains among the hyaluronidases and heparanases,56 10 mutagenic oligonucleotides
195
with degenerate sequences (Supplementary Table S1) were designed to cover the 10
196
candidate regions near the conserved amino acids.56 After one-step PCR evolution and
197
assembly, all the mutants were transformed into P. pastoris GS115. Subsequently,
198
Seven hundred colonies were firstly picked in 96 deep-well plates for cultivation and
199
assay of HAase activity. Then, the variants with significant differences were further
200
investigated in flask cultures. Remarkably, many variants with different activities
201
towards hyaluronan and heparin were successfully created after one-round evolution
202
by the RECODE method (Figure 3a). In comparison, the variant M2D7 showed a
203
2.4-fold increase of the LHAase activity while both the variants M4A4 and M5B4
204
showed increased LHAase activity and heparanase activity (Figure 3b and 3c).
205
Impressively, although the LHAase activity of the variant M4A12 decreased, the 10
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
206
heparanase activity increased with the highest titer of 750 ± 20 U mL–1, indicating its
207
potential to be an exclusive “heparanase”. Further DNA sequencing analysis of these
208
variants showed that the introduced mutation sites covered all the area of candidates
209
with random combinations (Figure 3d). In particular, more than 30 amino acid
210
residues in a single variant were simultaneously mutated (accounting for 6.12% of the
211
LHAase, 489 amino acids), demonstrating the powerful capacity of this RECODE
212
method in enzyme directed evolution.
213
Combinatorial Engineering of Regulatory Elements and Enzymes to Optimize
214
Synthetic Pathway. In many cases, it is not possible to optimize a synthetic pathway
215
only by engineering the regulatory elements (such as promoters and RBS). Here,
216
based on the established simple RECODE method, a novel approach of combinatorial
217
engineering of regulatory elements and pathway enzymes was proposed to optimize
218
synthetic pathways towards target compounds. As an example, the heme biosynthesis
219
pathway was simultaneously engineered at the transcriptional and protein levels for
220
efficient accumulation of the intermediate ALA (Figure 4a). Specifically, the three
221
crucial genes hemA, hemL and hemF61 were simultaneously engineered at the
222
transcriptional and protein levels by the RECODE method (Figure 4b). After
223
one-round evolution, 1000 recombinant colonies were characterized in 96 deep-well
224
plates (Figure 4c) and further investigated with flask cultures. As shown in Figure 4d,
225
the recombinant BL21 (DE3) strain containing the variant A24-H6 accumulated ALA
226
to the highest titer (2500 ± 120 mg L–1) which was 20-fold of that of the parental 11
ACS Paragon Plus Environment
Page 16 of 37
Page 17 of 37
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
227
strain. Furthermore, DNA sequencing results of eight variants (Figure 4d) showed that
228
the mutation sites covered most of the designed regions including promoters, RBS
229
regions and the open reading frames of hemA, hemL and hemF (Supplementary Figure
230
S6). The results demonstrate the high efficiency of this simple approach in pathway
231
engineering. Meanwhile, the results also confirmed that fine-tuning of the three
232
committed enzymes HemA, HemL and HemF are critical for ALA accumulation.61
233
Discussion. Enzyme directed evolution technologies have achieved great progress
234
in the past two decades, however, the sizes of the generated mutant libraries are
235
generally too large to fully explore by available selection strategies.50,62 Thus,
236
development of simple, fast and efficient techniques that allow rational design and
237
target engineering of smaller libraries with higher degrees of precision is always a
238
major requirement.1 In the present study, a designated RECODE method with
239
one-step PCR has been developed for rapid DNA editing (Figure 1), which has been
240
demonstrated to be effectively applied in engineering of regulatory elements, enzymes
241
and synthetic pathways.
242
Different from previous ssDNA-based methods,29,30 the RECODE method was
243
developed by employing the thermostable Phusion® high fidelity DNA polymerase
244
and Taq DNA ligase (Figure 1). Combined action of the thermostable high-fidelity
245
DNA polymerase and ligase not only facilitates all kinds of DNA templates (eg.
246
double-stranded DNA fragments, cDNA, genomic DNA and plasmid DNA) but also 12
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
247
avoids or reduces non-specific mutations and incorrect ligations. Here, to construct
248
this reaction system in one tube, the Mg2+ concentration and pH were demonstrated as
249
the major parameters that affect the activities of polymerase and ligase
250
(Supplementary Figure S1, 2) and optimized to achieve accordant extension and
251
ligation activities for the production of amounts of variants with desirable diversities
252
(Supplementary Figure S2). After further optimization of the extension and ligation
253
cycles (to fully anneal the mutagenic oligonucleotides and make full use of the
254
advantages of the thermostable DNA Ligase.53) and the Tm value of the homologous
255
sequence, a sufficient quantity of mutants with multi-combinatorial mutations were
256
rapidly generated for high-throughput screening. In comparison, it would be
257
impossible to achieve combinatorial multiple-cassette mutagenesis by further
258
modification of previous reported protocols because of application of the
259
room-temperature T4 DNA polymerase and ligase.29,30 Moreover, the PFunkel
260
method32 may be laborious due to the delayed addition of reaction reagents (including
261
polymerase and ligase) and degradation of the uracil-containing template and DNA
262
products not in the desired covalently closed circular by uracil DNA glycosylase and
263
exonuclease III.32
264
For enzyme evolution, the size and the diversity of the created library are always
265
the critical characterizations for the subsequent screening especially when no effective
266
selection methods are available. To this end, specific upstream and downstream
267
anchor primers and an antisense primer were designed and standardized (Figure 1). 13
ACS Paragon Plus Environment
Page 18 of 37
Page 19 of 37
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
268
Compared with previous DNA shuffling methods,7,63 the RECODE method enabled
269
rapid DNA evolution with higher diversity within a smaller library size. This is
270
accomplished by the application of specific primers in the reaction system that result
271
in synthesis of the full-length active mutants without the repeat amplification of the
272
existing ones. Furthermore, this RECODE method also showed powerful capacity in
273
combinatorial editing of a target DNA with insertion and deletion of interested
274
elements (Supplementary methods and Supplementary Figure S5). Compared with
275
previous DNA shuffling strategies, this RECODE method was much more rational in
276
protein engineering especially with the assistance of crystal structure analysis. As an
277
example, we successfully created a novel “heparanase” from the highly specific leech
278
hyaluronidase (Figure 3a). Moreover, the novel approach devised for rapid
279
optimization of the synthetic pathways by combinatorial engineering of regulatory
280
elements and pathway enzymes also showed high efficiencies (Figure 4d). Compared
281
with other pathway engineering strategies that mainly focused on transcriptional
282
regulation,64-66 this approach should be comparatively easier to use in constructing an
283
optimized pathway by simultaneously improving pathway enzyme properties and
284
optimizing the concentration ratios.
285
Conclusion. We have developed a simple (one-step PCR with any form of DNA
286
templates), rapid (one-round evolution of 5 kb DNA within 3 hours), efficient (higher
287
amount of products) and powerful (multiple single-directed mutations, insertions or
288
deletions and combinatorial multiple-cassette mutagenesis) RECODE method, which 14
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
289
enables us to easily reedit the target DNA to construct novel regulatory parts and
290
modules (Figure 5). Combining with protein synthesis in vitro,67,68 computer-aided
291
design69 and simulation, and efficient assembly techniques,1,70 especially the Gibson
292
isothermal assembly,71 Ligase Cycling Reaction53 and the DNA assembler,72 this
293
RECODE method will greatly promote protein studies and benefit our understanding
294
of the relationship between sequence and enzyme function. In conjunction, the
295
demonstrated approach for simultaneous modification of regulatory elements and
296
pathway enzymes will also enable us to rapidly optimize the synthetic pathways
297
towards the target products (Figure 5). Hence, the RECODE method and the derived
298
approach for pathway engineering will be of great benefit to a variety of applications
299
in the field of enzyme engineering, metabolic engineering and synthetic biology.
300
301
Bacterial Strains and Plasmids. E. coli DH5α and pBBRMCS2 (Stratagene, La Jolla,
302
CA, USA) were used for cloning and characterization of the library of SIR regulatory
303
element.55 The pMD19-T (Takara Ltd., Otsu, Japan) vector was used for cloning of
304
the lacZ-estC23 and gfp fragments. P. pastoris GS115 (his4) and the pPIC9K vector
305
(Invitrogen, Carlsbad, CA, USA) were used for expression of the hyaluronidase
306
mutant library. The pRSFduet-1 (Novagen Darmstadt, Germany) and E. coli BL21
307
(DE3) (Stratagene) were used as the vector and host, respectively, for pathway genes
308
expression of ALA production.
MATERIALS AND METHODS
15
ACS Paragon Plus Environment
Page 20 of 37
Page 21 of 37
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
309
Media and Culture Conditions. LB medium was used for cloning and cell culture.
310
LB plates supplemented with 100 µg mL-1 ampicillin, 0.1 mM IPTG, 20 µg mL−1
311
X-gal (Sigma-Aldrich, St. Louis, USA), and 1% (v/v) tributyrin (Merck, Darmstadt,
312
Germany) were used for characterizing the indicator system of lacZ-estC23 at 37 °C.
313
The secondary assay of E. coli libraries was carried out in shake flask with LB broth
314
at 37 °C with 200 rpm shaking. P. pastoris was selected on MD medium (0.34% YNB,
315
20 g L−1 glucose and 15 g L−1 agar) supplemented with 2 mg mL-1 G418. Shake flask
316
cultures of recombinant P. pastoris strains were carried out at 30 °C at 200 rpm with
317
buffered methanol minimal yeast medium (BMMY, 10 g L−1 yeast extract, 20 g L−1
318
peptone, 100 mM potassium phosphate, 0.34% YNB, 10 g L−1 (NH4)2SO4, 1%
319
methanol). For production of ALA, the recombinant E. coli BL21 (DE3) cells were
320
grown at 37 °C at 200 rpm with mineral basal medium (g L−1): 20.0 glucose, 2.0 yeast
321
extract, 16.0 (NH4)2SO4, 3.0 KH2PO4, 16.0 Na2HPO4•12H2O, 1.0 MgSO4•7H2O and
322
0.01 MnSO4•H2O, pH 7.0. When necessary, 50 µg mL−1 kanamycin and 0.1 mM
323
IPTG were supplemented.
324
Primer Design. All primers (BGI; Beijing, China) in this study are listed in
325
Supplementary Table S1. The lethal mutations for galactosidase encoding gene lacZ
326
and esterase encoding gene estC23 were designed by introduction of termination
327
codons into one site of lacZ and two sites of estC23 (Figure 2a). Specifically, the
328
bases GAA, TAC, and ATGA were replaced with the termination codons TAA, TAA,
329
and TGA, respectively. To evaluate the insertion and deletion capacity of the 16
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
330
RECODE method, the oligonucleotides that contain 12 nt (ACATGCCACTGG) and
331
9 nt (TACTGGCAC) fragments for insertion at 196 and 601 sites, and a 6 nt
332
(ACACAA) fragment for deletion at 607-612 site of gfp gene were designed,
333
respectively. For editing the SIR regulatory element, four degenerate oligonucleotides
334
were designed to cover the spacer fragments between the −35 and −10 boxes of the
335
corresponding four natural tandem promoters. For combinatorial evolution of the
336
leech hyaluronidase, mutation sequences instead of the inherent triplet code were
337
introduced into 10 candidate sites near the conserved amino acids, Specifically, the
338
triplet-codon was consciously avoided the introduction of the termination codons (for
339
example VNN; V = A, C, G; N = A, T, C, G). For combinatorial evolution of the
340
ALA pathway, multiple regions of the three pathway genes (hemA, hemL and hemF)73
341
were simultaneously modified with the highly degenerate triplet-code sequence (for
342
example VNN or NNN in oligonucleotides). The ribosome binding site (RBS) regions
343
were modified with degenerate RBS sequences (DDRRRRRDDDD; D = A, G, T; R =
344
A, G), and the spacer sequences between −35 and −10 boxes flanking the hemF
345
promoter were edited with the degenerate sequence (NNNNNNNNNNNNNNNNN).
346
RECODE Manipulation. All the synthetic mutagenic oligonucleotides and the
347
downstream anchor primers (DAP) were mixed with the same concentration and
348
phosphorylated. Specifically, phosphorylation was performed in 50-µL reactions
349
containing 300 pmol of a primer mixture, 1× T4 DNA ligase buffer (NEB, New
350
England Biolabs, USA), and 8 U polynucleotide kinase (NEB). The reaction was 17
ACS Paragon Plus Environment
Page 22 of 37
Page 23 of 37
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
351
incubated for 30 min at 37 °C and subsequently terminated by heating for 10 min at
352
75 °C.
353
The RECODE reaction was carried out in 50-µL system containing 0.1 µM of each
354
phosphorylated mutagenic oligonucleotide and the upstream anchor primer (UAP),
355
0.2 µM antisense primer , 0.01 pmol DNA template, 1 U Phusion DNA polymerase
356
(NEB), 5 U Ampligase thermostable DNA ligase (Epicentre, USA) and 1× optimized
357
RECODE reaction buffer (20 mM Tris-HCl, 25 mM KCl, 0.5 mM NAD, 2 mM
358
dNTPs, 5 mM Mg2+, 0.1% Triton X-100, pH 8.3). The one-step reaction system
359
(Figure 1b) was performed with the following PCR conditions: 2 min at 94 °C; 25
360
cycles of 30 s at 94 °C, 30 s at 50 °C, 2 min at 72 °C, and 3 min at 66 °C; a hold
361
period at 4 °C. In the two-step RECODE system (Figure 1a), twenty-five cycles were
362
programed in the 1st step PCR to ensure that the extended mutagenic primers were
363
fully annealed and to generate variants with different combinatorial mutations. After
364
purification, all the PCR products were applied as the templates in the 2nd step PCR.
365
Then the double-stranded products were generated with an antisense primer AP in a
366
new 50-µL PCR system containing the purified 1st step PCR products, 1 µL of
367
antisense primer and 25 µL of 2 × SuperPfu PCR MasterMix (Hangzhou Biosci Co.,
368
Ltd, China). The PCR reaction was performed with the following program: 2 min at
369
94 °C; then 3 cycles of 30 s at 94 °C, 30 s at 50 °C and 90 s at 72 °C; 5 min at 72 °C
370
and a hold period at 4 °C.
371
PCR products were run on 1% agarose gels containing EtBr at 90 V for 45 min. 18
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Page 24 of 37
372
Bands of the correct size were excised and purified with a gel extraction kit, according
373
to the manufacturer’s protocol. When applying plasmid DNA as templates, 1 µL DpnI
374
endonuclease (All endonucleases supplier, Fermentas, USA) was added in the PCR
375
products and incubated at 37 °C for 10 min to digest the template DNA.
376
Variants Library Construction and Screening. To obtain colony libraries,
377
double-stranded DNA variants produced by RECODE were cloned into vectors by the
378
enzymatic assembly DNA method.71 High-throughput screening of the library was
379
performed in 96-well microplates, and then selected variants of all the following
380
libraries were further examined in shake flasks. The lacZ-estC23 mutant library was
381
subcloned into the pMD19-T vector, and then transformed into DH5α. The library
382
was selected on LB plates supplemented with 100 µg mL-1 ampicillin, 0.1 mM IPTG,
383
20 µg mL−1 X-gal, and 1% tributyrin. The diversity of the mutant library was
384
characterized by phenotypic analysis and DNA sequencing. To construct the deletion
385
or insertion libraries of gfp gene by RECODE, the purified products were subcloned
386
into the pMD19-T vector, and then transformed into DH5α. The deletion or insertion
387
libraries were selected on LB plates supplemented with 100 µg mL-1 ampicillin,
388
respectively.
389
To construct the characterization library of the SIR variants, the gfp fragment was
390
firstly
391
EcoRI/BamHI
392
pBBR1MCS2-gfp. Then the SIR mutant library was assembled into the linear
subcloned
into
the
restriction
low-copy-number enzymes,
yielding
plasmid the
19
ACS Paragon Plus Environment
pBBR1MCS-2 recombinant
with
plasmid
Page 25 of 37
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
393
pBBR1MCS2-gfp via enzymatic one-step assembly to control expression of the
394
reporter gene gfp. The recombinant plasmids carrying the SIR library were
395
transformed into E. coli DH5α competent cells and cultured on LB plates
396
supplemented with 50 µg mL−1 kanamycin. Screening was performed by
397
high-throughput batch-cultures in 96-well microtiter plates. Transformants were
398
picked into microtiter wells with a Qbot robotic colony picker (Genetix, Hampshire,
399
UK) and grown in LB broth at 37 °C at 200 rpm for 24 h. To analyze the activity of
400
the promoter library, cells were collected by centrifugation (1, 3000 g, 5 min) and the
401
cell pellet washed three times with sterile water. The fluorescence intensity value of
402
per OD600 cells was measured by excitation at 490 nm and emission at 530 nm using
403
an Ultra Multifunctional Microplate Reader (Tecan, Durham, NC, USA). Then, some
404
variants were further verified in shake flask culture.
405
The hyaluronidase gene H6LHyal variants were inserted into pPIC9K under the
406
control of the AOX1 promoter by enzymatic assembly. The recombinant plasmids
407
were linearized with SalI and transformed into P. pastoris GS115 competent cells by
408
electroporation. The cells were cultured on MD plates supplemented with 2 mg mL−1
409
G418. Variants were picked into microtiter wells and cultured in BMMY broth
410
containing methanol (1% v/v) at 30 °C at 200 rpm for 72 h. The cultured supernatants
411
of variants were collected and diluted for the enzymatic assay in 96-well microtiter
412
plates. The selected variants were further cultured in shake flask rocking at 200 rpm at
413
30 °C according to the previous manipulation of shake flask culture.56 The enzymatic 20
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Page 26 of 37
414
reaction containing a 1.6 mg mL−1 of the substrate (hyaluronic acid or heparin,
415
Sigma-Aldrich, USA) and an appropriate amount of enzyme in 50 mM citrate buffer
416
(pH 5.5) was incubated at 38 °C for 10 min. The reaction was terminated by heat
417
inactivation then examined using the DNS method. Colorimetric reactions were
418
measured by using the Ultra Multifunctional Microplate Reader at 540 nm. One unit
419
of hyaluronidase activity was defined as equal to the reducing power of glucuronic
420
acid (glucose equivalents in micrograms) liberated per hour from HA at 38 °C and pH
421
5.5. One unit of heparanase activity was defined as equal to the reducing power of
422
oligosaccharides (glucose equivalents in milligrams) liberated per hour from HA at
423
30 °C and pH 5.5.
424
The
three
separated
mutant
libraries
(RBS/hemL,
RBS/hemA
and
425
promoter/RBS/hemF) of the ALA pathway were assembled into pRSFduet-1 via
426
enzymatic assembly and then transformed into BL21 (DE3) competent cells. To
427
analyze the ALA yields from the variants, high-throughput screening was carried out
428
in microtiter wells with mineral basal medium broth at 37 °C at 200 rpm for 30 h. The
429
candidate variants were further verified in shake flask cultures. The production of
430
ALA was analyzed by using the modified Ehrlich’s reagent.74
431
■ ASSOCIATED CONTENT
432
S Supporting Information ○
433
Supplementary data including figures, tables and methods associated with this article
434
can be referenced in the supplementary materials. This material is available free of 21
ACS Paragon Plus Environment
Page 27 of 37
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
435
charge via the Internet at http://pubs.acs.org.
436
AUTHOR INFORMATION
437
Corresponding Author
438
Zhen Kang, *E-mail:
[email protected] 439
Notes
440
The authors declare no competing financial interest.
441
■ ACKNOWLEDGMENTS
442
This work was financially supported by the Major State Basic Research Development
443
Program of China (973 Program, 2014CB745103), the Natural Science Foundation of
444
Jiangsu Province (BK20141107), a grant from the Key Technologies R & D Program
445
of Jiangsu Province, China (BE2014607); Program for Changjiang Scholars and
446
Innovative Research Team in University (No. IRT_15R26); China Postdoctoral
447
Science Foundation funded project (125960).
448
■ REFERENCES
449 450 451 452 453 454 455 456
(1) Bornscheuer, U., Huisman, G., Kazlauskas, R., Lutz, S., Moore, J. and Robins, K. (2012) Engineering the third wave of biocatalysis. Nature 485, 185–194. (2) Romero, P. A. and Arnold, F. H. (2009) Exploring protein fitness landscapes by directed evolution. Nat. Rev. Mol. Cell Biol. 10, 866–876. (3) Dalby, P. A. (2011) Strategy and success for the directed evolution of enzymes. Curr. Opin. Struct. Biol. 21, 473–480. (4) Johannes, T. W. and Zhao, H. (2006) Directed evolution of enzymes and biosynthetic pathways. Curr. Opin. Microbiol. 9, 261–267. 22
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
457 458 459 460 461 462
(5) Lu, T. K., Khalil, A. S. and Collins, J. J. (2009) Next-generation synthetic gene networks. Nat. Biotechnol. 27, 1139–1150. (6) Carr, P. A. and Church, G. M. (2009) Genome engineering. Nat. Biotechnol. 27, 1151–1162. (7) Stemmer, W. P. (1994) Rapid evolution of a protein in vitro by DNA shuffling. Nature 370, 389–391.
463
(8) Zhao, H., Giver, L., Shao, Z., Affholter, J. A. and Arnold, F. H. (1998) Molecular
464
evolution by staggered extension process (StEP) in vitro recombination. Nat.
465
Biotechnol. 16, 258–261.
466
(9) Shao, Z., Zhao, H., Giver, L. and Arnold, F. H. (1998) Random-priming in vitro
467
recombination: an effective tool for directed evolution. Nucleic Acids Res. 26, 681–
468
683.
469
(10) Coco, W. M., Levinson, W. E., Crist, M. J., Hektor, H. J., Darzins, A., Pienkos,
470
P. T., Squires, C. H. and Monticello, D. J. (2001) DNA shuffling method for
471
generating highly recombined genes and evolved enzymes. Nat. Biotechnol. 19, 354–
472
359.
473
(11) Ness, J. E., Kim, S., Gottman, A., Pak, R., Krebber, A., Borchert, T. V.,
474
Govindarajan, S., Mundorff, E. C. and Minshull, J. (2002) Synthetic shuffling
475
expands functional protein diversity by allowing amino acids to recombine
476
independently. Nat. Biotechnol. 20, 1251–1255.
477
(12) Ostermeier, M., Shim, J. H. and Benkovic, S. J. (1999) A combinatorial
478
approach to hybrid enzymes independent of DNA homology. Nat. Biotechnol. 17,
479
1205–1209.
480 481 482 483
(13) Sieber, V., Martinez, C. A. and Arnold, F. H. (2001) Libraries of hybrid proteins from distantly related sequences. Nat. Biotechnol. 19, 456–460. (14) Packer, M. S. and Liu, D. R. (2015) Methods for the directed evolution of proteins. Nat. Rev. Genet. 16, 379–394.
23
ACS Paragon Plus Environment
Page 28 of 37
Page 29 of 37
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
484
(15) Kang, Z., Zhang, J., Jin, P. and Yang, S. (2015) Directed evolution combined
485
with synthetic biology strategies expedite semi-rational engineering of genes and
486
genomes. Bioengineered 6, 136–140.
487
(16) Coussement, P., Maertens, J., Beauprez, J., Van Bellegem, W. and De Mey, M.
488
(2014) One step DNA assembly for combinatorial metabolic engineering. Metab. Eng.
489
23, 70–77.
490
(17) Herman, A. and Tawfik, D. S. (2007) Incorporating Synthetic Oligonucleotides
491
via Gene Reassembly (ISOR): a versatile tool for generating targeted libraries.
492
Protein Eng. Des. Sel. 20, 219–226.
493
(18) Wang, H. H., Isaacs, F. J., Carr, P. A., Sun, Z. Z., Xu, G., Forest, C. R. and
494
Church, G. M. (2009) Programming cells by multiplex genome engineering and
495
accelerated evolution. Nature 460, 894–898.
496
(19) Li, Y., Gu, Q., Lin, Z., Wang, Z., Chen, T. and Zhao, X. (2013) Multiplex
497
iterative plasmid engineering for combinatorial optimization of metabolic pathways
498
and diversification of protein coding sequences. ACS Synth. Biol. 2, 651–661.
499
(20) Stemmer, W. P. (1994) DNA shuffling by random fragmentation and
500
reassembly: in vitro recombination for molecular evolution. Proc. Natl. Acad. Sci.
501
U.S.A. 91, 10747–10751.
502
(21) Crameri, A. and Stemmer, W. P. (1995) Combinatorial multiple cassette
503
mutagenesis creates all the permutations of mutant and wild-type sequences.
504
Biotechniques 18, 194–196.
505
(22) Reetz, M. T., Wilensek, S., Zha, D. and Jaeger, K. E. (2001) Directed evolution
506
of an enantioselective enzyme through combinatorial multiple-cassette mutagenesis.
507
Angew. Chem. Int. Ed. 40, 3589–3591.
508 509
(23) Bloom, J. D. (2014) An experimentally determined evolutionary model dramatically improves phylogenetic fit. Mol. Biol. Evol. 31, 1956–1978.
24
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
510
(24) Hidalgo, A., Schließmann, A., Molina, R., Hermoso, J. and Bornscheuer, U. T.
511
(2008) A one-pot, simple methodology for cassette randomisation and recombination
512
for focused directed evolution. Protein Eng. Des. Sel. 21, 567–576.
513 514
(25) Reetz, M. T. and Carballeira, J. D. (2007) Iterative saturation mutagenesis (ISM) for rapid directed evolution of functional enzymes. Nat. Protoc. 2, 891–903.
515
(26) Jain, P. C. and Varadarajan, R. (2014) A rapid, efficient, and economical
516
inverse polymerase chain reaction-based method for generating a site saturation
517
mutant library. Anal. Biochem. 449, 90–98.
518 519 520 521 522 523 524 525
(27) Parker, A. S., Griswold, K. E. and Bailey-Kellogg, C. (2011) Optimization of combinatorial mutagenesis. J. Comput. Biol. 18, 1743–1756. (28) Kaur, J. and Sharma, R. (2006) Directed evolution: an approach to engineer enzymes. Crit. Rev. Biotechnol. 26, 165–199. (29) Seyfang, A. and Jin, J. H. (2004) Multiple site-directed mutagenesis of more than 10 sites simultaneously and in a single round. Anal. Biochem. 324, 285–291. (30) Young, L. and Dong, Q. (2003) TAMS technology for simple and efficient in vitro site-directed mutagenesis and mutant screening. Nucleic Acids Res. 31, e11.
526
(31) Sawano, A. and Miyawaki, A. (2000) Directed evolution of green fluorescent
527
protein by a new versatile PCR strategy for site-directed and semi-random
528
mutagenesis. Nucleic Acids Res. 28, e78.
529 530 531 532
(32) Firnberg, E. and Ostermeier, M. (2012) PFunkel: efficient, expansive, user-defined mutagenesis. PLoS One 7, e52031. (33) Kunkel, T. A. (1985) Rapid and efficient site-specific mutagenesis without phenotypic selection. Proc. Natl. Acad. Sci. U.S.A. 82, 488–492.
533
(34) Weiss, G. A., Watanabe, C. K., Zhong, A., Goddard, A. and Sidhu, S. S. (2000)
534
Rapid mapping of protein functional epitopes by combinatorial alanine scanning.
535
Proc. Natl. Acad. Sci. U.S.A. 97, 8950–8954.
536 537
(35) Dougherty, M. J. and Arnold, F. H. (2009) Directed evolution: new parts and optimized function. Curr. Opin. Biotechnol. 20, 486–491. 25
ACS Paragon Plus Environment
Page 30 of 37
Page 31 of 37
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
538 539
(36) Cameron, D. E., Bashor, C. J. and Collins, J. J. (2014) A brief history of synthetic biology. Nat. Rev. Microbiol. 12, 381–390.
540
(37) Smanski, M. J., Bhatia, S., Zhao, D., Park, Y., L, B. A. W., Giannoukos, G.,
541
Ciulla, D., Busby, M., Calderon, J., Nicol, R., Gordon, D. B., Densmore, D. and Voigt,
542
C. A. (2014) Functional optimization of gene clusters by combinatorial design and
543
assembly. Nat. Biotechnol. 32, 1241–1249.
544
(38) Alper, H., Fischer, C., Nevoigt, E. and Stephanopoulos, G. (2005) Tuning
545
genetic control through promoter engineering. Proc. Natl. Acad. Sci. U.S.A. 102,
546
12678–12683.
547
(39) Salis, H. M., Mirsky, E. A. and Voigt, C. A. (2009) Automated design of
548
synthetic ribosome binding sites to control protein expression. Nat. Biotechnol. 27,
549
946–950.
550
(40) Dueber, J. E., Wu, G. C., Malmirchegini, G. R., Moon, T. S., Petzold, C. J.,
551
Ullal, A. V., Prather, K. L. and Keasling, J. D. (2009) Synthetic protein scaffolds
552
provide modular control over metabolic flux. Nat. Biotechnol. 27, 753–759.
553
(41) Pfleger, B. F., Pitera, D. J., Smolke, C. D. and Keasling, J. D. (2006)
554
Combinatorial engineering of intergenic regions in operons tunes expression of
555
multiple genes. Nat. Biotechnol. 24, 1027–1032.
556
(42) Zhang, F., Carothers, J. M. and Keasling, J. D. (2012) Design of a dynamic
557
sensor-regulator system for production of chemicals and fuels derived from fatty acids.
558
Nat. Biotechnol. 30, 354–359.
559
(43) Ajikumar, P. K., Xiao, W. H., Tyo, K. E., Wang, Y., Simeon, F., Leonard, E.,
560
Mucha, O., Phon, T. H., Pfeifer, B. and Stephanopoulos, G. (2010) Isoprenoid
561
pathway optimization for Taxol precursor overproduction in Escherichia coli. Science
562
330, 70–74.
563
(44) Xu, P., Gu, Q., Wang, W., Wong, L., Bower, A. G., Collins, C. H. and Koffas,
564
M. A. (2013) Modular optimization of multi-gene pathways for fatty acids production
565
in E. coli. Nat. Commun. 4, 1409. 26
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
566
(45) Du, J., Yuan, Y., Si, T., Lian, J. and Zhao, H. (2012) Customized optimization
567
of metabolic pathways by combinatorial transcriptional engineering. Nucleic Acids
568
Res. 40, e142.
569
(46) Na, D., Yoo, S. M., Chung, H., Park, H., Park, J. H. and Lee, S. Y. (2013)
570
Metabolic engineering of Escherichia coli using synthetic small regulatory RNAs.
571
Nat. Biotechnol. 31, 170–174.
572
(47) Kang, Z., Wang, X., Li, Y., Wang, Q. and Qi, Q. (2012) Small RNA RyhB as a
573
potential tool used for metabolic engineering in Escherichia coli. Biotechnol. Lett. 34,
574
527–531.
575 576 577 578 579 580 581 582 583 584
(48) Lee, S. Y. and Kim, H. U. (2015) Systems strategies for developing industrial microbial strains. Nat. Biotechnol. 33, 1061–1072. (49) Dai, Z. and Nielsen, J. (2015) Advancing metabolic engineering through systems biology of industrial microorganisms. Curr. Opin. Biotechnol. 36, 8–15. (50) Otte, K. B. and Hauer, B. (2015) Enzyme engineering in the context of novel pathways and products. Curr. Opin. Biotechnol. 35C, 16–22. (51) Keasling, J. D. (2012) Synthetic biology and the development of tools for metabolic engineering. Metab. Eng. 14, 189–195. (52) Stephanopoulos, G. (2012) Synthetic biology and metabolic engineering. ACS Synth. Biol. 1, 514–525.
585
(53) Kok, S. d., Stanton, L. H., Slaby, T., Durot, M., Holmes, V. F., Patel, K. G.,
586
Platt, D., Shapland, E. B., Serber, Z. and Dean, J. (2014) Rapid and reliable DNA
587
assembly via ligase cycling reaction. ACS Synth. Biol. 3, 97–106.
588 589
(54) Michael, S. F. (1994) Mutagenesis by incorporation of a phosphorylated oligo during PCR amplification. Biotechniques 16, 410–412.
590
(55) Kang, Z., Wang, Q., Zhang, H. and Qi, Q. (2008) Construction of a
591
stress-induced system in Escherichia coli for efficient polyhydroxyalkanoates
592
production. Appl. Microbiol. Biotechnol. 79, 203–208.
27
ACS Paragon Plus Environment
Page 32 of 37
Page 33 of 37
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
593
(56) Jin, P., Kang, Z., Zhang, N., Du, G. and Chen, J. (2014) High-yield novel leech
594
hyaluronidase to expedite the preparation of specific hyaluronan oligomers. Sci. Rep.
595
4, 4471.
596
(57) Jin, P., Pei, X., Du, P., Yin, X., Xiong, X., Wu, H., Zhou, X. and Wang, Q.
597
(2012) Overexpression and characterization of a new organic solvent-tolerant esterase
598
derived from soil metagenomic DNA. Bioresour. Technol. 116, 234–240.
599
(58) Holland, P. M., Abramson, R. D., Watson, R. and Gelfand, D. H. (1991)
600
Detection of specific polymerase chain reaction product by utilizing the
601
5'-3'exonuclease activity of Thermus aquaticus DNA polymerase. Proc. Natl. Acad.
602
Sci. U.S.A. 88, 7276–7280.
603
(59) Tyo, K. E., Alper, H. S. and Stephanopoulos, G. N. (2007) Expanding the
604
metabolic engineering toolbox: more options to engineer cells. Trends Biotechnol. 25,
605
132–137.
606
(60) Alper, H., Moxley, J., Nevoigt, E., Fink, G. R. and Stephanopoulos, G. (2006)
607
Engineering yeast transcription machinery for improved ethanol tolerance and
608
production. Science 314, 1565–1568.
609
(61) Zhang, J., Kang, Z., Chen, J. and Du, G. (2015) Optimization of the heme
610
biosynthesis pathway for the production of 5-aminolevulinic acid in Escherichia coli.
611
Sci. Rep. 5, 1452–1456.
612 613
(62) Denard, C. A., Ren, H. and Zhao, H. (2015) Improving and repurposing biocatalysts via directed evolution. Curr. Opin. Chem. Biol. 25, 55–64.
614
(63) Crameri, A., Raillard, S. A., Bermudez, E. and Stemmer, W. P. (1998) DNA
615
shuffling of a family of genes from diverse species accelerates directed evolution.
616
Nature 391, 288–291.
617
(64) Lee, J. W., Na, D., Park, J. M., Lee, J., Choi, S. and Lee, S. Y. (2012) Systems
618
metabolic engineering of microorganisms for natural and non-natural chemicals. Nat.
619
Chem. Biol. 8, 536–546.
28
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
620 621
(65) Pfleger, B. F., Gossing, M. and Nielsen, J. (2015) Metabolic engineering strategies for microbial synthesis of oleochemicals. Metab. Eng. 29, 1–11.
622
(66) Brockman, I. M. and Prather, K. L. J. (2015) Dynamic metabolic engineering:
623
New strategies for developing responsive cell factories. Biotechnol. J., DOI:
624
10.1002/biot.201400422.
625 626 627 628
(67) Orelle, C., Carlson, E. D., Szal, T., Florin, T., Jewett, M. C. and Mankin, A. S. (2015) Protein synthesis by ribosomes with tethered subunits. Nature 524, 119–124. (68) Soye, B. J. D., Patel, J. R., Isaacs, F. J. and Jewett, M. C. (2015) Repurposing the translation apparatus for synthetic biology. Curr. Opin. Chem. Biol. 28, 83–90.
629
(69) Jacobs, T. M., Yumerefendi, H., Kuhlman, B. and Leaver-Fay, A. (2015)
630
SwiftLib: rapid degenerate-codon-library optimization through dynamic programming.
631
Nucleic Acids Res. 43, e34.
632 633
(70) Merryman, C. and Gibson, D. G. (2012) Methods and applications for assembling large DNA constructs. Metab. Eng. 14, 196–204.
634
(71) Gibson, D. G., Young, L., Chuang, R. Y., Venter, J. C., Hutchison, C. A. and
635
Smith, H. O. (2009) Enzymatic assembly of DNA molecules up to several hundred
636
kilobases. Nat. Methods 6, 343–345.
637 638
(72) Shao, Z., Zhao, H. and Zhao, H. (2009) DNA assembler, an in vivo genetic method for rapid construction of biochemical pathways. Nucleic Acids Res. 37, e16.
639
(73) Kang, Z., Wang, Y., Gu, P., Wang, Q. and Qi, Q. (2011) Engineering
640
Escherichia coli for efficient production of 5-aminolevulinic acid from glucose.
641
Metab. Eng. 13, 492–498.
642 643
(74) Burnham, B. F. (1970) δ-Aminolevulinicacidsynthase (Rhodopseudomonas sphaeroides). Methods Enzymol. 17, 195–204.
644 645 646
Figures legends
29
ACS Paragon Plus Environment
Page 34 of 37
Page 35 of 37
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
647
Figure 1. Schematic diagram of the RECODE method with two-step (a) or one-step
648
(b) PCR. Phosphorylated mutagenic oligonucleotides, UAP and DAP were annealed
649
to template DNA after denaturation. All gaps between the primers were filled and
650
ligated with thermostable DNA polymerase and ligation. For two-step PCR,
651
single-stranded mutant products of the 1st step PCR were purified, and then used as
652
the template to synthesize double-stranded variants in the 2nd step PCR reaction with
653
AP primer which specifically matches DAP. For one-step PCR, the AP primer was
654
added in the reaction to selectively synthesize the complementary chains of the
655
mutant strand. The concentration of AP was set with 0.2 µM which was 2 times of
656
that of the DAP primer.
657
Figure 2. Optimization and application of RECODE system. (a) Termination codons
658
were introduced into galactosidase (lacZ) and esterase (estC23) with substitution or
659
deletion of the native codons by RECODE. (b) The phenotype of the recombinants
660
wild-type galactosidase (blue colonies) and esterase (clear halos). LB agars containing
661
tributyrin (esterase substrate) and x-gal (galactosidase substrate) were used. (c) and (d)
662
showed the phenotypes of variants by RECODE with two-step and one-step PCR
663
systems, respectively. (e) Combinatorial engineering of SIR fragment of the rpoS
664
gene. Four mutant spacers between -35 and -10 boxes of SIR segments were inserted
665
upstream of the gfp gene to control the expression of GFP. (f) The transcriptional
666
strength of the SIR variants with one-round evolution by RECODE (with GFP protein
667
as a reporter). (g) Alignment of the nucleic acid sequences of the SIR variants. 30
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
668
Figure 3. Combinatorial evolution of the leech hyaluronidase by RECODE. (a)
669
Illustration of the evolution and screening process. The activities of the wild-type
670
LHAase and its variants towards heparin (top) and hyaluronan (bottom) were
671
determined in 96-well microtiter plates. (b) and (c) are the enzyme activities of the
672
variants (culture supernatants) towards hyaluronan and the non-natural substrate
673
heparin, respectively. (d) Diverse distribution of the mutant amino acids (red) in the
674
constructed variants. All error bars indicate ± SD, n = 3.
675
Figure 4. Combinatorial engineering the regulatory elements and pathway enzymes to
676
optimize synthetic pathway towards ALA. (a) The heme biosynthesis pathway via
677
ALA in E. coli. The arrows in red represent a positive relationship between ALA
678
accumulation and overexpression of the enzymes HemA, HemL and HemF. (b)
679
Illustration of combinatorial engineering of the promoters, ribosome binding sites and
680
pathway genes. (c) Evaluation of the variants in 96-well microtiter plates with the
681
parent E. coli LAF strain as control (red arrow). (d) ALA accumulation of the variants
682
in shake flasks. All error bars indicate ± SD, n = 3.
683
Figure 5. RECODE generalized workflow for rapid construction of novel regulatory
684
elements and enzymes, and engineering of optimized synthetic pathways to facilitate
685
the development of in vivo and in vitro synthetic biology.
686
Supplementary data
687
Supplementary data associated with this article can be found in the online version. 31
ACS Paragon Plus Environment
Page 36 of 37
Page 37 of 37
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
688
32
ACS Paragon Plus Environment