Combinatorial Evolution of Enzymes and Synthetic Pathways Using

Jan 11, 2016 - *E-mail: [email protected]. ... Here, we present a versatile evolution method termed “rapidly efficient combinatorial oligonucleo...
0 downloads 0 Views 2MB Size
Subscriber access provided by UNIV OF CALIFORNIA SAN DIEGO LIBRARIES

Article

Combinatorial Evolution of Enzymes and Synthetic Pathways Using One-Step PCR Peng Jin, Zhen Kang, Junli Zhang, Linpei Zhang, Guocheng Du, and Jian Chen ACS Synth. Biol., Just Accepted Manuscript • DOI: 10.1021/acssynbio.5b00240 • Publication Date (Web): 11 Jan 2016 Downloaded from http://pubs.acs.org on January 16, 2016

Just Accepted “Just Accepted” manuscripts have been peer-reviewed and accepted for publication. They are posted online prior to technical editing, formatting for publication and author proofing. The American Chemical Society provides “Just Accepted” as a free service to the research community to expedite the dissemination of scientific material as soon as possible after acceptance. “Just Accepted” manuscripts appear in full in PDF format accompanied by an HTML abstract. “Just Accepted” manuscripts have been fully peer reviewed, but should not be considered the official version of record. They are accessible to all readers and citable by the Digital Object Identifier (DOI®). “Just Accepted” is an optional service offered to authors. Therefore, the “Just Accepted” Web site may not include all articles that will be published in the journal. After a manuscript is technically edited and formatted, it will be removed from the “Just Accepted” Web site and published as an ASAP article. Note that technical editing may introduce minor changes to the manuscript text and/or graphics which could affect content, and all legal disclaimers and ethical guidelines that apply to the journal pertain. ACS cannot be held responsible for errors or consequences arising from the use of information contained in these “Just Accepted” manuscripts.

ACS Synthetic Biology is published by the American Chemical Society. 1155 Sixteenth Street N.W., Washington, DC 20036 Published by American Chemical Society. Copyright © American Chemical Society. However, no copyright claim is made to original U.S. Government works, or works produced by employees of any Commonwealth realm Crown government in the course of their duties.

Page 1 of 37

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

Figure 1. Schematic diagram of the RECODE method with two-step (a) or one-step (b) PCR. Phosphorylated mutagenic oligonucleotides, UAP and DAP were annealed to template DNA after denaturation. All gaps between the primers were filled and ligated with thermostable DNA polymerase and ligation. For two-step PCR, single-stranded mutant products of the 1st step PCR were purified, and then used as the template to synthesize double-stranded variants in the 2nd step PCR reaction with AP primer which specifically matches DAP. For one-step PCR, the AP primer was added in the reaction to selectively synthesize the complementary chains of the mutant strand. The concentration of AP was set with 0.2 µM which was 2 times of that of the DAP primer. 352x139mm (300 x 300 DPI)

ACS Paragon Plus Environment

ACS Synthetic Biology

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Figure 2. Optimization and application of RECODE system. (a) Termination codons were introduced into galactosidase (lacZ) and esterase (estC23) with substitution or deletion of the native codons by RECODE. (b) The phenotype of the recombinants wild-type galactosidase (blue colonies) and esterase (clear halos). LB agars containing tributyrin (esterase substrate) and x-gal (galactosidase substrate) were used. (c) and (d) showed the phenotypes of variants by RECODE with two-step and one-step PCR systems, respectively. (e) Combinatorial engineering of SIR fragment of the rpoS gene. Four mutant spacers between -35 and -10 boxes of SIR segments were inserted upstream of the gfp gene to control the expression of GFP. (f) The transcriptional strength of the SIR variants with one-round evolution by RECODE (with GFP protein as a reporter). (g) Alignment of the nucleic acid sequences of the SIR variants. 160x175mm (300 x 300 DPI)

ACS Paragon Plus Environment

Page 2 of 37

Page 3 of 37

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

Figure 3. Combinatorial evolution of the leech hyaluronidase by RECODE. (a) Illustration of the evolution and screening process. The activities of the wild-type LHAase and its variants towards heparin (top) and hyaluronan (bottom) were determined in 96-well microtiter plates. (b) and (c) are the enzyme activities of the variants (culture supernatants) towards hyaluronan and the non-natural substrate heparin, respectively. (d) Diverse distribution of the mutant amino acids (red) in the constructed variants. All error bars indicate ± SD, n = 3. 203x182mm (300 x 300 DPI)

ACS Paragon Plus Environment

ACS Synthetic Biology

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Figure 4. Combinatorial engineering the regulatory elements and pathway enzymes to optimize synthetic pathway towards ALA. (a) The heme biosynthesis pathway via ALA in E. coli. The arrows in red represent a positive relationship between ALA accumulation and overexpression of the enzymes HemA, HemL and HemF. (b) Illustration of combinatorial engineering of the promoters, ribosome binding sites and pathway genes. (c) Evaluation of the variants in 96-well microtiter plates with the parent E. coli LAF strain as control (red arrow). (d) ALA accumulation of the variants in shake flasks. All error bars indicate ± SD, n = 3. 264x188mm (300 x 300 DPI)

ACS Paragon Plus Environment

Page 4 of 37

Page 5 of 37

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

Figure 5. RECODE generalized workflow for rapid construction of novel regulatory elements and enzymes, and engineering of optimized synthetic pathways to facilitate the development of in vivo and in vitro synthetic biology. 306x206mm (300 x 300 DPI)

ACS Paragon Plus Environment

ACS Synthetic Biology

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

1

Combinatorial Evolution of Enzymes and Synthetic Pathways Using

2

One-Step PCR

3

Peng Jin†,‡, Zhen Kang†,‡,§,*, Junli Zhang†,‡, Linpei Zhang†,‡, Guocheng Du†,‡,§ and

4

Jian Chen†,‡

5



6

Biotechnology, Jiangnan University, Wuxi 214122, China

7



8

Wuxi, Jiangsu 214122, China

9

§

The Key Laboratory of Industrial Biotechnology, Ministry of Education, School of

Synergetic Innovation Center of Food Safety and Nutrition, Jiangnan University,

The Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of

10

Education, School of Biotechnology, Jiangnan University, Wuxi 214122, China

11

S Supporting Information ○

12 13

ABSTRACT: DNA engineering is the fundamental motive driving the rapid

14

development of modern biotechnology. Here, we present a versatile evolution method

15

termed “Rapidly Efficient Combinatorial Oligonucleotides for Directed Evolution”

16

(RECODE) for rapidly introducing multiple combinatorial mutations to the target

17

DNA by combined action of a thermostable high-fidelity DNA polymerase and a

18

thermostable DNA Ligase in one reaction system. By applying this method, we

19

rapidly constructed a variant library of the rpoS promoters (with activity of 8% to

20

460%), generated a novel heparinase from the highly specific leech hyaluronidase

21

(with more than 30 mutant residues) and optimized the heme biosynthetic pathway by 1

ACS Paragon Plus Environment

Page 6 of 37

Page 7 of 37

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

22

combinatorial evolution of regulatory elements and pathway enzymes (2500±120 mg

23

L-1 with 20-fold increase). The simple RECODE method enabled researchers the

24

unparalleled ability to efficiently create diverse mutant libraries for rapid evolution

25

and optimization of enzymes and synthetic pathways.

26 27

KEYWORDS: Combinatorial mutagenesis; single-stranded DNA oligonucleotides;

28

directed evolution; pathway optimization; synthetic biology

29 30 31

Enzymes have attracted much attention and study in the fields of metabolic

32

engineering and synthetic biology because of their unique properties of accepting

33

complex substrates and high selectivity and specificity. In particular, enzymes have

34

been widely used for synthesis of fine chemicals, complex pharmaceuticals,

35

agrochemicals, bulk chemicals and biofuels.1 However, due to limited sources and

36

performance of natural enzymes (for instance stability, substrate specificity, or

37

enantioselectivity), construction of novel robust enzymes with desired properties2-4 2

ACS Paragon Plus Environment

ACS Synthetic Biology

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

38

has always been a subject of interest. However, it is not straightforward to artificially

39

design and synthesize new interchangeable parts, modules or systems with specific

40

phenotypes owing to our limited understanding of the relationship between sequence

41

and function.5,6 Thus, many mutagenesis protocols including PCR based (such as

42

DNA shuffling,7 StEP,8 RPR,9 RACHITT10 and Synthetic shuffling11) and non-PCR

43

based (such as ITCHY12 and SHIPREC13) have been developed and applied.

44

Especially in recent years, application DNA assembly technologies have greatly

45

advanced significant progress in enzyme directed evolution.14-17

46

Several single-stranded DNA (ssDNA) mutagenesis technologies have been

47

developed to for in vivo genome engineering and in vitro enzyme evolution. An

48

ssDNA mediated genome editing approach has been regarded as one of the most

49

potential biotechnologies,18,19 but restricted host strains and time-consuming repetitive

50

electroporation make it inapplicable to rapidly construct mutant libraries for in vivo

51

enzyme evolution. In contrast, although mutagenic ssDNA oligonucleotides have

52

been introduced for improving in vitro DNA shuffling methods17,20-22 and

53

PCR-mediated site-mutagenesis approaches,23-27 the operation process was still

54

comparatively complicated (for instance, the requirement of preparation of substrate

55

and digestion with DNase I) and the generation efficiency of positive mutants was

56

still not satisfied.28 Thus, to enhance mutation efficiency and to simplify the

57

mutagenesis process, several ssDNA mediated PCR methods have also been

58

developed for multiple site-directed mutagenesis.29-31 However, the requirement of 3

ACS Paragon Plus Environment

Page 8 of 37

Page 9 of 37

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

59

preparation of linear or circular ssDNAs as templates restricts its practical

60

applications. Moreover, the employment of the room-temperature T4 DNA

61

polymerase and ligase (which merely allows one-time of annealing, extension and

62

ligation) resulted in rare products and blocked its potential applications in mutant

63

library construction.29,30 Recently, a user-defined mutagenesis method PFunkel32 that

64

based on the Kunkel mutagenesis protocols33,34 has been developed and applied for

65

repeated introduction of designed site-directed mutations by recruiting the

66

thermostable Pfu DNA polymerase and Taq DNA ligase, while the dependence of

67

uracil-containing DNA plasmid template still confines its potential applications. Thus,

68

how to fully utilize the advantages of synthetic oligonucleotides to efficiently

69

introduce diversity at specified positions while controlling mutation frequency17 in

70

small, multi-combinatorial and high-quality mutant libraries for high-throughput

71

screening is still a key issue to be solved.1,4,5,35

72

With the rapid development of synthetic biology,36 a wide range of artificial

73

regulatory elements and biomolecular systems have been engineered for various

74

useful tasks.5,37 In the past few decades, many sound strategies and tools including

75

promoter library engineering,38 ribosome binding site engineering,39 modular protein

76

scaffolds,40 tunable intergenic regions,41 dynamic sensor-regulators,42 multivariate

77

modular pathway engineering,43,44 combinatorial transcriptional engineering45 and

78

regulation with small RNAs46,47 and systems metabolic engineering48,49 have been

79

developed for optimizing synthetic metabolic pathways. However, it remains a 4

ACS Paragon Plus Environment

ACS Synthetic Biology

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

80

challenge to construct balanced synthetic pathways by primarily depending on

81

regulation at transcriptional, post-transcriptional or translational levels,50-52 since these

82

approaches often ignore the inherent shortcoming of pathway enzymes (for instance,

83

stability, fitness, and inhibition by allosteric control with metabolites).2 In fact, tuning

84

and optimization of enzymes are both important concepts for improving the overall

85

synthetic pathway performance.50

86

Here, inspired by the scarless DNA assembly method53 via ligase cycling reaction

87

and the ssDNA-based mutagenesis studies,29,30,32,54 we developed a simple, rapid,

88

efficient, and powerful in vitro evolution method termed “Rapidly Efficient

89

Combinatorial Oligonucleotides for Directed Evolution” (RECODE) (Figure 1). By

90

applying this RECODE method with one- or two-step PCR, the strength of the

91

regulatory element of the rpoS gene55 was increased 5-fold while the leech

92

hyaluronidase LHAase56 was endowed with both activities towards hyaluronan and

93

heparin. Concurrently, a novel strategy of combinatorial engineering of regulatory

94

elements and enzymes for optimizing the synthetic pathway was developed based on

95

this RECODE method. As a result, the heme biosynthetic pathway was optimized

96

with a significant increase in the production of 5-aminolevulinic acid (ALA, 20-fold).

97

■ RESULTS AND DISCUSSION

98

The Basic Principle and Feature of the RECODE Method. The RECODE method

99

is dependent on using thermostable DNA polymerase and ligase to achieve

100

combinatorial mutations at multiple sites (Figure 1). The single-stranded 5

ACS Paragon Plus Environment

Page 10 of 37

Page 11 of 37

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

101

oligonucleotides containing various desirable mutations were designed according to

102

the target fragments. After repeated PCR-based thermal cycling, the phosphorylated

103

single-stranded oligonucleotides annealed to the parent template DNA and extended

104

to the adjacent oligonucleotide position by DNA polymerase. Then the adjacent 5'

105

phosphate and 3' hydroxyl termini were ligated by Taq DNA ligase to form

106

completely

107

denaturation-annealing-extension-ligation temperature cycles, the DNA fragments

108

with combinatorial mutations were generated in one-round evolution. Specifically, the

109

upstream anchor primer (UAP), the downstream anchor primer (DAP) and the

110

antisense primer (AP) with specific structures29 were designed and exclusively

111

applied to amplify the variant DNA, which would be in favor of avoiding digestion

112

with DpnI endonuclease and facilitating subsequent assembly (Figure 1).

matched

duplex

DNA

structures.

After

25

113

Optimization of the Constructed RECODE Reaction System for Multiple

114

Site-directed Mutagenesis. To construct, evaluate and optimize the RECODE

115

method, we first constructed an indicator system (Figure 2a) by co-overexpressing a

116

galactosidase (encoded by lacZ) and an esterase (encoded by estC23)57. After

117

induction with isopropyl-β-D-thiogalactopyranoside (IPTG), both activities of

118

esterase (clear halos) and galactosidase (blue colonies) could be observed on

119

Luria-Bertani (LB) agarose plate with tributyrin and X-gal (Figure 2b). In

120

consideration of fidelity and avoiding the 5'-3'exonuclease (which may introduce

121

non-specific mutations and remove the downstream oligonucleotides annealed on a 6

ACS Paragon Plus Environment

ACS Synthetic Biology

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

122

template from 5' terminus),58 the Phusion® high fidelity DNA polymerase without the

123

5'-3' exonuclease activity was selected for extension reactions. Simultaneously, the

124

commonly used thermostable DNA ligases were comparatively investigated and the

125

results showed that only the Ampligase thermostable DNA Ligase generated small

126

amounts of mutant products (Supplementary Figure S1), which should be ascribed to

127

its similar optimal pH (8.3) compared with the high fidelity Phusion® DNA

128

polymerase (9.3). Due to the incompatible working concentration of Mg2+ for the

129

Phusion® DNA polymerase and Ampligase thermostable DNA Ligase (2 mM and 10

130

mM, respectively), the Mg2+ concentration was further investigated and optimized to

131

reconcile the extension and ligation reactions. When Mg2+ was above 7 mM or below

132

3 mM, only small amounts of mutant products were produced (Supplementary Figure

133

S2a), suggesting high concentration of Mg2+ inhibits the extension activity of

134

Phusion® DNA polymerase while low concentration of Mg2+ hinders the ligation

135

activity of Ampligase thermostable DNA Ligase. Accordingly, 5 mM Mg2+ was

136

utilized for extension and ligation in the optimized RECODE reaction buffer system

137

(Supplementary methods). In addition, the melting temperature (Tm) of the mutagenic

138

oligonucleotides, which depends on the flanking homology region of targets, showed

139

a wide range (40−66 °C). Specifically, the Tm value was optimized at 50 °C for high

140

recombination frequency (data not shown) by applying a consistent design of

141

mutagenic oligonucleotides (Supplementary Table S1).

142

Through systematic comparison and optimization of the components and reaction 7

ACS Paragon Plus Environment

Page 12 of 37

Page 13 of 37

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

143

cycle conditions (Supplementary Figure S1-3), the RECODE method with two-step

144

PCR was established (Figure 1a). On this basis, a color indicator system with

145

three-locus lethal mutations (inactivation of galactosidase in one site and esterase in

146

two sites) (Figure 2a) was designed and applied to visually examine the mutation

147

efficiency of this method. Remarkably, it could be found that all the recombinant

148

colonies on LB plates with X-gal and tributyrin substrates showed no parental

149

phenotype (blue color and transparent zone) (Figure 2c) and about 66.5% of the

150

colonies

151

(Supplementary Table S2). DNA sequencing results showed that 93 of the 116

152

variants (blue colonies) were simultaneously introduced mutations at two sites of the

153

estC23 gene while 109 of the 230 variants (white colonies) were simultaneously

154

introduced mutations at three sites of the designed lacZ-estC23 operon (47.5%).

lost

both

lipolytic

and

galactosidase

activities

(white

colonies)

155

To further simplify the operation process, a more simple RECODE method with

156

one-step PCR in one reaction system was designed and established (Figure 1b). To

157

guarantee the matching efficiency to the template, the DAP (60 nt) was designed with

158

a 40 nt complementary sequence to the template and 20 nt to the AP segment (Figure

159

1b). Meanwhile, a two-fold molar excess of APs over other primers were added with a

160

final concentration of 0.2 µM to guarantee extension and synthesis of all the mutant

161

single-strand

162

(Supplementary Figure S4) gave rise to fewer recombinants, the proportion of the

163

white colonies (about 61.6%, Figure 2d) was similar with that of the two-step PCR

chains.

Although

comparatively

less

8

ACS Paragon Plus Environment

target

PCR

products

ACS Synthetic Biology

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

164

systems. Furthermore, DNA sequencing results showed that 26 of the 45 mutants with

165

color colonies were simultaneously introduced mutations at three sites of the designed

166

lacZ-estC23 operon (57.8%, Figure 2d) which was higher than that of the two-step

167

PCR systems. Taken together, all the results demonstrate the high efficiency of the

168

RECODE method with one-step or two-step PCR system.

169

Application of RECODE to Rapidly Construct a Diverse Mutant Library of

170

Regulatory Elements. Promoter engineering is a powerful strategy in metabolic

171

engineering and synthetic biology.59 The transcriptional sigma factors and promoters

172

have been engineered for pathway construction and optimization.60 Here, four spacer

173

regions between the -35 and -10 elements from the 5′-untranslated region of rpoS

174

gene (encoding the sigma factor 38 for stress resistance in Escherichia coli) was

175

cloned and engineered by the RECODE method (Figure 2e) to generate a mutant

176

library with different transcriptional strengths. Specifically, the green fluorescent

177

protein that encoded by gfp was chosen as a reporter for visualized analysis of the

178

mutant library. After one-round evolution, five hundred colonies with different visual

179

green fluorescence intensities were randomly selected and cultured in 96-well plates

180

for characterization. The variants that showed significant differences compared to the

181

wild-type counterpart were then further confirmed in shake flask cultures. As shown

182

in Figure 2f, 74 variants exhibited a broad span of fluorescence intensity (8–460% of

183

the wild type). DNA sequencing results of the randomly picked variants showed that

184

the introduced multiple-cassette mutations covered the four spacer regions (Figure 2g), 9

ACS Paragon Plus Environment

Page 14 of 37

Page 15 of 37

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

185

demonstrating its high efficiency in combinatorial editing of DNA segments.

186

Collectively, these data suggest that this simple RECODE method should have wide

187

potential applications in rapid enzyme evolution.

188

Application of RECODE to Rapidly Broaden the Substrate Specificity of the

189

Leech Hyaluronidase. As an application example, a novel leech hyaluronidase

190

(LHAase, without activity to heparin) secreted by a recombinant Pichia pastoris56 was

191

chosen as a candidate for rapid evolution by RECODE (Figure 3a). The objective is to

192

rapidly create variants with improved LHAase activities and/or broadened substrate

193

specificity to heparin. According to the phylogenetic relationship and conserved

194

domains among the hyaluronidases and heparanases,56 10 mutagenic oligonucleotides

195

with degenerate sequences (Supplementary Table S1) were designed to cover the 10

196

candidate regions near the conserved amino acids.56 After one-step PCR evolution and

197

assembly, all the mutants were transformed into P. pastoris GS115. Subsequently,

198

Seven hundred colonies were firstly picked in 96 deep-well plates for cultivation and

199

assay of HAase activity. Then, the variants with significant differences were further

200

investigated in flask cultures. Remarkably, many variants with different activities

201

towards hyaluronan and heparin were successfully created after one-round evolution

202

by the RECODE method (Figure 3a). In comparison, the variant M2D7 showed a

203

2.4-fold increase of the LHAase activity while both the variants M4A4 and M5B4

204

showed increased LHAase activity and heparanase activity (Figure 3b and 3c).

205

Impressively, although the LHAase activity of the variant M4A12 decreased, the 10

ACS Paragon Plus Environment

ACS Synthetic Biology

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

206

heparanase activity increased with the highest titer of 750 ± 20 U mL–1, indicating its

207

potential to be an exclusive “heparanase”. Further DNA sequencing analysis of these

208

variants showed that the introduced mutation sites covered all the area of candidates

209

with random combinations (Figure 3d). In particular, more than 30 amino acid

210

residues in a single variant were simultaneously mutated (accounting for 6.12% of the

211

LHAase, 489 amino acids), demonstrating the powerful capacity of this RECODE

212

method in enzyme directed evolution.

213

Combinatorial Engineering of Regulatory Elements and Enzymes to Optimize

214

Synthetic Pathway. In many cases, it is not possible to optimize a synthetic pathway

215

only by engineering the regulatory elements (such as promoters and RBS). Here,

216

based on the established simple RECODE method, a novel approach of combinatorial

217

engineering of regulatory elements and pathway enzymes was proposed to optimize

218

synthetic pathways towards target compounds. As an example, the heme biosynthesis

219

pathway was simultaneously engineered at the transcriptional and protein levels for

220

efficient accumulation of the intermediate ALA (Figure 4a). Specifically, the three

221

crucial genes hemA, hemL and hemF61 were simultaneously engineered at the

222

transcriptional and protein levels by the RECODE method (Figure 4b). After

223

one-round evolution, 1000 recombinant colonies were characterized in 96 deep-well

224

plates (Figure 4c) and further investigated with flask cultures. As shown in Figure 4d,

225

the recombinant BL21 (DE3) strain containing the variant A24-H6 accumulated ALA

226

to the highest titer (2500 ± 120 mg L–1) which was 20-fold of that of the parental 11

ACS Paragon Plus Environment

Page 16 of 37

Page 17 of 37

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

227

strain. Furthermore, DNA sequencing results of eight variants (Figure 4d) showed that

228

the mutation sites covered most of the designed regions including promoters, RBS

229

regions and the open reading frames of hemA, hemL and hemF (Supplementary Figure

230

S6). The results demonstrate the high efficiency of this simple approach in pathway

231

engineering. Meanwhile, the results also confirmed that fine-tuning of the three

232

committed enzymes HemA, HemL and HemF are critical for ALA accumulation.61

233

Discussion. Enzyme directed evolution technologies have achieved great progress

234

in the past two decades, however, the sizes of the generated mutant libraries are

235

generally too large to fully explore by available selection strategies.50,62 Thus,

236

development of simple, fast and efficient techniques that allow rational design and

237

target engineering of smaller libraries with higher degrees of precision is always a

238

major requirement.1 In the present study, a designated RECODE method with

239

one-step PCR has been developed for rapid DNA editing (Figure 1), which has been

240

demonstrated to be effectively applied in engineering of regulatory elements, enzymes

241

and synthetic pathways.

242

Different from previous ssDNA-based methods,29,30 the RECODE method was

243

developed by employing the thermostable Phusion® high fidelity DNA polymerase

244

and Taq DNA ligase (Figure 1). Combined action of the thermostable high-fidelity

245

DNA polymerase and ligase not only facilitates all kinds of DNA templates (eg.

246

double-stranded DNA fragments, cDNA, genomic DNA and plasmid DNA) but also 12

ACS Paragon Plus Environment

ACS Synthetic Biology

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

247

avoids or reduces non-specific mutations and incorrect ligations. Here, to construct

248

this reaction system in one tube, the Mg2+ concentration and pH were demonstrated as

249

the major parameters that affect the activities of polymerase and ligase

250

(Supplementary Figure S1, 2) and optimized to achieve accordant extension and

251

ligation activities for the production of amounts of variants with desirable diversities

252

(Supplementary Figure S2). After further optimization of the extension and ligation

253

cycles (to fully anneal the mutagenic oligonucleotides and make full use of the

254

advantages of the thermostable DNA Ligase.53) and the Tm value of the homologous

255

sequence, a sufficient quantity of mutants with multi-combinatorial mutations were

256

rapidly generated for high-throughput screening. In comparison, it would be

257

impossible to achieve combinatorial multiple-cassette mutagenesis by further

258

modification of previous reported protocols because of application of the

259

room-temperature T4 DNA polymerase and ligase.29,30 Moreover, the PFunkel

260

method32 may be laborious due to the delayed addition of reaction reagents (including

261

polymerase and ligase) and degradation of the uracil-containing template and DNA

262

products not in the desired covalently closed circular by uracil DNA glycosylase and

263

exonuclease III.32

264

For enzyme evolution, the size and the diversity of the created library are always

265

the critical characterizations for the subsequent screening especially when no effective

266

selection methods are available. To this end, specific upstream and downstream

267

anchor primers and an antisense primer were designed and standardized (Figure 1). 13

ACS Paragon Plus Environment

Page 18 of 37

Page 19 of 37

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

268

Compared with previous DNA shuffling methods,7,63 the RECODE method enabled

269

rapid DNA evolution with higher diversity within a smaller library size. This is

270

accomplished by the application of specific primers in the reaction system that result

271

in synthesis of the full-length active mutants without the repeat amplification of the

272

existing ones. Furthermore, this RECODE method also showed powerful capacity in

273

combinatorial editing of a target DNA with insertion and deletion of interested

274

elements (Supplementary methods and Supplementary Figure S5). Compared with

275

previous DNA shuffling strategies, this RECODE method was much more rational in

276

protein engineering especially with the assistance of crystal structure analysis. As an

277

example, we successfully created a novel “heparanase” from the highly specific leech

278

hyaluronidase (Figure 3a). Moreover, the novel approach devised for rapid

279

optimization of the synthetic pathways by combinatorial engineering of regulatory

280

elements and pathway enzymes also showed high efficiencies (Figure 4d). Compared

281

with other pathway engineering strategies that mainly focused on transcriptional

282

regulation,64-66 this approach should be comparatively easier to use in constructing an

283

optimized pathway by simultaneously improving pathway enzyme properties and

284

optimizing the concentration ratios.

285

Conclusion. We have developed a simple (one-step PCR with any form of DNA

286

templates), rapid (one-round evolution of 5 kb DNA within 3 hours), efficient (higher

287

amount of products) and powerful (multiple single-directed mutations, insertions or

288

deletions and combinatorial multiple-cassette mutagenesis) RECODE method, which 14

ACS Paragon Plus Environment

ACS Synthetic Biology

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

289

enables us to easily reedit the target DNA to construct novel regulatory parts and

290

modules (Figure 5). Combining with protein synthesis in vitro,67,68 computer-aided

291

design69 and simulation, and efficient assembly techniques,1,70 especially the Gibson

292

isothermal assembly,71 Ligase Cycling Reaction53 and the DNA assembler,72 this

293

RECODE method will greatly promote protein studies and benefit our understanding

294

of the relationship between sequence and enzyme function. In conjunction, the

295

demonstrated approach for simultaneous modification of regulatory elements and

296

pathway enzymes will also enable us to rapidly optimize the synthetic pathways

297

towards the target products (Figure 5). Hence, the RECODE method and the derived

298

approach for pathway engineering will be of great benefit to a variety of applications

299

in the field of enzyme engineering, metabolic engineering and synthetic biology.

300



301

Bacterial Strains and Plasmids. E. coli DH5α and pBBRMCS2 (Stratagene, La Jolla,

302

CA, USA) were used for cloning and characterization of the library of SIR regulatory

303

element.55 The pMD19-T (Takara Ltd., Otsu, Japan) vector was used for cloning of

304

the lacZ-estC23 and gfp fragments. P. pastoris GS115 (his4) and the pPIC9K vector

305

(Invitrogen, Carlsbad, CA, USA) were used for expression of the hyaluronidase

306

mutant library. The pRSFduet-1 (Novagen Darmstadt, Germany) and E. coli BL21

307

(DE3) (Stratagene) were used as the vector and host, respectively, for pathway genes

308

expression of ALA production.

MATERIALS AND METHODS

15

ACS Paragon Plus Environment

Page 20 of 37

Page 21 of 37

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

309

Media and Culture Conditions. LB medium was used for cloning and cell culture.

310

LB plates supplemented with 100 µg mL-1 ampicillin, 0.1 mM IPTG, 20 µg mL−1

311

X-gal (Sigma-Aldrich, St. Louis, USA), and 1% (v/v) tributyrin (Merck, Darmstadt,

312

Germany) were used for characterizing the indicator system of lacZ-estC23 at 37 °C.

313

The secondary assay of E. coli libraries was carried out in shake flask with LB broth

314

at 37 °C with 200 rpm shaking. P. pastoris was selected on MD medium (0.34% YNB,

315

20 g L−1 glucose and 15 g L−1 agar) supplemented with 2 mg mL-1 G418. Shake flask

316

cultures of recombinant P. pastoris strains were carried out at 30 °C at 200 rpm with

317

buffered methanol minimal yeast medium (BMMY, 10 g L−1 yeast extract, 20 g L−1

318

peptone, 100 mM potassium phosphate, 0.34% YNB, 10 g L−1 (NH4)2SO4, 1%

319

methanol). For production of ALA, the recombinant E. coli BL21 (DE3) cells were

320

grown at 37 °C at 200 rpm with mineral basal medium (g L−1): 20.0 glucose, 2.0 yeast

321

extract, 16.0 (NH4)2SO4, 3.0 KH2PO4, 16.0 Na2HPO4•12H2O, 1.0 MgSO4•7H2O and

322

0.01 MnSO4•H2O, pH 7.0. When necessary, 50 µg mL−1 kanamycin and 0.1 mM

323

IPTG were supplemented.

324

Primer Design. All primers (BGI; Beijing, China) in this study are listed in

325

Supplementary Table S1. The lethal mutations for galactosidase encoding gene lacZ

326

and esterase encoding gene estC23 were designed by introduction of termination

327

codons into one site of lacZ and two sites of estC23 (Figure 2a). Specifically, the

328

bases GAA, TAC, and ATGA were replaced with the termination codons TAA, TAA,

329

and TGA, respectively. To evaluate the insertion and deletion capacity of the 16

ACS Paragon Plus Environment

ACS Synthetic Biology

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

330

RECODE method, the oligonucleotides that contain 12 nt (ACATGCCACTGG) and

331

9 nt (TACTGGCAC) fragments for insertion at 196 and 601 sites, and a 6 nt

332

(ACACAA) fragment for deletion at 607-612 site of gfp gene were designed,

333

respectively. For editing the SIR regulatory element, four degenerate oligonucleotides

334

were designed to cover the spacer fragments between the −35 and −10 boxes of the

335

corresponding four natural tandem promoters. For combinatorial evolution of the

336

leech hyaluronidase, mutation sequences instead of the inherent triplet code were

337

introduced into 10 candidate sites near the conserved amino acids, Specifically, the

338

triplet-codon was consciously avoided the introduction of the termination codons (for

339

example VNN; V = A, C, G; N = A, T, C, G). For combinatorial evolution of the

340

ALA pathway, multiple regions of the three pathway genes (hemA, hemL and hemF)73

341

were simultaneously modified with the highly degenerate triplet-code sequence (for

342

example VNN or NNN in oligonucleotides). The ribosome binding site (RBS) regions

343

were modified with degenerate RBS sequences (DDRRRRRDDDD; D = A, G, T; R =

344

A, G), and the spacer sequences between −35 and −10 boxes flanking the hemF

345

promoter were edited with the degenerate sequence (NNNNNNNNNNNNNNNNN).

346

RECODE Manipulation. All the synthetic mutagenic oligonucleotides and the

347

downstream anchor primers (DAP) were mixed with the same concentration and

348

phosphorylated. Specifically, phosphorylation was performed in 50-µL reactions

349

containing 300 pmol of a primer mixture, 1× T4 DNA ligase buffer (NEB, New

350

England Biolabs, USA), and 8 U polynucleotide kinase (NEB). The reaction was 17

ACS Paragon Plus Environment

Page 22 of 37

Page 23 of 37

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

351

incubated for 30 min at 37 °C and subsequently terminated by heating for 10 min at

352

75 °C.

353

The RECODE reaction was carried out in 50-µL system containing 0.1 µM of each

354

phosphorylated mutagenic oligonucleotide and the upstream anchor primer (UAP),

355

0.2 µM antisense primer , 0.01 pmol DNA template, 1 U Phusion DNA polymerase

356

(NEB), 5 U Ampligase thermostable DNA ligase (Epicentre, USA) and 1× optimized

357

RECODE reaction buffer (20 mM Tris-HCl, 25 mM KCl, 0.5 mM NAD, 2 mM

358

dNTPs, 5 mM Mg2+, 0.1% Triton X-100, pH 8.3). The one-step reaction system

359

(Figure 1b) was performed with the following PCR conditions: 2 min at 94 °C; 25

360

cycles of 30 s at 94 °C, 30 s at 50 °C, 2 min at 72 °C, and 3 min at 66 °C; a hold

361

period at 4 °C. In the two-step RECODE system (Figure 1a), twenty-five cycles were

362

programed in the 1st step PCR to ensure that the extended mutagenic primers were

363

fully annealed and to generate variants with different combinatorial mutations. After

364

purification, all the PCR products were applied as the templates in the 2nd step PCR.

365

Then the double-stranded products were generated with an antisense primer AP in a

366

new 50-µL PCR system containing the purified 1st step PCR products, 1 µL of

367

antisense primer and 25 µL of 2 × SuperPfu PCR MasterMix (Hangzhou Biosci Co.,

368

Ltd, China). The PCR reaction was performed with the following program: 2 min at

369

94 °C; then 3 cycles of 30 s at 94 °C, 30 s at 50 °C and 90 s at 72 °C; 5 min at 72 °C

370

and a hold period at 4 °C.

371

PCR products were run on 1% agarose gels containing EtBr at 90 V for 45 min. 18

ACS Paragon Plus Environment

ACS Synthetic Biology

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 24 of 37

372

Bands of the correct size were excised and purified with a gel extraction kit, according

373

to the manufacturer’s protocol. When applying plasmid DNA as templates, 1 µL DpnI

374

endonuclease (All endonucleases supplier, Fermentas, USA) was added in the PCR

375

products and incubated at 37 °C for 10 min to digest the template DNA.

376

Variants Library Construction and Screening. To obtain colony libraries,

377

double-stranded DNA variants produced by RECODE were cloned into vectors by the

378

enzymatic assembly DNA method.71 High-throughput screening of the library was

379

performed in 96-well microplates, and then selected variants of all the following

380

libraries were further examined in shake flasks. The lacZ-estC23 mutant library was

381

subcloned into the pMD19-T vector, and then transformed into DH5α. The library

382

was selected on LB plates supplemented with 100 µg mL-1 ampicillin, 0.1 mM IPTG,

383

20 µg mL−1 X-gal, and 1% tributyrin. The diversity of the mutant library was

384

characterized by phenotypic analysis and DNA sequencing. To construct the deletion

385

or insertion libraries of gfp gene by RECODE, the purified products were subcloned

386

into the pMD19-T vector, and then transformed into DH5α. The deletion or insertion

387

libraries were selected on LB plates supplemented with 100 µg mL-1 ampicillin,

388

respectively.

389

To construct the characterization library of the SIR variants, the gfp fragment was

390

firstly

391

EcoRI/BamHI

392

pBBR1MCS2-gfp. Then the SIR mutant library was assembled into the linear

subcloned

into

the

restriction

low-copy-number enzymes,

yielding

plasmid the

19

ACS Paragon Plus Environment

pBBR1MCS-2 recombinant

with

plasmid

Page 25 of 37

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

393

pBBR1MCS2-gfp via enzymatic one-step assembly to control expression of the

394

reporter gene gfp. The recombinant plasmids carrying the SIR library were

395

transformed into E. coli DH5α competent cells and cultured on LB plates

396

supplemented with 50 µg mL−1 kanamycin. Screening was performed by

397

high-throughput batch-cultures in 96-well microtiter plates. Transformants were

398

picked into microtiter wells with a Qbot robotic colony picker (Genetix, Hampshire,

399

UK) and grown in LB broth at 37 °C at 200 rpm for 24 h. To analyze the activity of

400

the promoter library, cells were collected by centrifugation (1, 3000 g, 5 min) and the

401

cell pellet washed three times with sterile water. The fluorescence intensity value of

402

per OD600 cells was measured by excitation at 490 nm and emission at 530 nm using

403

an Ultra Multifunctional Microplate Reader (Tecan, Durham, NC, USA). Then, some

404

variants were further verified in shake flask culture.

405

The hyaluronidase gene H6LHyal variants were inserted into pPIC9K under the

406

control of the AOX1 promoter by enzymatic assembly. The recombinant plasmids

407

were linearized with SalI and transformed into P. pastoris GS115 competent cells by

408

electroporation. The cells were cultured on MD plates supplemented with 2 mg mL−1

409

G418. Variants were picked into microtiter wells and cultured in BMMY broth

410

containing methanol (1% v/v) at 30 °C at 200 rpm for 72 h. The cultured supernatants

411

of variants were collected and diluted for the enzymatic assay in 96-well microtiter

412

plates. The selected variants were further cultured in shake flask rocking at 200 rpm at

413

30 °C according to the previous manipulation of shake flask culture.56 The enzymatic 20

ACS Paragon Plus Environment

ACS Synthetic Biology

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 26 of 37

414

reaction containing a 1.6 mg mL−1 of the substrate (hyaluronic acid or heparin,

415

Sigma-Aldrich, USA) and an appropriate amount of enzyme in 50 mM citrate buffer

416

(pH 5.5) was incubated at 38 °C for 10 min. The reaction was terminated by heat

417

inactivation then examined using the DNS method. Colorimetric reactions were

418

measured by using the Ultra Multifunctional Microplate Reader at 540 nm. One unit

419

of hyaluronidase activity was defined as equal to the reducing power of glucuronic

420

acid (glucose equivalents in micrograms) liberated per hour from HA at 38 °C and pH

421

5.5. One unit of heparanase activity was defined as equal to the reducing power of

422

oligosaccharides (glucose equivalents in milligrams) liberated per hour from HA at

423

30 °C and pH 5.5.

424

The

three

separated

mutant

libraries

(RBS/hemL,

RBS/hemA

and

425

promoter/RBS/hemF) of the ALA pathway were assembled into pRSFduet-1 via

426

enzymatic assembly and then transformed into BL21 (DE3) competent cells. To

427

analyze the ALA yields from the variants, high-throughput screening was carried out

428

in microtiter wells with mineral basal medium broth at 37 °C at 200 rpm for 30 h. The

429

candidate variants were further verified in shake flask cultures. The production of

430

ALA was analyzed by using the modified Ehrlich’s reagent.74

431

■ ASSOCIATED CONTENT

432

S Supporting Information ○

433

Supplementary data including figures, tables and methods associated with this article

434

can be referenced in the supplementary materials. This material is available free of 21

ACS Paragon Plus Environment

Page 27 of 37

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

435

charge via the Internet at http://pubs.acs.org.

436

AUTHOR INFORMATION

437

Corresponding Author

438

Zhen Kang, *E-mail: [email protected]

439

Notes

440

The authors declare no competing financial interest.

441

■ ACKNOWLEDGMENTS

442

This work was financially supported by the Major State Basic Research Development

443

Program of China (973 Program, 2014CB745103), the Natural Science Foundation of

444

Jiangsu Province (BK20141107), a grant from the Key Technologies R & D Program

445

of Jiangsu Province, China (BE2014607); Program for Changjiang Scholars and

446

Innovative Research Team in University (No. IRT_15R26); China Postdoctoral

447

Science Foundation funded project (125960).

448

■ REFERENCES

449 450 451 452 453 454 455 456

(1) Bornscheuer, U., Huisman, G., Kazlauskas, R., Lutz, S., Moore, J. and Robins, K. (2012) Engineering the third wave of biocatalysis. Nature 485, 185–194. (2) Romero, P. A. and Arnold, F. H. (2009) Exploring protein fitness landscapes by directed evolution. Nat. Rev. Mol. Cell Biol. 10, 866–876. (3) Dalby, P. A. (2011) Strategy and success for the directed evolution of enzymes. Curr. Opin. Struct. Biol. 21, 473–480. (4) Johannes, T. W. and Zhao, H. (2006) Directed evolution of enzymes and biosynthetic pathways. Curr. Opin. Microbiol. 9, 261–267. 22

ACS Paragon Plus Environment

ACS Synthetic Biology

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

457 458 459 460 461 462

(5) Lu, T. K., Khalil, A. S. and Collins, J. J. (2009) Next-generation synthetic gene networks. Nat. Biotechnol. 27, 1139–1150. (6) Carr, P. A. and Church, G. M. (2009) Genome engineering. Nat. Biotechnol. 27, 1151–1162. (7) Stemmer, W. P. (1994) Rapid evolution of a protein in vitro by DNA shuffling. Nature 370, 389–391.

463

(8) Zhao, H., Giver, L., Shao, Z., Affholter, J. A. and Arnold, F. H. (1998) Molecular

464

evolution by staggered extension process (StEP) in vitro recombination. Nat.

465

Biotechnol. 16, 258–261.

466

(9) Shao, Z., Zhao, H., Giver, L. and Arnold, F. H. (1998) Random-priming in vitro

467

recombination: an effective tool for directed evolution. Nucleic Acids Res. 26, 681–

468

683.

469

(10) Coco, W. M., Levinson, W. E., Crist, M. J., Hektor, H. J., Darzins, A., Pienkos,

470

P. T., Squires, C. H. and Monticello, D. J. (2001) DNA shuffling method for

471

generating highly recombined genes and evolved enzymes. Nat. Biotechnol. 19, 354–

472

359.

473

(11) Ness, J. E., Kim, S., Gottman, A., Pak, R., Krebber, A., Borchert, T. V.,

474

Govindarajan, S., Mundorff, E. C. and Minshull, J. (2002) Synthetic shuffling

475

expands functional protein diversity by allowing amino acids to recombine

476

independently. Nat. Biotechnol. 20, 1251–1255.

477

(12) Ostermeier, M., Shim, J. H. and Benkovic, S. J. (1999) A combinatorial

478

approach to hybrid enzymes independent of DNA homology. Nat. Biotechnol. 17,

479

1205–1209.

480 481 482 483

(13) Sieber, V., Martinez, C. A. and Arnold, F. H. (2001) Libraries of hybrid proteins from distantly related sequences. Nat. Biotechnol. 19, 456–460. (14) Packer, M. S. and Liu, D. R. (2015) Methods for the directed evolution of proteins. Nat. Rev. Genet. 16, 379–394.

23

ACS Paragon Plus Environment

Page 28 of 37

Page 29 of 37

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

484

(15) Kang, Z., Zhang, J., Jin, P. and Yang, S. (2015) Directed evolution combined

485

with synthetic biology strategies expedite semi-rational engineering of genes and

486

genomes. Bioengineered 6, 136–140.

487

(16) Coussement, P., Maertens, J., Beauprez, J., Van Bellegem, W. and De Mey, M.

488

(2014) One step DNA assembly for combinatorial metabolic engineering. Metab. Eng.

489

23, 70–77.

490

(17) Herman, A. and Tawfik, D. S. (2007) Incorporating Synthetic Oligonucleotides

491

via Gene Reassembly (ISOR): a versatile tool for generating targeted libraries.

492

Protein Eng. Des. Sel. 20, 219–226.

493

(18) Wang, H. H., Isaacs, F. J., Carr, P. A., Sun, Z. Z., Xu, G., Forest, C. R. and

494

Church, G. M. (2009) Programming cells by multiplex genome engineering and

495

accelerated evolution. Nature 460, 894–898.

496

(19) Li, Y., Gu, Q., Lin, Z., Wang, Z., Chen, T. and Zhao, X. (2013) Multiplex

497

iterative plasmid engineering for combinatorial optimization of metabolic pathways

498

and diversification of protein coding sequences. ACS Synth. Biol. 2, 651–661.

499

(20) Stemmer, W. P. (1994) DNA shuffling by random fragmentation and

500

reassembly: in vitro recombination for molecular evolution. Proc. Natl. Acad. Sci.

501

U.S.A. 91, 10747–10751.

502

(21) Crameri, A. and Stemmer, W. P. (1995) Combinatorial multiple cassette

503

mutagenesis creates all the permutations of mutant and wild-type sequences.

504

Biotechniques 18, 194–196.

505

(22) Reetz, M. T., Wilensek, S., Zha, D. and Jaeger, K. E. (2001) Directed evolution

506

of an enantioselective enzyme through combinatorial multiple-cassette mutagenesis.

507

Angew. Chem. Int. Ed. 40, 3589–3591.

508 509

(23) Bloom, J. D. (2014) An experimentally determined evolutionary model dramatically improves phylogenetic fit. Mol. Biol. Evol. 31, 1956–1978.

24

ACS Paragon Plus Environment

ACS Synthetic Biology

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

510

(24) Hidalgo, A., Schließmann, A., Molina, R., Hermoso, J. and Bornscheuer, U. T.

511

(2008) A one-pot, simple methodology for cassette randomisation and recombination

512

for focused directed evolution. Protein Eng. Des. Sel. 21, 567–576.

513 514

(25) Reetz, M. T. and Carballeira, J. D. (2007) Iterative saturation mutagenesis (ISM) for rapid directed evolution of functional enzymes. Nat. Protoc. 2, 891–903.

515

(26) Jain, P. C. and Varadarajan, R. (2014) A rapid, efficient, and economical

516

inverse polymerase chain reaction-based method for generating a site saturation

517

mutant library. Anal. Biochem. 449, 90–98.

518 519 520 521 522 523 524 525

(27) Parker, A. S., Griswold, K. E. and Bailey-Kellogg, C. (2011) Optimization of combinatorial mutagenesis. J. Comput. Biol. 18, 1743–1756. (28) Kaur, J. and Sharma, R. (2006) Directed evolution: an approach to engineer enzymes. Crit. Rev. Biotechnol. 26, 165–199. (29) Seyfang, A. and Jin, J. H. (2004) Multiple site-directed mutagenesis of more than 10 sites simultaneously and in a single round. Anal. Biochem. 324, 285–291. (30) Young, L. and Dong, Q. (2003) TAMS technology for simple and efficient in vitro site-directed mutagenesis and mutant screening. Nucleic Acids Res. 31, e11.

526

(31) Sawano, A. and Miyawaki, A. (2000) Directed evolution of green fluorescent

527

protein by a new versatile PCR strategy for site-directed and semi-random

528

mutagenesis. Nucleic Acids Res. 28, e78.

529 530 531 532

(32) Firnberg, E. and Ostermeier, M. (2012) PFunkel: efficient, expansive, user-defined mutagenesis. PLoS One 7, e52031. (33) Kunkel, T. A. (1985) Rapid and efficient site-specific mutagenesis without phenotypic selection. Proc. Natl. Acad. Sci. U.S.A. 82, 488–492.

533

(34) Weiss, G. A., Watanabe, C. K., Zhong, A., Goddard, A. and Sidhu, S. S. (2000)

534

Rapid mapping of protein functional epitopes by combinatorial alanine scanning.

535

Proc. Natl. Acad. Sci. U.S.A. 97, 8950–8954.

536 537

(35) Dougherty, M. J. and Arnold, F. H. (2009) Directed evolution: new parts and optimized function. Curr. Opin. Biotechnol. 20, 486–491. 25

ACS Paragon Plus Environment

Page 30 of 37

Page 31 of 37

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

538 539

(36) Cameron, D. E., Bashor, C. J. and Collins, J. J. (2014) A brief history of synthetic biology. Nat. Rev. Microbiol. 12, 381–390.

540

(37) Smanski, M. J., Bhatia, S., Zhao, D., Park, Y., L, B. A. W., Giannoukos, G.,

541

Ciulla, D., Busby, M., Calderon, J., Nicol, R., Gordon, D. B., Densmore, D. and Voigt,

542

C. A. (2014) Functional optimization of gene clusters by combinatorial design and

543

assembly. Nat. Biotechnol. 32, 1241–1249.

544

(38) Alper, H., Fischer, C., Nevoigt, E. and Stephanopoulos, G. (2005) Tuning

545

genetic control through promoter engineering. Proc. Natl. Acad. Sci. U.S.A. 102,

546

12678–12683.

547

(39) Salis, H. M., Mirsky, E. A. and Voigt, C. A. (2009) Automated design of

548

synthetic ribosome binding sites to control protein expression. Nat. Biotechnol. 27,

549

946–950.

550

(40) Dueber, J. E., Wu, G. C., Malmirchegini, G. R., Moon, T. S., Petzold, C. J.,

551

Ullal, A. V., Prather, K. L. and Keasling, J. D. (2009) Synthetic protein scaffolds

552

provide modular control over metabolic flux. Nat. Biotechnol. 27, 753–759.

553

(41) Pfleger, B. F., Pitera, D. J., Smolke, C. D. and Keasling, J. D. (2006)

554

Combinatorial engineering of intergenic regions in operons tunes expression of

555

multiple genes. Nat. Biotechnol. 24, 1027–1032.

556

(42) Zhang, F., Carothers, J. M. and Keasling, J. D. (2012) Design of a dynamic

557

sensor-regulator system for production of chemicals and fuels derived from fatty acids.

558

Nat. Biotechnol. 30, 354–359.

559

(43) Ajikumar, P. K., Xiao, W. H., Tyo, K. E., Wang, Y., Simeon, F., Leonard, E.,

560

Mucha, O., Phon, T. H., Pfeifer, B. and Stephanopoulos, G. (2010) Isoprenoid

561

pathway optimization for Taxol precursor overproduction in Escherichia coli. Science

562

330, 70–74.

563

(44) Xu, P., Gu, Q., Wang, W., Wong, L., Bower, A. G., Collins, C. H. and Koffas,

564

M. A. (2013) Modular optimization of multi-gene pathways for fatty acids production

565

in E. coli. Nat. Commun. 4, 1409. 26

ACS Paragon Plus Environment

ACS Synthetic Biology

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

566

(45) Du, J., Yuan, Y., Si, T., Lian, J. and Zhao, H. (2012) Customized optimization

567

of metabolic pathways by combinatorial transcriptional engineering. Nucleic Acids

568

Res. 40, e142.

569

(46) Na, D., Yoo, S. M., Chung, H., Park, H., Park, J. H. and Lee, S. Y. (2013)

570

Metabolic engineering of Escherichia coli using synthetic small regulatory RNAs.

571

Nat. Biotechnol. 31, 170–174.

572

(47) Kang, Z., Wang, X., Li, Y., Wang, Q. and Qi, Q. (2012) Small RNA RyhB as a

573

potential tool used for metabolic engineering in Escherichia coli. Biotechnol. Lett. 34,

574

527–531.

575 576 577 578 579 580 581 582 583 584

(48) Lee, S. Y. and Kim, H. U. (2015) Systems strategies for developing industrial microbial strains. Nat. Biotechnol. 33, 1061–1072. (49) Dai, Z. and Nielsen, J. (2015) Advancing metabolic engineering through systems biology of industrial microorganisms. Curr. Opin. Biotechnol. 36, 8–15. (50) Otte, K. B. and Hauer, B. (2015) Enzyme engineering in the context of novel pathways and products. Curr. Opin. Biotechnol. 35C, 16–22. (51) Keasling, J. D. (2012) Synthetic biology and the development of tools for metabolic engineering. Metab. Eng. 14, 189–195. (52) Stephanopoulos, G. (2012) Synthetic biology and metabolic engineering. ACS Synth. Biol. 1, 514–525.

585

(53) Kok, S. d., Stanton, L. H., Slaby, T., Durot, M., Holmes, V. F., Patel, K. G.,

586

Platt, D., Shapland, E. B., Serber, Z. and Dean, J. (2014) Rapid and reliable DNA

587

assembly via ligase cycling reaction. ACS Synth. Biol. 3, 97–106.

588 589

(54) Michael, S. F. (1994) Mutagenesis by incorporation of a phosphorylated oligo during PCR amplification. Biotechniques 16, 410–412.

590

(55) Kang, Z., Wang, Q., Zhang, H. and Qi, Q. (2008) Construction of a

591

stress-induced system in Escherichia coli for efficient polyhydroxyalkanoates

592

production. Appl. Microbiol. Biotechnol. 79, 203–208.

27

ACS Paragon Plus Environment

Page 32 of 37

Page 33 of 37

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

593

(56) Jin, P., Kang, Z., Zhang, N., Du, G. and Chen, J. (2014) High-yield novel leech

594

hyaluronidase to expedite the preparation of specific hyaluronan oligomers. Sci. Rep.

595

4, 4471.

596

(57) Jin, P., Pei, X., Du, P., Yin, X., Xiong, X., Wu, H., Zhou, X. and Wang, Q.

597

(2012) Overexpression and characterization of a new organic solvent-tolerant esterase

598

derived from soil metagenomic DNA. Bioresour. Technol. 116, 234–240.

599

(58) Holland, P. M., Abramson, R. D., Watson, R. and Gelfand, D. H. (1991)

600

Detection of specific polymerase chain reaction product by utilizing the

601

5'-3'exonuclease activity of Thermus aquaticus DNA polymerase. Proc. Natl. Acad.

602

Sci. U.S.A. 88, 7276–7280.

603

(59) Tyo, K. E., Alper, H. S. and Stephanopoulos, G. N. (2007) Expanding the

604

metabolic engineering toolbox: more options to engineer cells. Trends Biotechnol. 25,

605

132–137.

606

(60) Alper, H., Moxley, J., Nevoigt, E., Fink, G. R. and Stephanopoulos, G. (2006)

607

Engineering yeast transcription machinery for improved ethanol tolerance and

608

production. Science 314, 1565–1568.

609

(61) Zhang, J., Kang, Z., Chen, J. and Du, G. (2015) Optimization of the heme

610

biosynthesis pathway for the production of 5-aminolevulinic acid in Escherichia coli.

611

Sci. Rep. 5, 1452–1456.

612 613

(62) Denard, C. A., Ren, H. and Zhao, H. (2015) Improving and repurposing biocatalysts via directed evolution. Curr. Opin. Chem. Biol. 25, 55–64.

614

(63) Crameri, A., Raillard, S. A., Bermudez, E. and Stemmer, W. P. (1998) DNA

615

shuffling of a family of genes from diverse species accelerates directed evolution.

616

Nature 391, 288–291.

617

(64) Lee, J. W., Na, D., Park, J. M., Lee, J., Choi, S. and Lee, S. Y. (2012) Systems

618

metabolic engineering of microorganisms for natural and non-natural chemicals. Nat.

619

Chem. Biol. 8, 536–546.

28

ACS Paragon Plus Environment

ACS Synthetic Biology

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

620 621

(65) Pfleger, B. F., Gossing, M. and Nielsen, J. (2015) Metabolic engineering strategies for microbial synthesis of oleochemicals. Metab. Eng. 29, 1–11.

622

(66) Brockman, I. M. and Prather, K. L. J. (2015) Dynamic metabolic engineering:

623

New strategies for developing responsive cell factories. Biotechnol. J., DOI:

624

10.1002/biot.201400422.

625 626 627 628

(67) Orelle, C., Carlson, E. D., Szal, T., Florin, T., Jewett, M. C. and Mankin, A. S. (2015) Protein synthesis by ribosomes with tethered subunits. Nature 524, 119–124. (68) Soye, B. J. D., Patel, J. R., Isaacs, F. J. and Jewett, M. C. (2015) Repurposing the translation apparatus for synthetic biology. Curr. Opin. Chem. Biol. 28, 83–90.

629

(69) Jacobs, T. M., Yumerefendi, H., Kuhlman, B. and Leaver-Fay, A. (2015)

630

SwiftLib: rapid degenerate-codon-library optimization through dynamic programming.

631

Nucleic Acids Res. 43, e34.

632 633

(70) Merryman, C. and Gibson, D. G. (2012) Methods and applications for assembling large DNA constructs. Metab. Eng. 14, 196–204.

634

(71) Gibson, D. G., Young, L., Chuang, R. Y., Venter, J. C., Hutchison, C. A. and

635

Smith, H. O. (2009) Enzymatic assembly of DNA molecules up to several hundred

636

kilobases. Nat. Methods 6, 343–345.

637 638

(72) Shao, Z., Zhao, H. and Zhao, H. (2009) DNA assembler, an in vivo genetic method for rapid construction of biochemical pathways. Nucleic Acids Res. 37, e16.

639

(73) Kang, Z., Wang, Y., Gu, P., Wang, Q. and Qi, Q. (2011) Engineering

640

Escherichia coli for efficient production of 5-aminolevulinic acid from glucose.

641

Metab. Eng. 13, 492–498.

642 643

(74) Burnham, B. F. (1970) δ-Aminolevulinicacidsynthase (Rhodopseudomonas sphaeroides). Methods Enzymol. 17, 195–204.

644 645 646

Figures legends

29

ACS Paragon Plus Environment

Page 34 of 37

Page 35 of 37

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

647

Figure 1. Schematic diagram of the RECODE method with two-step (a) or one-step

648

(b) PCR. Phosphorylated mutagenic oligonucleotides, UAP and DAP were annealed

649

to template DNA after denaturation. All gaps between the primers were filled and

650

ligated with thermostable DNA polymerase and ligation. For two-step PCR,

651

single-stranded mutant products of the 1st step PCR were purified, and then used as

652

the template to synthesize double-stranded variants in the 2nd step PCR reaction with

653

AP primer which specifically matches DAP. For one-step PCR, the AP primer was

654

added in the reaction to selectively synthesize the complementary chains of the

655

mutant strand. The concentration of AP was set with 0.2 µM which was 2 times of

656

that of the DAP primer.

657

Figure 2. Optimization and application of RECODE system. (a) Termination codons

658

were introduced into galactosidase (lacZ) and esterase (estC23) with substitution or

659

deletion of the native codons by RECODE. (b) The phenotype of the recombinants

660

wild-type galactosidase (blue colonies) and esterase (clear halos). LB agars containing

661

tributyrin (esterase substrate) and x-gal (galactosidase substrate) were used. (c) and (d)

662

showed the phenotypes of variants by RECODE with two-step and one-step PCR

663

systems, respectively. (e) Combinatorial engineering of SIR fragment of the rpoS

664

gene. Four mutant spacers between -35 and -10 boxes of SIR segments were inserted

665

upstream of the gfp gene to control the expression of GFP. (f) The transcriptional

666

strength of the SIR variants with one-round evolution by RECODE (with GFP protein

667

as a reporter). (g) Alignment of the nucleic acid sequences of the SIR variants. 30

ACS Paragon Plus Environment

ACS Synthetic Biology

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

668

Figure 3. Combinatorial evolution of the leech hyaluronidase by RECODE. (a)

669

Illustration of the evolution and screening process. The activities of the wild-type

670

LHAase and its variants towards heparin (top) and hyaluronan (bottom) were

671

determined in 96-well microtiter plates. (b) and (c) are the enzyme activities of the

672

variants (culture supernatants) towards hyaluronan and the non-natural substrate

673

heparin, respectively. (d) Diverse distribution of the mutant amino acids (red) in the

674

constructed variants. All error bars indicate ± SD, n = 3.

675

Figure 4. Combinatorial engineering the regulatory elements and pathway enzymes to

676

optimize synthetic pathway towards ALA. (a) The heme biosynthesis pathway via

677

ALA in E. coli. The arrows in red represent a positive relationship between ALA

678

accumulation and overexpression of the enzymes HemA, HemL and HemF. (b)

679

Illustration of combinatorial engineering of the promoters, ribosome binding sites and

680

pathway genes. (c) Evaluation of the variants in 96-well microtiter plates with the

681

parent E. coli LAF strain as control (red arrow). (d) ALA accumulation of the variants

682

in shake flasks. All error bars indicate ± SD, n = 3.

683

Figure 5. RECODE generalized workflow for rapid construction of novel regulatory

684

elements and enzymes, and engineering of optimized synthetic pathways to facilitate

685

the development of in vivo and in vitro synthetic biology.

686

Supplementary data

687

Supplementary data associated with this article can be found in the online version. 31

ACS Paragon Plus Environment

Page 36 of 37

Page 37 of 37

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

688

32

ACS Paragon Plus Environment