
Letter

Direct Pathway Cloning (DiPaC) combined with Sequence- and Ligation-Independent Cloning (SLIC) for fast Biosynthetic Gene Cluster Refactoring and Heterologous Expression
Paul D'Agostino and Tobias A. M. Gulder
ACS Synth. Biol., Just Accepted Manuscript • DOI: 10.1021/acssynbio.8b00151 • Publication Date (Web): 25 Jun 2018
Downloaded from http://pubs.acs.org on June 26, 2018


ACS Synthetic Biology

Direct Pathway Cloning (DiPaC) combined with Sequence- and Ligation-Independent Cloning (SLIC) for fast Biosynthetic Gene Cluster Refactoring and Heterologous Expression

Paul M. D’Agostino and Tobias A. M. Gulder*

Biosystems Chemistry, Department of Chemistry and Center for Integrated Protein Science Munich (CIPSM), Technical University of Munich, Lichtenbergstraße 4, 85748 Garching bei München, Germany.

Keywords: Synthetic biology, Direct Pathway Cloning (DiPaC), Sequence- and ligation-independent cloning (SLIC), Hapalosin, Heterologous Expression

ACS Paragon Plus Environment


Abstract

The need for new pharmacological lead structures, especially to counter drug resistance, has led to a surge in natural product research and discovery. New biosynthetic gene cluster capturing methods to efficiently clone and heterologously express natural product pathways have thus been developed. Direct Pathway Cloning (DiPaC) is an emerging synthetic biology strategy that utilises long-amplification PCR and HiFi DNA assembly for the capture and expression of natural product biosynthetic gene clusters. Here, we have further streamlined DiPaC by reducing cloning time and reagent costs by utilising T4 DNA polymerase (sequence- and ligation-independent cloning) for gene cluster capture. As a proof of principle, the majority of the cyanobacterial hapalosin gene cluster was cloned as a single piece (23 kb PCR product) using this approach, and predicted transcriptional terminators were removed by simultaneous pathway refactoring, leading to successful heterologous expression. The combination of DiPaC with SLIC represents a time- and cost-efficient method for simple capture and expression of new natural product pathways.


Microbial natural products are renowned for their bioactivity and structural complexity, making their discovery highly significant. For example, approximately 49% of anti-infective compounds and 61% of anti-cancer pharmaceutical agents currently in clinical use are natural products or their derivatives.1 The rise of antibiotic resistance has further spurred the scientific community to identify more natural products and their corresponding biosynthetic gene clusters (BGCs). Aided by the rise of whole-genome sequencing,2 this revealed that many organisms encode more BGCs than there are known natural products.3 A range of bioinformatic tools, such as AntiSMASH (ref. 4) and PRISM,5 have been developed for the in silico identification of BGCs within whole-genome sequences.6 Bioinformatic investigations of hundreds of (meta-)genomes across many genera have provided thousands of BGCs without a known corresponding natural product.7, 8

Two broad methodologies can be employed to activate BGCs: untargeted alteration of the physical bacterial culture environment (e.g. different medium, ref. 9) or synthetic biology.10 Synthetic biology uses molecular engineering techniques to manipulate natural product pathways and can subsequently be coupled with heterologous expression.11 To experimentally validate the ever-increasing database of BGCs, a range of in vitro and in vivo methods for cloning and BGC capturing have been developed. Traditionally, the most common method has been the construction of large-insert genomic libraries by packaging partially digested genomic DNA into cosmid-, fosmid- or BAC-based (bacterial artificial chromosome) libraries, which still has the advantage of not requiring genome sequence data. Additional in vivo methods require known sequence data and are based on Rec/ET recombineering,12 such as linear-linear (LLHR)13 and linear-circular homologous recombination (LCHR),14 exonuclease combined with RecET recombination (ExoCET),15 and Cas9-assisted targeting of chromosome segments (CATCH).16 The natural recombination capability of the yeast Saccharomyces cerevisiae has also been utilized for the capture of BGCs via transformation-associated recombination (TAR) cloning.17-19 Alternatively, in vitro methods commonly utilize specialized primer design and PCR cycling conditions and include circular polymerase extension cloning (CPEC),20 assembly of fragment ends after PCR (AFEAP),21 and sequence- and ligation-independent cloning (SLIC),22 amongst others.23 Whilst many of these methods have revolutionised natural products research, cloning and expression can still take an extended length of time due to the high amounts of genomic DNA and/or multiple steps required for successful in vivo vector construction and activation. Direct Pathway Cloning (DiPaC),23 used for cloning small- to mid-size BGCs, helps overcome many of these issues via the utilisation of long-amplicon PCR. This considerably decreases the time required for vector and insert construction whilst simultaneously allowing for cluster refactoring and free vector backbone choice during both the cloning and expression stage.

Cyanobacteria are notoriously difficult to manipulate genetically, yet they are prolific producers of bioactive natural products. One such natural product, hapalosin, is a small depsipeptide encoded by a hybrid non-ribosomal peptide synthetase (NRPS)/polyketide synthase (PKS) within the cyanobacteria Fischerella, Hapalosiphon and Westiella intricata.24, 25 Interest in hapalosin is due to its ability to reverse P-glycoprotein-mediated multiple drug resistance, which makes it a potential agent to improve the success of chemotherapy,24 and thus an attractive target for total synthesis and activity-based assays.26, 27 The hapalosin biosynthetic gene cluster (hap) spans almost 25.7 kb and comprises the five genes hapA-hapE.25 However, there are inconsistencies regarding the true ORF of hapD, with two different translation start sites annotated amongst various hapalosin-producing cyanobacteria. Biosynthetically, hapalosin utilises a rare adenylation-ketoreductase (A-KR) didomain responsible for incorporation of the non-amino acid 2-oxoisovaleric acid (Figure S1). Issues with genetic manipulation of cyanobacteria, slow growth and inconsistent ORF annotation of hapD made the hap cluster a prime candidate for DiPaC and pathway refactoring. In this study, we aimed to utilise the hap gene cluster as a proof of principle to further streamline DiPaC by replacing HiFi DNA assembly with SLIC while retaining high cloning efficiency. Further, we planned to show the applicability of DiPaC to pathway refactoring in order to shed light on the true ORF of hapD.


Results and Discussion


Cloning strategy and DiPaC of the hapalosin biosynthetic gene cluster

The hap BGC was identified in the sequenced genome of Fischerella sp. PCC 9431 (NCBI accession: NZ_ALVX00000000)28 by utilising H. welwitschii HapA (JGI accession: 2529292566)25 as a query sequence. Both BGCs retained identical gene synteny and 100% nucleotide sequence similarity. The first step in the cloning strategy was to identify putative terminators within the hap gene cluster utilising the ARNold tool.29 A putative rho-independent transcriptional terminator was identified within the intergenic sequence directly downstream of hapA (Figure 1). Therefore, the cloning strategy included the removal of the 470 bp intergenic region between hapA and hapB to excise the putative terminator. This involved cloning hapA as a single gene downstream of the PtetO promoter and within range of the promoter Shine-Dalgarno sequence, to generate the intermediate plasmid pET28b-ptetO::hapA. To further improve the time and cost efficiency of DiPaC, we envisioned replacing HiFi DNA assembly with SLIC for integration of the entire hapBCDE DNA fragment into the pET28b-ptetO::hapA intermediate vector. SLIC has commonly been utilised for the ligation-independent cloning of single genes in enzyme overexpression experiments. To generate single-stranded homologous overhangs for annealing of the insert of choice, SLIC utilises the 3ʹ→5ʹ exonuclease activity of T4 DNA polymerase.22 Terminal homologous sequences are incorporated via primers, the DNA fragments anneal, and stitching of the plasmid occurs in vivo in E. coli after transformation.22 Importantly, positioning of the primer target sequence provides flexibility in how the expression vector is generated. In general, we found amplification was more pronounced if homology sequences were placed on primers amplifying the smaller linear fragment. Accordingly, the entire native 23 kb NRPS/PKS region consisting of hapBCDE was successfully amplified by PCR using insert-specific primers as a clean, single product (Figure S2). After purification and concentration, the hapBCDE PCR product was integrated via SLIC into the PCR-amplified pET28b-ptetO::hapA vector backbone, which was equipped with 26-30 bp homology sequences introduced during PCR amplification, to generate the expression vector pET28b-ptetO::hap. Preliminary clones were initially screened by colony PCR, where all 11 selected clones appeared to harbour the correct insert (Figure S3A). Successful cloning of the pET28b-ptetO::hap expression vector was then confirmed by restriction digest (Figure S4) and terminal insert sequencing. SLIC-mediated DiPaC was therefore found to be extremely efficient at cloning DNA fragments up to 23 kb, requiring only a 2.5 min incubation time compared to the 60 min incubation time often used for HiFi DNA assembly. Consequently, SLIC is at least as efficient as HiFi DNA assembly while being considerably quicker and more economical.

To investigate the true ORF of hapD, a second expression vector was then generated, where the hap pathway was further refactored to remove a 316 bp intergenic region between hapC and hapD (Figure S5). Primers amplifying the pET28b-ptetO::hapA backbone were designed to include 25-30 bp homology sequences for the successful capture of hapB-hapC via SLIC. The resulting pET28b-ptetO::refhapAC vector was then linearized and hapD-hapE inserted via SLIC to create pET28b-ptetO::refhapAC-hapDE. Thus, the expression plasmids pET28b-ptetO::hap and pET28b-ptetO::refhapAC-hapDE are identical with the exception of a 316 bp intergenic region between hapC and hapD. Colony PCR confirmed the removal of the 316 bp hapC-hapD intergenic region in eight out of ten clones (Figure S3B). The final pET28b-ptetO::refhapAC-hapDE was confirmed by restriction digest and sequencing of the hapC-hapD insertion site.

The DiPaC strategy utilizes long-amplicon PCR products and in vitro DNA hybridization. Here, we successfully cloned a 23 kb PCR product into an 8 kb vector, the largest linear DNA fragment cloned via SLIC to date. We found several factors vital for successful amplification and DiPaC of the hap gene cluster. Pure, stable, high molecular weight gDNA was vital for successful amplification, with an observable loss of amplification after 2 months of gDNA storage at -20°C. Further, successful cloning depended on the total absence of UV light exposure of the large DNA fragments; UV exposure resulted in a complete loss of cloning efficiency. Homology arm primers designed as previously described by Greunke et al.23 were compatible with both SLIC and HiFi DNA assembly. For future DiPaC cloning, we recommend attempting SLIC first, followed by HiFi DNA assembly if SLIC is unsuccessful, as the most economical DiPaC strategy. Overall, the factors described above were essential for the efficient cloning of the hap cluster.
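The placement of homology sequences on the backbone primers, as described above, can be illustrated with a minimal Python sketch. It is not the authors' primer design workflow (which used Geneious and the NEBuilder web tool); the function names, the 20 nt annealing length and the toy sequences are assumptions made only for illustration, and a real design would also check melting temperatures and secondary structure.

```python
# Illustrative sketch only: names and defaults are assumptions, not the authors' tool.

def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA sequence."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def slic_backbone_primers(backbone: str, insert: str,
                          homology: int = 28, anneal: int = 20) -> tuple[str, str]:
    """Build backbone primers whose 5' tails are homologous to the insert ends.

    backbone: top strand of the linearised vector, 5'->3'
    insert:   top strand of the insert PCR product, 5'->3'
    homology: tail length in bp (the text reports 25-30 bp)
    anneal:   length of the template-binding part (hypothetical default)
    """
    if not 25 <= homology <= 30:
        raise ValueError("homology tail outside the 25-30 bp range used in the text")
    if len(insert) < homology or len(backbone) < anneal:
        raise ValueError("sequence shorter than the requested primer segment")
    backbone, insert = backbone.upper(), insert.upper()
    # Forward primer: last `homology` bases of the insert, then the backbone start.
    forward = insert[-homology:] + backbone[:anneal]
    # Reverse primer: reverse complement of (backbone end + first insert bases).
    reverse = reverse_complement(backbone[-anneal:] + insert[:homology])
    return forward, reverse


# Toy sequences, made up for illustration (not hap or pET28b sequence).
toy_backbone = "GATTACAGGCTTAACCGGTTAAGCTTGGCCATAACTGCAGGTCGACTCTAGA"
toy_insert = "ATGGCTAGCAAAGGAGAAGAACTTTTCACTGGAGTTGTCCCAATTCTTGTTTAA"
fwd, rev = slic_backbone_primers(toy_backbone, toy_insert)
print(fwd)
print(rev)
```

With primers of this form, both ends of the backbone PCR product carry sequence identical to the insert termini, so the single-stranded overhangs exposed by the T4 DNA polymerase exonuclease activity can anneal to the insert.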


Heterologous expression of the hapalosin gene cluster

The fully constructed pET28b-ptetO::hap and pET28b-ptetO::refhapAC-hapDE vectors were transformed into E. coli BAP1. The E. coli BAP1 strain is a genetically engineered derivative of E. coli BL21 (DE3) with the phosphopantetheinyl transferase gene sfp integrated into the expression host genome,30 thereby facilitating the activation of NRPS/PKS pathways for heterologous expression.23, 31 Transcription from the expression plasmid was placed under the control of the PtetO promoter, which has been successful in activating a range of cyanobacterial natural product pathways.23, 32, 33 Hapalosin was identified in expression cultures harbouring the pET28b-ptetO::hap expression vector but could not be identified within pET28b-ptetO::refhapAC-hapDE extracts (Figure 2). LCMS and HR-LCMS identified hapalosin at a retention time (RT) of 7.9 min and m/z 490.3160 [M+H]+ in both pellet and supernatant fractions, which was consistent with extracts of Fischerella sp. PCC 9431 as a positive control (Figure S6 and Figure S7). These results confirm that the putative hap cluster is truly responsible for hapalosin biosynthesis, as predicted by Micallef et al.25 Comparison of supernatant samples indicates that E. coli BAP1 heterologous expression cultures produce approximately 45% of the hapalosin produced by the Fischerella sp. PCC 9431 native producer (Figure S8). Considering the very long growth times of Fischerella sp. PCC 9431 (30-40 days) compared to E. coli (5 day expression), this is a significant improvement towards production of hapalosin. Overall, the PtetO promoter and E. coli BAP1 expression system is proving to be efficient for the investigation of small to mid-sized cyanobacterial natural product pathways.

The ORF of hapD within the genomes of Fischerella sp. PCC 9431, Hapalosiphon sp. MRB220, Hapalosiphon welwitschii IC-52-3 and Westiella intricata HT-29-1 begins with a TTG start codon and encodes a 1526 aa protein. Alternatively, the ORF of hapD within H. welwitschii UTEX B 1830 is annotated to begin 181 bp downstream using the start codon ATG, thus encoding a 1466 aa protein (Figure S5). Importantly, the Fischerella sp. PCC 9431 and H. welwitschii UTEX B 1830 hap gene clusters have 100% nucleotide sequence similarity. The ability of DiPaC to efficiently refactor biosynthetic gene clusters via the selective integration of genetic blocks provided a simple platform to investigate the true ORF of hapD. We performed heterologous expression experiments using both full-length (TTG start codon) and reduced-length hapD (ATG start codon). From these expression cultures, hapalosin was only identified within full-length hapD constructs (Figure 2). These results indicate that the 60 aa N-terminal region of HapD is essential for biosynthesis and, therefore, that TTG is the true start codon of hapD. Cyanobacterial ORFs are predicted using bioinformatics and are thus prone to false positive identifications. Improvements to annotation software are decreasing the rate of false positive identification, but bioinformatic predictions need to be further supported by experimentally validated data, particularly given the underrepresentation of TTG start codons.34, 35
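The annotation ambiguity discussed above arises because several in-frame start codons can precede the same stop codon. The following Python sketch, run on a made-up toy sequence rather than the real hapD locus, lists every in-frame ATG, GTG or TTG upstream of a given stop codon together with the resulting protein length. The function name and the choice of candidate start codons are assumptions for illustration; annotation pipelines additionally weigh ribosome binding sites and coding statistics.

```python
# Illustrative sketch: enumerate alternative in-frame start codons for one ORF.
START_CODONS = ("ATG", "GTG", "TTG")   # start codons commonly considered in bacteria
STOP_CODONS = {"TAA", "TAG", "TGA"}


def candidate_starts(seq: str, stop_index: int) -> list[tuple[int, str, int]]:
    """Return (position, codon, protein length in aa) for each in-frame start.

    seq:        top-strand DNA, 5'->3'
    stop_index: 0-based index of the first base of the ORF's stop codon
    The walk goes upstream codon by codon until an in-frame stop is met.
    """
    seq = seq.upper()
    hits = []
    pos = stop_index - 3
    while pos >= 0:
        codon = seq[pos:pos + 3]
        if codon in STOP_CODONS:
            break
        if codon in START_CODONS:
            hits.append((pos, codon, (stop_index - pos) // 3))
        pos -= 3
    return sorted(hits)


# Toy ORF (not hapD): a TTG start, then an ATG start 15 nt further downstream.
toy = "CC" + "TTG" + "GCA" * 4 + "ATG" + "GCA" * 3 + "TAA"
for position, codon, length_aa in candidate_starts(toy, stop_index=29):
    print(position, codon, length_aa)

# Arithmetic for the two hapD annotations reported in the text:
# 1526 aa (TTG start) versus 1466 aa (ATG start) differ by 60 aa, i.e. 180 nt.
n_terminal_difference_nt = (1526 - 1466) * 3
print(n_terminal_difference_nt)
```

In the toy example both candidates share the same reading frame and stop codon, so only expression data of the kind reported here can distinguish which start yields a functional protein.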

Conclusion

DiPaC has proven to be an efficient method for the fast cloning of small- to medium-sized biosynthetic gene clusters.22 Here, we have utilised SLIC in combination with long-amplicon PCR to further streamline DiPaC by reducing cloning time and cost whilst retaining a high level of cloning efficiency. We expect these improvements to make the capturing of silent/orphan gene clusters more accessible to the research community. The ability to refactor the hap gene cluster during the cloning step was vital for the speed of cloning and proved a successful platform to investigate the true ORF of hapD. By providing a basis for the successful cloning of ever-increasing DNA sizes and reducing cloning time and cost, the efficiency of DiPaC has been vastly improved. Cyanobacterial natural product pathways are prime candidates for DiPaC and refactoring due to the challenges of genetically manipulating native producers and the difficulty in activating cyanobacterial native promoters in E. coli expression hosts.32, 33

Materials and Methods

Bacterial strains, plasmids and genomic DNA extraction

Bacterial strains and plasmids generated in this study are listed in Table 1. Fischerella sp. PCC 9431 was obtained from the Collections des Cyanobactéries, Institut Pasteur, Paris, France. Fischerella sp. PCC 9431 was cultivated in BG-11 medium ([pH 8], Sigma-Aldrich, Germany) at room temperature under 24 hr light without shaking. E. coli strains were grown in LB medium supplemented with 50 µg/mL kanamycin.


The extraction of high molecular weight genomic DNA free of impurities was vital for the long-amplicon PCR utilized in this study. Genomic DNA was extracted from cyanobacterial cultures using an optimized method adapted from D’Agostino et al.36 and Greunke et al.23 Briefly, fresh or frozen cell pellets were washed once with 0.9% NaCl and resuspended in 5 mL lysis buffer (25 mM EDTA, 0.3 M sucrose, 25 mM Tris-HCl [pH 7.5]). Extracted gDNA was stored in aliquots in 0.1x TE buffer (1 mM Tris-HCl, 0.1 mM EDTA [pH 8.0]) at -20°C until use.

Bioinformatic analysis and PCR

For the development of cloning strategies and primer design, the Geneious (ref. 37) software package (Version 8.1.9) and the NEBuilder assembly web tool (New England Biolabs; http://nebuilder.neb.com) were used. Plasmid maps were constructed using the SnapGene software. The sequenced genome of Fischerella sp. PCC 9431 (ref. 28) was downloaded from the NCBI database using the accession number ALVX01000000. The putative hap gene cluster was analysed bioinformatically for the presence of transcriptional terminators using ARNold (http://rna.igmors.upsud.fr/toolbox/arnold).29 PCRs used to generate linear fragments for cloning were performed in 20 µL reaction volumes, and long-amplicon reactions consisted of: 1x Q5 reaction buffer, 200 µM deoxynucleotide triphosphates, 500 nM of forward and reverse primer, DNA template and 0.02 U/µL Q5 High-Fidelity DNA polymerase (NEB). Template DNA amounts were 15 ng for Fischerella sp. PCC 9431 gDNA or 10 ng for plasmid DNA. Thermal cycling was performed in a Bio-Rad T100 Thermal Cycler and began with an initial denaturation at 98°C for 2 min, followed by 30 cycles of DNA denaturation at 98°C for 20 s, primer annealing for 15 s and DNA amplification at 72°C for 30 s/kb amplified, and a final extension at 72°C for 5 min. The annealing temperature for specific primers was calculated using the NEB Tm Calculator tool (http://tmcalculator.neb.com), while the optimum annealing temperature for primers harbouring homologous sequences was determined experimentally using gradient PCR with annealing temperatures of 50-70°C. Screening of positive transformants was performed by colony PCR using Taq DNA polymerase (NEB). Reaction mixtures contained 1x Taq buffer (10 mM Tris-HCl, 1.5 mM MgCl2, 50 mM KCl, pH 8.3 at 25 °C), 4% DMSO, 100 µM deoxynucleotide triphosphates, 500 nM of forward and reverse primer, DNA template (bacterial suspension in water) and Taq DNA polymerase (0.025 U/µL, NEB). Cycling conditions were as previously described with the exception of a 5 min 98°C initial denaturation step.

DiPaC strategy and construction of expression vector

Linear fragments for all DiPaC steps were generated using PCR with all vectors listed in Table 1 and primers listed in Table S1. PCR products of any vector backbone were first treated with DpnI (NEB) to remove template plasmid that would act as transformation background. All linear DNA fragments used for cloning were further purified prior to cloning using the NEB Monarch Gel Purification Kit (NEB). An essential factor for the successful cloning of long DNA fragments was to ensure that agarose gel extracted DNA was not exposed to any UV light. Further, gel purified DNA was eluted in 10 µL to maximise the possible concentration.

Firstly, primers were used to generate linear DNA fragments of the pET28b-ptetO vector and hapA insert (Table S1). The pET28b-ptetO vector is a derivative of pET28b where the T7 promoter has been exchanged for the tetracycline-inducible PtetO promoter. The purified linear fragments were used to generate pET28b-ptetO::hapA using HiFi DNA assembly as described by Greunke et al.23 Next, pET28b-ptetO::hapA was utilised as the vector backbone to create two expression vectors harbouring the hap gene cluster. The pET28b-ptetO::hapA backbone for cloning was generated by PCR. The first construct was generated with the PCR product spanning hapBCDE (23 kb) to generate pET28b-ptetO::hap. The second construct utilised two rounds of cloning, firstly incorporating hapB-hapC (13 kb) to generate pET28b-ptetO::refhapAC. This intermediate vector (21 kb) was then amplified and utilised for the incorporation of hapD-hapE (10 kb) to generate the expression vector pET28b-ptetO::refhapAC-hapDE. The pET28b-ptetO::hap and pET28b-ptetO::refhapAC-hapDE vectors are identical with the exception that the 316 bp intergenic region between hapC and hapD has been deleted in the latter construct.
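Because every fragment above is produced by long-amplicon PCR with an extension step of 30 s/kb, cycling time scales directly with amplicon size. The short Python sketch below applies the cycling parameters stated in the PCR section (2 min initial denaturation, 30 cycles of 20 s denaturation, 15 s annealing and 30 s/kb extension, 5 min final extension) to the fragment sizes listed here. The function names are hypothetical, and the totals ignore instrument ramping times, so they are lower bounds rather than measured run times.

```python
# Illustrative sketch: cycling time for the long-amplicon PCR conditions in the Methods.
EXTENSION_S_PER_KB = 30      # 72 °C extension, 30 s per kb amplified
CYCLES = 30
INITIAL_DENATURATION_S = 120 # 98 °C for 2 min
DENATURATION_S = 20          # 98 °C per cycle
ANNEALING_S = 15             # per cycle, temperature is primer-specific
FINAL_EXTENSION_S = 300      # 72 °C for 5 min


def extension_seconds(amplicon_kb: float) -> int:
    """Per-cycle extension time in seconds for an amplicon of the given size."""
    return round(EXTENSION_S_PER_KB * amplicon_kb)


def total_program_seconds(amplicon_kb: float) -> int:
    """Total hold time of the program in seconds, excluding ramping."""
    per_cycle = DENATURATION_S + ANNEALING_S + extension_seconds(amplicon_kb)
    return INITIAL_DENATURATION_S + CYCLES * per_cycle + FINAL_EXTENSION_S


# Fragment sizes as given in the text (kb).
fragments = {"hapBCDE": 23, "hapB-hapC": 13, "refhapAC vector": 21, "hapD-hapE": 10}
for name, size_kb in fragments.items():
    hours = total_program_seconds(size_kb) / 3600
    print(f"{name}: {extension_seconds(size_kb)} s extension per cycle, {hours:.1f} h total")
```

For the 23 kb hapBCDE amplicon this gives an 11.5 min extension per cycle, which illustrates why amplification, rather than the 2.5 min SLIC incubation, dominates the hands-off time of the workflow.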

For the construction of pET28b-ptetO::hap, pET28b-ptetO::refhapAC and pET28b-ptetO::refhapAC-hapDE, the SLIC method was utilized as a cheaper and quicker alternative to HiFi DNA assembly. Concentrations of linear DNA fragments for assembly were calculated as described by Greunke et al.23 SLIC was performed by utilising 1x buffer 2.1 (NEB) and 0.5 µL of T4 polymerase (NEB) in a 10 µL total reaction volume. The SLIC reaction mixture was incubated for 2.5 min at room temperature, followed by a 10 min incubation on ice. A total of 5 µL of reaction mixture was transformed by heat shock into chemically competent E. coli DH5α. Positive clones for all constructs were initially screened by colony PCR followed by restriction digest analysis and terminal-end Sanger sequencing.

Heterologous expression of the hapalosin gene cluster

Heterologous expression conditions were based on previously described experiments for the pET28b-ptetO plasmid with minor changes.23, 33 Briefly, prior to each heterologous expression experiment, E. coli BAP1 was chemically transformed with pET28b-ptetO (empty), pET28b-ptetO::hap or pET28b-ptetO::refhapAC-hapDE. A single colony was used to generate a 10 mL pre-expression culture grown in LB medium supplemented with 50 µg/mL kanamycin and incubated overnight at 37°C with shaking at 200 rpm. Pre-expression cultures were used to inoculate expression cultures (1% v/v) in 200 mL of LB or TB medium supplemented with 50 µg/mL kanamycin. Expression cultures were incubated at 30°C with shaking (200 rpm) until an OD600 of 0.4 (LB) or 0.8 (TB) was reached. Cultures were then cooled on ice for 30 min, induced with 0.5 µg/mL tetracycline and incubated for 5 days at 20 °C with shaking at 200 rpm. Flasks were covered in foil to reduce light-induced decomposition of tetracycline.

Extraction of expression cultures and LCMS analysis

Cyanobacterial or E. coli biomass was separated from growth medium by centrifugation at 10,000 g for 10 min. The supernatant was extracted three times with 1 vol of ethyl acetate, followed by desiccation in vacuo at 40°C using a rotary evaporator. To extract cell biomass, dichloromethane was added to cell pellets, which were incubated in a sonicator bath for 30 min. Extracted cell debris was removed via centrifugation (10,000 g for 10 min) and the solvent removed in vacuo at 40°C using a rotary evaporator.
Desiccated extracts were dissolved in HPLC-grade methanol and filtered through a Millex-GP syringe-driven 0.22 µm PES membrane filter (Millipore, USA) prior to injection into LCMS systems. LCMS experiments were conducted on an UltiMate 3000 LC System coupled to a LCQ Fleet Ion Trap Mass Spectrometer (Thermo Scientific). Chromatographic separation was carried out on a Hypersil Gold aQ C18 column (150 × 2.1 mm, 3 µm particle size). The eluents were water (A) and acetonitrile (B), both supplemented with 0.1% formic acid. Separation was performed at 0.7 mL/min using the following gradient: 5% B at 0 min to 95% B by 8 min, followed by washing the column at 100% B for 2 min and re-equilibration of the column at 5% B for 2 min prior to the next injection. HR-ESI-MS spectra were recorded with a Thermo LTQ-FT Ultra coupled with a Dionex UltiMate 3000 HPLC system. Separation was achieved using a C18 column with a 10 min gradient from 10% to 98% solvent B (solvent A = water + 0.1% FA, solvent B = 90% ACN, 10% water, 0.1% FA). The mass spectrometer was operated in positive mode, collecting full scans from m/z = 100 to m/z = 2000. Interpretation of all recorded MS data was performed using the Thermo Xcalibur Qual Browser 2.2 SP1.48 software.
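As a plausibility check on high-resolution data of this kind, the observed [M+H]+ value reported in the Results (m/z 490.3160) can be compared with a theoretical value. The Python sketch below does this under one important assumption that is not stated in this manuscript: the molecular formula of hapalosin is taken as C28H43NO6. The monoisotopic atomic masses are standard values rounded to seven decimals, and the function names are hypothetical.

```python
# Illustrative sketch: theoretical [M+H]+ and ppm error for an assumed formula.
MONOISOTOPIC_MASS = {        # most abundant isotopes, in Da
    "C": 12.0000000,
    "H": 1.0078250,
    "N": 14.0030740,
    "O": 15.9949146,
}
PROTON_MASS = 1.0072765      # Da (hydrogen atom minus one electron)

# ASSUMPTION: formula of hapalosin, not given in the manuscript text.
HAPALOSIN_FORMULA = {"C": 28, "H": 43, "N": 1, "O": 6}


def monoisotopic_mass(formula: dict[str, int]) -> float:
    """Sum of monoisotopic atomic masses for the neutral molecule."""
    return sum(MONOISOTOPIC_MASS[element] * count for element, count in formula.items())


def mz_protonated(formula: dict[str, int]) -> float:
    """Theoretical m/z of the singly charged [M+H]+ ion."""
    return monoisotopic_mass(formula) + PROTON_MASS


def ppm_error(observed: float, theoretical: float) -> float:
    """Mass error of the observation in parts per million."""
    return (observed - theoretical) / theoretical * 1e6


theoretical_mz = mz_protonated(HAPALOSIN_FORMULA)
observed_mz = 490.3160       # value reported in the Results
print(f"theoretical [M+H]+ = {theoretical_mz:.4f}")
print(f"mass error = {ppm_error(observed_mz, theoretical_mz):.2f} ppm")
```

Under the assumed formula the observed and theoretical values agree to well within 1 ppm, which is the kind of agreement expected from an FT-based instrument.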


Supporting Information: The Supporting Information is available free of charge on the ACS Publications website at DOI: List of nucleotides used for cloning and sequencing (Table S1), proposed hapalosin biosynthetic pathway (Figure S1), Amplification of hapBCDE from Fischerella sp. PCC 9431 gDNA (Figure S2), Colony PCR of pET28ptetO::hap and pET28-ptetO::refhapAC-hapDE E. coli DH5α transformants (Figure S3), Restriction digest confirmation of pET28-ptetO::hap expression vector (Figure S4), Organisation of the Fischerella sp. 9431 hap gene cluster (Figure S5), LCMS of expression extracts (Figure S6), High-resolution LCMS of hapalosin (Figure S7). Relative production of hapalosin in E. coli and Fischerella sp. PCC 9431 (Figure S8).

Author Information

Corresponding Author
*Tobias A. M. Gulder
E-mail: [email protected]
Telephone number: +49-(0)89-289-13833

ORCID
P. M. D’Agostino: 0000-0002-8323-5416
T. A. M. Gulder: 0000-0001-6013-3161

Postal Address
Technische Universität München, Department of Chemistry and Center for Integrated Protein Science Munich (CIPSM), Biosystems Chemistry, Lichtenbergstraße 4, 85748 Garching

Author Contributions
P. M. D. performed all experiments. P. M. D. and T. A. M. G. designed the research project and wrote the manuscript.


Acknowledgments

We thank Catharina Seel (Biomimetic Catalysis, Prof. Dr. Tanja Gulder, TUM) and Barbara Hofbauer (Chair of Organic Chemistry II, Prof. Dr. Stephan A. Sieber, TUM) for HR-LCMS analysis of hapalosin. We thank Prof. Bang-Guo Wei and Dr. Chang-Mei Si (Fudan University, China) for providing a synthetic hapalosin standard. We also thank Anna Glöckle for reading and editing the manuscript.

Funding

P.M.D. thanks the TUM Foundation Fellowship and the Marie Skłodowska-Curie Actions Individual Fellowship (Project ID: 745435) for funding. We thank the DFG for generous financial support of the work in our laboratory (Emmy Noether program and Center for Integrated Protein Science Munich, CIPSM).

References

1. Luo, Y.; Cobb, R. E.; Zhao, H., Recent advances in natural product discovery. Curr. Opin. Biotechnol. 2014, 30, 230-237.
2. Harvey, A. L.; Edrada-Ebel, R.; Quinn, R. J., The re-emergence of natural products for drug discovery in the genomics era. Nat. Rev. Drug Discovery 2015, 14, 111.
3. Bentley, S. D.; Chater, K. F.; Cerdeno-Tarraga, A. M.; Challis, G. L.; Thomson, N. R.; James, K. D.; Harris, D. E.; Quail, M. A.; Kieser, H.; Harper, D.; Bateman, A.; Brown, S.; Chandra, G.; Chen, C. W.; Collins, M.; Cronin, A.; Fraser, A.; Goble, A.; Hidalgo, J.; Hornsby, T.; Howarth, S.; Huang, C. H.; Kieser, T.; Larke, L.; Murphy, L.; Oliver, K.; O'Neil, S.; Rabbinowitsch, E.; Rajandream, M. A.; Rutherford, K.; Rutter, S.; Seeger, K.; Saunders, D.; Sharp, S.; Squares, R.; Squares, S.; Taylor, K.; Warren, T.; Wietzorrek, A.; Woodward, J.; Barrell, B. G.; Parkhill, J.; Hopwood, D. A., Complete genome sequence of the model actinomycete Streptomyces coelicolor A3(2). Nature 2002, 417 (6885), 141-147.
4. Blin, K.; Wolf, T.; Chevrette, M. G.; Lu, X.; Schwalen, C. J.; Kautsar, S. A.; Suarez Duran, H. G.; de los Santos, E. L. C.; Kim, H. U.; Nave, M.; Dickschat, J. S.; Mitchell, D. A.; Shelest, E.; Breitling, R.; Takano, E.; Lee, S. Y.; Weber, T.; Medema, M. H., antiSMASH 4.0 – improvements in chemistry prediction and gene cluster boundary identification. Nucleic Acids Res. 2017, 45 (W1), W36-W41.
5. Skinnider, M. A.; Dejong, C. A.; Rees, P. N.; Johnston, C. W.; Li, H.; Webster, A. L. H.; Wyatt, M. A.; Magarvey, N. A., Genomes to natural products PRediction Informatics for Secondary Metabolomes (PRISM). Nucleic Acids Res. 2015, 43 (20), 9645-9662.
6. Medema, M. H.; Fischbach, M. A., Computational approaches to natural product discovery. Nat. Chem. Biol. 2015, 11 (9), 639-648.


7. Doroghazi, J. R.; Albright, J. C.; Goering, A. W.; Ju, K.-S.; Haines, R. R.; Tchalukov, K. A.; Labeda, D. P.; Kelleher, N. L.; Metcalf, W. W., A roadmap for natural product discovery based on large-scale genomics and metabolomics. Nat. Chem. Biol. 2014, 10, 963.
8. Wang, H.; Fewer, D. P.; Holm, L.; Rouhiainen, L.; Sivonen, K., Atlas of nonribosomal peptide and polyketide biosynthetic pathways reveals common occurrence of nonmodular enzymes. Proc. Natl. Acad. Sci. U. S. A. 2014, 111 (25), 9259-9264.
9. Hemphill, C. F. P.; Sureechatchaiyan, P.; Kassack, M. U.; Orfali, R. S.; Lin, W.; Daletos, G.; Proksch, P., OSMAC approach leads to new fusarielin metabolites from Fusarium tricinctum. J. Antibiot. 2017, 70, 726.
10. Rutledge, P. J.; Challis, G. L., Discovery of microbial natural products by activation of silent biosynthetic gene clusters. Nat. Rev. Microbiol. 2015, 13 (8), 509-523.
11. Smanski, M. J.; Zhou, H.; Claesen, J.; Shen, B.; Fischbach, M. A.; Voigt, C. A., Synthetic biology to access and expand nature's chemical diversity. Nat. Rev. Microbiol. 2016, 14 (3), 135-149.
12. Wang, H.; Li, Z.; Jia, R.; Hou, Y.; Yin, J.; Bian, X.; Li, A.; Müller, R.; Stewart, A. F.; Fu, J.; Zhang, Y., RecET direct cloning and Redαβ recombineering of biosynthetic gene clusters, large operons or single genes for heterologous expression. Nat. Protoc. 2016, 11, 1175.
13. Fu, J.; Bian, X.; Hu, S.; Wang, H.; Huang, F.; Seibert, P. M.; Plaza, A.; Xia, L.; Müller, R.; Stewart, A. F.; Zhang, Y., Full-length RecE enhances linear-linear homologous recombination and facilitates direct cloning for bioprospecting. Nat. Biotechnol. 2012, 30 (5), 440-446.
14. Wang, H.; Bian, X.; Xia, L.; Ding, X.; Müller, R.; Zhang, Y.; Fu, J.; Stewart, A. F., Improved seamless mutagenesis by recombineering using ccdB for counterselection. Nucleic Acids Res. 2014, 42 (5), e37.
15. Wang, H.; Li, Z.; Jia, R.; Yin, J.; Li, A.; Xia, L.; Yin, Y.; Müller, R.; Fu, J.; Stewart, A. F.; Zhang, Y., ExoCET: Exonuclease in vitro assembly combined with RecET recombination for highly efficient direct DNA cloning from complex genomes. Nucleic Acids Res. 2017.
16. Jiang, W.; Zhu, T. F., Targeted isolation and cloning of 100-kb microbial genomic sequences by Cas9-assisted targeting of chromosome segments. Nat. Protoc. 2016, 11, 960.
17. Kouprina, N.; Larionov, V., Selective isolation of genomic loci from complex genomes by transformation-associated recombination cloning in the yeast Saccharomyces cerevisiae. Nat. Protoc. 2008, 3, 371.
18. Kim, J. H.; Feng, Z.; Bauer, J. D.; Kallifidas, D.; Calle, P. Y.; Brady, S. F., Cloning large natural product gene clusters from the environment: Piecing environmental DNA gene clusters back together with TAR. Biopolymers 2010, 93 (9), 833-844.


19. Yamanaka, K.; Reynolds, K. A.; Kersten, R. D.; Ryan, K. S.; Gonzalez, D. J.; Nizet, V.; Dorrestein, P. C.; Moore, B. S., Direct cloning and refactoring of a silent lipopeptide biosynthetic gene cluster yields the antibiotic taromycin A. Proc. Natl. Acad. Sci. U. S. A. 2014, 111 (5), 1957-1962.
20. Quan, J.; Tian, J., Circular polymerase extension cloning for high-throughput cloning of complex and combinatorial DNA libraries. Nat. Protoc. 2011, 6, 242.
21. Zeng, F.; Zang, J.; Zhang, S.; Hao, Z.; Dong, J.; Lin, Y., AFEAP cloning: A precise and efficient method for large DNA sequence assembly. BMC Biotechnol. 2017, 17 (1), 81.
22. Jeong, J.-Y.; Yim, H.-S.; Ryu, J.-Y.; Lee, H. S.; Lee, J.-H.; Seen, D.-S.; Kang, S. G., One-step sequence- and ligation-independent cloning as a rapid and versatile cloning method for functional genomics studies. Appl. Environ. Microbiol. 2012, 78 (15), 5440-5443.
23. Greunke, C.; Duell, E. R.; D’Agostino, P. M.; Glöckle, A.; Lamm, K.; Gulder, T. A. M., Direct pathway cloning (DiPaC) to unlock natural product biosynthetic potential. Metab. Eng. 2018, 47, 334-345.
24. Stratmann, K.; Burgoyne, D. L.; Moore, R. E.; Patterson, G. M. L.; Smith, C. D., Hapalosin, a cyanobacterial cyclic depsipeptide with multidrug-resistance reversing activity. J. Org. Chem. 1994, 59 (24), 7219-7226.
25. Micallef, M. L.; D'Agostino, P. M.; Sharma, D.; Viswanathan, R.; Moffitt, M. C., Genome mining for natural product biosynthetic gene clusters in the Subsection V cyanobacteria. BMC Genomics 2015, 16, 669.
26. O'Connell, C. E.; Salvato, K. A.; Meng, Z.; Littlefield, B. A.; Schwartz, C. E., Synthesis and evaluation of hapalosin and analogs as MDR-reversing agents. Bioorg. Med. Chem. Lett. 1999, 9 (11), 1541-1546.
27. Si, C.-M.; Shao, L.-P.; Mao, Z.-Y.; Zhou, W.; Wei, B.-G., An efficient approach to trans-4-hydroxy-5-substituted 2-pyrrolidinones through a stereoselective tandem Barbier process: Divergent syntheses of (3R,4S)-statines, (+)-preussin and (-)-hapalosin. Org. Biomol. Chem. 2017, 15 (3), 649-661.
28. Shih, P. M.; Wu, D.; Latifi, A.; Axen, S. D.; Fewer, D. P.; Talla, E.; Calteau, A.; Cai, F.; Tandeau de Marsac, N.; Rippka, R.; Herdman, M.; Sivonen, K.; Coursin, T.; Laurent, T.; Goodwin, L.; Nolan, M.; Davenport, K. W.; Han, C. S.; Rubin, E. M.; Eisen, J. A.; Woyke, T.; Gugger, M.; Kerfeld, C. A., Improving the coverage of the cyanobacterial phylum using diversity-driven genome sequencing. Proc. Natl. Acad. Sci. U. S. A. 2013, 110 (3), 1053-1058.
29. Naville, M.; Ghuillot-Gaudeffroy, A.; Marchais, A.; Gautheret, D., ARNold: A Web Tool for the Prediction of Rho-Independent Transcription Terminators. RNA Biol. 2011, 8 (1), 11-13.


30. Pfeifer, B. A.; Admiraal, S. J.; Gramajo, H.; Cane, D. E.; Khosla, C., Biosynthesis of complex polyketides in a metabolically engineered strain of E. coli. Science 2001, 291 (5509), 1790-1792.
31. Antosch, J.; Schaefers, F.; Gulder, T. A. M., Heterologous reconstitution of ikarugamycin biosynthesis in E. coli. Angew. Chem., Int. Ed. 2014, 53 (11), 3011-3014.
32. Ongley, S. E.; Bian, X.; Zhang, Y.; Chau, R.; Gerwick, W. H.; Müller, R.; Neilan, B. A., High-titer heterologous production in E. coli of lyngbyatoxin, a protein kinase C activator from an uncultured marine cyanobacterium. ACS Chem. Biol. 2013, 8 (9), 1888-1893.
33. Liu, T.; Mazmouz, R.; Ongley, S. E.; Chau, R.; Pickford, R.; Woodhouse, J. N.; Neilan, B. A., Directing the Heterologous Production of Specific Cyanobacterial Toxin Variants. ACS Chem. Biol. 2017, 12 (8), 2021-2029.
34. Smollett, K. L.; Fivian-Hughes, A. S.; Smith, J. E.; Chang, A.; Rao, T.; Davis, E. O., Experimental determination of translational start sites resolves uncertainties in genomic open reading frame predictions – application to Mycobacterium tuberculosis. Microbiology 2009, 155 (1), 186-197.
35. Hyatt, D.; Chen, G.; LoCascio, P.; Land, M.; Larimer, F.; Hauser, L., Prodigal: Prokaryotic gene recognition and translation initiation site identification. BMC Bioinf. 2010, 11 (1), 119.
36. D'Agostino, P. M.; Song, X.; Neilan, B. A.; Moffitt, M. C., Proteogenomics of a saxitoxin-producing and non-toxic strain of Anabaena circinalis (cyanobacteria) in response to extracellular NaCl and phosphate depletion. Environ. Microbiol. 2016, 18 (2), 461-476.
37. Kearse, M.; Moir, R.; Wilson, A.; Stones-Havas, S.; Cheung, M.; Sturrock, S.; Buxton, S.; Cooper, A.; Markowitz, S.; Duran, C.; Thierer, T.; Ashton, B.; Meintjes, P.; Drummond, A., Geneious Basic: An integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 2012, 28 (12), 1647-1649.


Table 1: Bacterial strains and plasmids used in this study

| Strains | Description | Reference or Source |
| --- | --- | --- |
| E. coli DH5α | Host strain for cloning | NEB |
| E. coli BAP1 | Heterologous expression strain | 30 |
| Fischerella sp. PCC 9431 | Native producer of hapalosin | Institut Pasteur |

| Plasmids | Description | Reference or Source |
| --- | --- | --- |
| pET28b-ptetO (6,029 bp) | Tetracycline inducible expression plasmid, ColE1, KanR | 23, 33 |
| pET28b-ptetO::hapA (8,156 bp) | Tetracycline inducible expression plasmid, ColE1, KanR, harbouring the first gene of the hap cluster, hapA | This study |
| pET28b-ptetO::hap (31,163 bp) | Built using pET28b-ptetO::hapA as the vector and hapBCDE as a single-piece nucleotide insert | This study |
| pET28b-ptetO::refhapAC (20,969 bp) | Built using pET28b-ptetO::hapA as the vector and hapB-hapC as a single-piece nucleotide insert | This study |
| pET28b-ptetO::refhapAC-hapDE (30,847 bp) | Constructed using pET28b-ptetO::refhapAC as the vector and hapD-hapE as a single-piece nucleotide insert. This plasmid was refactored to completely remove all intergenic regions of the hap cluster | This study |
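The plasmid sizes in Table 1 allow a quick sanity check of each cloning step. The sketch below computes the net size gain from vector to product; this is only an approximation of the insert length, since SLIC homology arms overlap the vector and any vector sequence lost during linearisation is not accounted for. Taking pET28b-ptetO as the vector for the hapA step is an assumption based on the plasmid names, and `net_gain` is a hypothetical helper.

```python
# Plasmid sizes in bp, copied from Table 1.
SIZES = {
    "pET28b-ptetO": 6029,
    "pET28b-ptetO::hapA": 8156,
    "pET28b-ptetO::hap": 31163,
    "pET28b-ptetO::refhapAC": 20969,
    "pET28b-ptetO::refhapAC-hapDE": 30847,
}

# (vector, product) pairs; the first pair is an assumption, the rest follow Table 1.
STEPS = [
    ("pET28b-ptetO", "pET28b-ptetO::hapA"),
    ("pET28b-ptetO::hapA", "pET28b-ptetO::hap"),
    ("pET28b-ptetO::hapA", "pET28b-ptetO::refhapAC"),
    ("pET28b-ptetO::refhapAC", "pET28b-ptetO::refhapAC-hapDE"),
]


def net_gain(vector, product):
    """Net size difference (bp) between a product plasmid and its vector."""
    return SIZES[product] - SIZES[vector]


if __name__ == "__main__":
    for vector, product in STEPS:
        print(f"{vector} -> {product}: +{net_gain(vector, product)} bp")
    # Net difference between the native-layout and fully refactored constructs.
    diff = SIZES["pET28b-ptetO::hap"] - SIZES["pET28b-ptetO::refhapAC-hapDE"]
    print(f"hap vs refhapAC-hapDE: {diff} bp")
```

The final line reports how much shorter the refactored construct is than pET28b-ptetO::hap, consistent with the removal of intergenic sequence noted in the table.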


Figure 1: Cloning strategy for the production of pET28b-ptetO::hap and pET28b-ptetO::refhapAC-hapDE. Bioinformatic analysis of the hap cluster identified a putative transcriptional terminator (red triangle), which was removed by the selective amplification and cloning of hapA (green) to produce pET28b-ptetO::hapA (1). The entire hapBCDE (blue) PCR fragment was amplified and incorporated into pET28b-ptetO::hapA via SLIC to produce pET28b-ptetO::hap (2). To shed light on the true ORF of hapD, a further refactored expression vector was generated by amplifying hapBC from its ATG start site (orange) and incorporating it into pET28b-ptetO::hapA to produce pET28b-ptetO::refhapAC (3). To complete the cluster, hapDE (red) was then incorporated into pET28b-ptetO::refhapAC to produce pET28b-ptetO::refhapAC-hapDE (4). The blue triangle indicates the site of excision of the hapC-hapD intergenic region.


Figure 2: Structure of hapalosin and LCMS of heterologous expression extracts. A) Structure of hapalosin with calculated and observed high-resolution ion masses. HR-LCMS detection of hapalosin is presented in the Supporting Information (Figure S7). B) LCMS analysis of heterologous expression extracts detected hapalosin solely in pET28b-ptetO::hap expression cultures, at a retention time (RT) of 7.9 min. Hapalosin could not be detected in cultures carrying the fully refactored expression plasmid (pET28b-ptetO::refhapAC-hapDE) or in empty plasmid controls. Traces show the extracted ion chromatogram at m/z 490. An LCMS comparison with extracts of Fischerella sp. PCC 9431 can be found in Figure S6.
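The extracted ion at m/z 490 can be related to the expected mass of hapalosin with a short calculation. The sketch below assumes the molecular formula C28H43NO6 (taken from the general literature on hapalosin, not stated in this manuscript) and assumes the detected species is the singly protonated ion [M+H]+, in line with positive-mode acquisition. The helper names are hypothetical.

```python
# Monoisotopic masses (u) of the most abundant isotope of each element.
MONOISOTOPIC = {"C": 12.0, "H": 1.00782503, "N": 14.0030740, "O": 15.9949146}
PROTON_MASS = 1.00727647  # mass of a proton (u)

# Assumed molecular formula of hapalosin (not given in the manuscript).
HAPALOSIN = {"C": 28, "H": 43, "N": 1, "O": 6}


def monoisotopic_mass(formula):
    """Sum of monoisotopic element masses for a {element: count} formula."""
    return sum(MONOISOTOPIC[el] * n for el, n in formula.items())


def mz_protonated(formula, charge=1):
    """m/z of the [M + zH]^z+ ion for the given formula and charge."""
    return (monoisotopic_mass(formula) + charge * PROTON_MASS) / charge


if __name__ == "__main__":
    print(f"M        = {monoisotopic_mass(HAPALOSIN):.4f} u")
    print(f"[M+H]+   = {mz_protonated(HAPALOSIN):.4f}")
```

Under these assumptions the calculated [M+H]+ value rounds to the nominal m/z 490 used for the extracted ion chromatogram in panel B.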

