Genome Engineering in Cyanobacteria: Where We Are and Where We

May 18, 2015 - Recently, research in this area has improved the ability to engineer cyanobacteria, but we are still in need of continued effort to ena...
0 downloads 0 Views 740KB Size
Subscriber access provided by NEW YORK UNIV

Review

Genome Engineering in Cyanobacteria: Where we are and where we need to go C. Josh Ramey, Ángel Barón-Sola, Hanna R. Aucoin, and Nanette R. Boyle ACS Synth. Biol., Just Accepted Manuscript • DOI: 10.1021/acssynbio.5b00043 • Publication Date (Web): 18 May 2015 Downloaded from http://pubs.acs.org on May 24, 2015

Just Accepted “Just Accepted” manuscripts have been peer-reviewed and accepted for publication. They are posted online prior to technical editing, formatting for publication and author proofing. The American Chemical Society provides “Just Accepted” as a free service to the research community to expedite the dissemination of scientific material as soon as possible after acceptance. “Just Accepted” manuscripts appear in full in PDF format accompanied by an HTML abstract. “Just Accepted” manuscripts have been fully peer reviewed, but should not be considered the official version of record. They are accessible to all readers and citable by the Digital Object Identifier (DOI®). “Just Accepted” is an optional service offered to authors. Therefore, the “Just Accepted” Web site may not include all articles that will be published in the journal. After a manuscript is technically edited and formatted, it will be removed from the “Just Accepted” Web site and published as an ASAP article. Note that technical editing may introduce minor changes to the manuscript text and/or graphics which could affect content, and all legal disclaimers and ethical guidelines that apply to the journal pertain. ACS cannot be held responsible for errors or consequences arising from the use of information contained in these “Just Accepted” manuscripts.

ACS Synthetic Biology is published by the American Chemical Society. 1155 Sixteenth Street N.W., Washington, DC 20036 Published by American Chemical Society. Copyright © American Chemical Society. However, no copyright claim is made to original U.S. Government works, or works produced by employees of any Commonwealth realm Crown government in the course of their duties.

Page 1 of 25

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

Genome Engineering in Cyanobacteria: Where we are and where we need to go C. Josh Ramey, Ángel Barón-Sola, Hanna R. Aucoin, Nanette R. Boyle1 Chemical and Biological Engineering Department, Colorado School of Mines, Golden, CO 1

Corresponding Author [email protected]

Abstract Genome engineering of cyanobacteria is a promising area of development in order to produce fuels, feedstocks and value added chemicals in a sustainable way. Unfortunately, the current state of genome engineering tools for cyanobacteria lags far behind those of model organisms such as Escherichia coli and Saccharomyces cerevisiae. In this review, we present the current state of synthetic biology tools for genome engineering efforts in the most widely used cyanobacteria strains and areas which need concerted research efforts to improve tool development. Cyanobacteria pose unique challenges to genome engineering efforts because their cellular biology differs significantly from other eubacteria; therefore, tools developed for other genera are not directly transferrable. Standardized parts, such as promoters and ribosome binding sites, which control gene expression, require characterization in cyanobacteria in order to have fully predictable results. The application of these tools to genome engineering efforts is also discussed; the ability to do genome wide searching and introducing multiple mutations simultaneously is one area that needs additional research in order to enable fast and efficient strain engineering.

ACS Paragon Plus Environment

ACS Synthetic Biology

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Introduction The dramatic decrease in the cost of DNA synthesis and sequencing in the last decade coupled to advances in molecular biology have enabled the development of synthetic biology and led to the sophisticated re-engineering of cellular machinery. The standardization of biological parts, such as promoters and other regulatory elements, allows researchers to edit and re-write genomes in a fast, efficient and predictable way. It is not currently possible to redesign entire genomes de novo; however, in the two most widely researched model organisms (E.coli and S. cerevisiae), this goal is within reach. Subsets of cellular machinery have been re-engineered in some organisms, one notable example is the complete refactoring of the nitrogen fixing gene cluster in Klebsiella oxytoca1. Despite the significant progress which has been achieved in re-writing genomes using synthetic biology techniques; the application of synthetic biology methods and techniques needs to be broadened to a larger group of organisms to fully reach its potential. One major area that needs concerted research effort is the development and use of synthetic biology tools in cyanobacteria. Cyanobacteria hold great promise as platform strains for the production of chemicals and feedstocks, especially since they require only carbon dioxide and light as substrates. Unfortunately, the current state of knowledge of these organisms lags far behind other model systems. Recently, research in this area has improved the ability to engineer cyanobacteria, but we are still in need of continued effort to enable more predictable and efficient redesign of these organisms as industrial production strains. In this review, we will discuss the current state of synthetic biology and genome engineering in cyanobacteria and future directions which need additional research. Standardized Parts One goal of synthetic biology is the development of standard genetic elements which can be ‘pulled off the shelf’ and combined to control gene expression within the cell of any organism. Some regulatory elements are conserved in bacteria and thus a few fully characterized parts from E. coli can be used in cyanobacteria, however, the majority of these parts are not directly transferrable. Below, we will discuss the current state of knowledge for standard parts in cyanobacteria and where improvements can be made. Promoters and terminators The first step in designing or discovering new methods to control transcriptional regulation is to understand the differences between cyanobacteria and typical eubacteria. Cyanobacteria have different consensus promoter sequences and their RNA polymerase holoenzyme (RNAP)2, 3 only functions with sigma (σ) factors belonging to the σ70 –family4, 5. Three different types of σ70 have been described that bind three different promoter sequences (I-III) in cyanobacteria. Group 1 σ70 factors prefer RpoD consensus promoters (Type I) similar to other eubacteria (-35 and -10 elements as TTGACA and TATAAT)6. Group 1 σ70 factors are responsible for the expression of common housekeeping genes and thus are indispensable to the cell. Group 2 and 3 σ70 factors have been reported to increase in concentration in response to environmental stresses6 such as heat7-9, salt10, oxidative11, or osmotic stress12 or nutrient limitation6, 13. Type II promoter sequences have only the -10 hexamer or enhancer motif sequences associated with transcriptional activator proteins; Type III promoters are distinct from both Type I and II but little else is known about their specificity and interaction with σ factors. Recently, Kaczmarzyk

ACS Paragon Plus Environment

Page 2 of 25

Page 3 of 25

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

et al. improved tolerance to butanol and high temperatures (up to 45˚C - 48˚C) by increasing expression of a group 2 cyanobacterial sigma factor (SigB) which binds to type II promoter sequences14. Engineering the transcription factors themselves can also be a successful approach to improved complex phenotypes, such as improved solvent tolerance. In yeast, Alper et al. used random mutagenesis to engineer transcription factors which resulted in improved glucose/ethanol tolerance 15. Similar approaches can be used in cyanobacteria to elicit global changes in transcription resulting in improved growth in a variety of stressful growth conditions. The ability to tune gene expression is critical for any re-design of cellular machinery; thus, the identification and/or engineering of promoter sequences which span large dynamic ranges and are both constitutive and inducible are necessary steps to further develop synthetic biology in cyanobacteria. The majority of endogenous promoters currently used to drive heterologous gene expression in cyanobacteria are derived from genes associated with the light harvesting complex of photosynthesis (Table 1). These promoters induce high levels of gene expression; however, as these promoters are light induced, there is little or no gene expression when cells are grown in the dark16. Recently, a light-driven super-strong promoter (Pcpc560) reported in the cyanobacterium Synechocystis sp. PCC6803 has been used efficiently for heterologous gene expression resulting in protein levels similar to that produced using the strong tac promoter in E. coli 17-19 with low or medium-copy plasmids. However, one drawback of using these strong promoters is that they are either ‘on’ or ‘off’, which can lead to unpredictable (and deleterious) physiological changes and does not allow tuning of gene expression. There are a number of tunable promoters which are responsive to environmental cues (such as nutrient concentrations) in cyanobacteria which can be used to overcome this hurdle. Most of the currently known inducible promoters are responsive to micronutrient concentrations (see Berla et al. for a comprehensive review 20), such as metal ion concentrations. However, due to the sensitive nature of these promoters (some are responsive to concentrations as low as 50µM), they are difficult to use in the laboratory or industrial settings due to additional experimental considerations, such as acid washing glassware, the use of trace metal grade chemicals and growing cells in nutrient limited conditions for several generations which may result in non-negligible physiological stresses or low growth rates. Due to the dearth of easily tunable promoters in cyanobacteria, Markley et al. engineered promoter libraries based on the cyanobacterial cpcB promoter in Synechococcus sp. PCC700221. Using YFP as a reporter gene, the resulting library showed a 3log range in promoter strength. The cpcB promoter based library was used to control gene expression in E. coli and results varied dramatically from what was measured in Synechococcus, further evidence that promoter performance in E. coli cannot be used to predict promoter response in cyanobacteria. Engineered E. coli promoter sequences have also been used in cyanobacteria with varying degrees of success23, 24 (Table 1). A transcriptional modulation system based on TetR regulated promoters allowed for the induction of a 290-fold change of eYFP (Yellow Fluorescent Protein) levels25. Despite elevated levels of gene expression using the TetR regulated promoter, the high light sensitivity of the inducer aTc (anyhydrotetracycline) restricts its use to low light conditions (30µmol photons m-2 s-1). Isopropyl β-D-1-thiogalactopyranoside (IPTG) inducible promoters have also been used in cyanobacteria with a wide range of effectiveness. Utilization of the same IPTG inducible promoter, such as Ptrc , in different cyanobacteria strains can result in a high level of variability. For example, the use of Ptrc in Synechococcus elongatus PCC7942 resulted in an increase in gene expression up to 36-fold26 while in Synechocystis sp. PCC6803 there was little

ACS Paragon Plus Environment

ACS Synthetic Biology

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

to no change in gene expression27. A possible explanation for this difference could be the capacity of different cyanobacteria to transport the IPTG molecules into cells. Markley et al. engineered a constitutive promoter library based on well-characterized E.coli promoters using YFP as a reporter gene21 and reported a range of YFP expression of 2.5-log. By combining the cpcB (discussed in previous paragraph) and the E.coli-based promoter libraries, Markley et al.21 engineered an IPTG induced system yielding an approximately 48 fold increase in YFP expression in Synechococcus sp. PCC7002. These results are impressive when compared to the previously reported isiAB-Fe3+ repressor system, which has a 2-fold induction; however, they are still one order of magnitude lower than those obtained with the Ptrc E. coli promoter20. Promoter engineering in cyanobacteria is a powerful tool for controlling gene expression however, it is not without its challenges. These examples illustrate the complexity of this endeavor as these promoters are affected by a multitude of factors such as growth conditions, growth phase, competence to inducer uptake, inducer type and concentration, sigma factors, and promoter type. In addition to promoter sequences, terminator sequences are also extensively used in synthetic biology to control gene expression at the transcriptional level. In prokaryotes, terminator sequences control gene expression via two different mechanisms: intrinsic termination, also known as Rho-independent termination, which occurs when the formation of a stem-loop structure in mRNA causes RNAP to stall. Rho-dependent termination relies on the binding of the Rho protein to the nascent mRNA causing RNAP to be released. Although homologs of E. coli Rho proteins are found in a majority of bacteria, they have not been found in two of the most widely used cyanobacteria (either Synechocystis sp. PCC6803 or Synechococcus elongatus). A high resolution transcriptome map created for Synechococcus elongatus PCC7942 has been used to identify termination sequences. Secondary structures at the 3' end of mRNA transcripts were found to have lower free energy, and therefore a preference to form stem-loop structures just prior to the end of the transcripts as is found in Rho-independent transcription termination28. Additionally, several terminator sequences were found to occur between open reading frames (ORFs) within the same operon suggesting promoter-independent transcription level regulation occurs in cyanobacteria. This mechanism of transcriptional regulation has been used in cyanobacteria by inserting terminator sequences between ORFs to prevent background transcription effects of upstream and/or downstream genes on the gene of interest. The endogenous cyanobacterial Rubisco terminator29, E. coli rrnB strong terminator30, and transcription terminator sequences derived from T723 bacteriophage have also been used in cyanobacteria. Although these terminator sequences are used in synthetic constructs, it is important to note that to date, there have been no concerted efforts to test the effectiveness of terminator sequences in cyanobacteria specifically. Ribosome binding site Ribosome binding sites (RBSs) are also well characterized genetic tools used to control protein expression in model organisms. There are several computational tools available to predict the effect of altering a RBS sequence, such as the RBS Designer Model31, the UTR Designer32 and the RBS Calculator33, 34. The RBS Designer can be used to predict protein translation efficiencies of a given mRNA based on the RBS sequence, spacer length and homology to the Shine-Dalgarno (SD) sequence. The UTR Designer is based on the characterization of specific E.coli mRNA regions that produce dynamic changes in mRNA folding32. Use of this model showed that the 5´ RNA region of the coding sequence is critical for modulating protein

ACS Paragon Plus Environment

Page 4 of 25

Page 5 of 25

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

expression levels. Finally, the RBS calculator is based on thermodynamic analysis of mRNA sequences in E.coli. The effects of different RBS sequences and surrounding nucleotides are considered in this model along with the start codon sequence and putative mRNA secondary structures. In E. coli, the RBS calculator is quite robust and can predict translation rates over a range of 100,000 within a factor of roughly 2. These three RBS prediction software packages were used by Markley et al.21 to design and evaluate a RBS library in Synechococcus sp. PCC 7002. Unfortunately, the predicted levels of YFP expression (up to 213-fold range) were poorly correlated with experimental YFP fluorescence data (30-fold range). The poor correlation between RBS calculator predictions and experimental evidence was also reported in a study to optimize 2,3-butanediol in Synechococcus elongatus sp. PCC 7942 35. Although several RBS models have been developed to improve RBS efficiencies in E.coli, the use of these predictive models in cyanobacteria does not yield predictable results. Thus, protein translation is regulated via a different mechanism in cyanobacteria and deserves further research to develop specific predictive RBS tools for these organisms. It is important to note that while the above studies report changes in gene expression due to changes in promoter and/or RBS, the use of the same expression construct within different genetic context may result in different results36, 37. To our knowledge, there have been no rigorous studies which examine the effect of genetic context on the promoter strength in cyanobacteria, therefore it is difficult to know how much variation will be seen. In recent work by Mutalik et al., approximately 500 transcription and translation initiation units were engineered to minimize effects of secondary structure38. These genetic elements can be used with any downstream interest and expression levels should be within twofold of the target level; this is an 87% reduction in expression error typically reported. In order to achieve the goal of synthetic biologists and have truly interchangeable parts, more research should be done to create fully insulated parts; the results of Mutalik et al. indicate that this indeed can be achieved with enough effort.

RNA Regulators and Post-translational Control Yet another mechanism to control gene expression and/or protein concentrations is by altering RNA conformation or base pairing with DNA or other RNA sequences. These so called RNA regulators can be used to engineer genetic networks. One type of RNA regulator that cyanobacteria rely on is non-coding RNA (ncRNA). Antisense RNA (asRNA), a type of ncRNA, has been reported to regulate gene expression in cyanobacteria in several studies39-41 suggesting an important role of this genetic control in the general metabolism of these organisms42. For instance, an asRNA was found to bind AU-rich box and RBS regions of the psbA2 to prevent the degradation caused by RNAse E in Synechocystis sp. PCC6803. Interestingly, both asRNA and psbA2 transcript abundance were regulated by light and expression levels were correlated43. Riboregulators are specific RNA sequences that respond to signal molecules and can act on the translation process by releasing silencing RNA or activating ribozymes. This system has been successfully adapted from E. coli45 for use in cyanobacteria46 as a method for posttranscriptional control of gene expression. Abe et al.46 designed and tested different riboregulators in E. coli based on the cis-repressed (cr)/ trans-activating (ta) RNA system developed by Isaacs et al.45. The crRNA sequence represses gene expression by binding RBS region of the mRNA. Then,

ACS Paragon Plus Environment

ACS Synthetic Biology

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

taRNA transcript can bind crRNA by disrupting the inactivation of the mRNA RBS thus, activating gene transcription. The authors further evaluated the described riboregulator system using a GFPuv gene as a reporter in Synechocystis sp. PCC6803 and report comparable results to that obtained in E. coli. These RNA-based genetic tools can be used in cyanobacteria to design specific transcriptional approaches to block or promote mRNA degradation allowing for the modulation of metabolic pathways for the optimal production of target compounds. Regulation of mRNA stability can also be conducted using a plasmid based approach controlling the transcript levels of ncRNA using an inducible/repressible promoter. This allows for the targeting of multiple transcripts from the multiple copies of a gene simultaneously by ncRNA transcripts addressing concerns derived from polyploidy nature of cyanobacteria (see Multiple Chromosomes and Associated Challenges). Riboswitches are yet another approach to altering protein levels in a cell. Riboswitches are functional non-coding RNA molecules which have a sensing domain (aptamer) and a regulating domain. A conformational changes occurs when the aptamer domain binds to a specific ligand and thereby affects the expression of the adjacent genes. A recent study in Synechocccus elongatus PCC 7942 reports the use of a theophylline-dependent riboswitch which is capable of inducing protein expression 190 fold47. The riboswitch was also very effective at repressing translation in the absence of theophylline, unlike leaky gene expression when IPTG inducible promoters are used 23, 48, 49. Another advantage of riboswitches is that protein levels of genes under control of the riboswitch can be tuned by altering the concentration of the ligand. Since 16s rRNA is conserved amongst related cyanobacteria, this implies that similar approaches can be used for other model cyanobacteria, such as Synechocystis sp. PCC 6803 and Anabaena sp. PCC 7120. Protein stability is also an important point of regulation; protein degradation tags control the abundance of protein and may prove useful for altering cellular mechanisms. Natural protein degradation mechanisms have been reported in Synechocystis sp. PCC6803 which protect the organism from heat and light stress conditions50. Landry et al.51 evaluated a set of different cyanobacterial and E. coli protein degradation tags fused to the reporter enhanced Yellow Fluorescence Protein (eYFP) in Synechocystis sp. PCC6803. Fluorescence levels showed a wide a range (from 1 to 50% of untagged eYFP) indicating that this system of regulation has the potential to fine-tune enzyme levels in cyanobacteria. Harnessing the natural mechanisms of metabolic control described in cyanobacteria for use as synthetic biology tools can be used to optimize a desired metabolic pathway however, plenty remains to be discovered and understood about transcriptional and translational regulation in cyanobacteria and each should be considered fields to be explored to determine the limitations and advantage of every approach. Genome Engineering Multiple Chromosomes and Associated Challenges A major challenge in any genome engineering effort, regardless of the organism used is obtaining clones/mutants which are homogenous. From early studies in E. coli, it was assumed that prokaryotes contain only a single copy of their genome (monoploid); however, in E. coli, this is only true when growth is extremely slow (td = 16 hours)52. The process of replicating a chromosome in E. coli takes approximately 60 – 90 minutes to complete53; therefore, in order to replicate at faster growth rates which are typically seen in rich medium (td = 20 minutes), E. coli

ACS Paragon Plus Environment

Page 6 of 25

Page 7 of 25

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

must initiate multiple rounds of replication before the previous replication machinery is complete. This results in an average chromosome copy number of 6.8 in normal growth conditions 54, 55. Despite having much slower growth rates, cyanobacteria have also been reported to have multiple copies of their chromosome when grown under ‘normal’ conditions and it can vary from a handful to several hundred. One of the most widely used species for metabolic engineering and synthetic biology is Synechocystis sp. PCC 6803, which is highly polyploid; having chromosome copy numbers ranging from 60 - 225 copies of its chromosome per cell 56 depending on the strain being used (Table 1). The number of chromosomal copies can also vary depending on the growth phase, Griese et al. report that Synechocystis sp. PCC 6803 has an average of 218 chromosome copies in exponential phase which drops to 58 and 57.6 in linear and stationary phase respectively56. Another widely used species, Synechococcus elongatus sp. PCC 7942 is reported to have on average 4 copies of its chromosome per cell in both linear and stationary growth phases 56, 57. Griese et al. also compiled information on several other species, which can range from 1 to 218 chromosomes per cell56. There are several evolutionary advantages to carrying multiple copies of the chromosome, such as efficient repair of double strand breaks, gene redundancy, lower rates of spontaneous mutations and rapid cell divisions. Despite the numerous evolutionary advantages multiple chromosomes provides the cell, they also create a grand challenge for genome engineering, both in terms of creating mutant libraries and tracking them on a chromosome by chromosome basis. Even in organisms with advanced genome engineering tools, such as E. coli, the challenge of working with multiple chromosome copies has not been overcome (see58 for a detailed discussion of how multiple chromosomes affects recombination efficiencies). Desired mutations must either be integrated into every chromosome or continuously selected for until a homogenous population is achieved; if this is not possible (as many mutations are not selectable) the mutation will be diluted out significantly due to further replication and division of WT cells. This is a significant problem in E. coli, where the copy number is only 6.8; in cyanobacteria, where some species have hundreds of copies of their chromosome present it becomes especially problematic. If desired mutations are introduced on one or even a few chromosomes, they will quickly be diluted by growth during the recovery period. No reports to date address this problem specifically in E. coli or cyanobacteria, however if ploidy can be finely controlled without significant defects on growth, recombination efficiency can be increased tremendously. The fewer chromosomes present, the fewer recombination events need to occur (or shorter outgrowth period) to have homogenous populations. As discussed earlier, cells grown more slowly have lower chromosome copy numbers, however this significantly decreases the ability to introduce mutations quickly. Therefore, control of ploidy is a major challenge that needs to be addressed in any genome engineering effort and has yet to be addressed in many of the genome engineering literature. Genomic, Expression and Deletions Libraries in Cyanobacteria The use of genomic libraries to understand complex genetic phenotypes such as increased production of bio-products and understanding tolerance mechanisms are well established59, 60. Cyanobacteria have been engineered for the production of ethylene, sucrose and 2, 3 butanediol; however, yields still need to be improved for economic feasibility61, 62. The development of genetic tools such as genomic libraries, expression libraries and gene deletion collections will facilitate the development of cyanobacteria into an important production organism. The

ACS Paragon Plus Environment

ACS Synthetic Biology

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

construction of genomic libraries in cyanobacteria requires the continued improvement of shuttle vectors between cyanobacteria and E. coli (current vectors for cyanobacteria have recently been reviewed16). Plasmid vectors currently used in cyanobacteria either use a replicon of a broad host range plasmid or an endogenous cryptic plasmid. Important considerations for the development of an effective genomic library in cyanobacteria are the following: plasmid copy number, plasmid partitioning into daughter cells, library insert size, genetic coverage and available selectable markers. One concern that is specific for using endogenous plasmids is any effect caused by endogenous genes or DNA sequence elements present on the plasmid. Use of shuttle vectors for the construction of a genomic library in E. coli allows for all cyanobacteria genomic sequences to be represented. Construction in E. coli permits the identification of genes and DNA sequences whose overexpression is toxic under normal conditions in cyanobacteria, as well as genes and DNA sequences that promote increased biosynthesis of chemical products by increasing tolerance or flux through important biochemical pathways. Cyanobacteria genomic library approaches and resources applied thus far are listed in Table 1. To date genomic libraries have been utilized for complementation analysis, genome sequencing or as a starting point for constructing a knockout library using transposon mutagenesis of genomic library plasmids in E. coli (Table 1). Genomic libraries are a simple overexpression at the level of DNA sequence, therefore when developing genomic libraries, an important consideration is the plasmid copy number. Due to the polyploidy nature of cyanobacterium discussed above, the effectiveness of a simple overexpression at the DNA level might be limited. To overcome this limitation, an expression library of all known open reading frames (ORFs) of a particular cyanobacterium needs continued development; this requires a sequenced and annotated genome (Table 1). Also listed in Table 1 are expression libraries created in cyanobacteria species and strains. Expression libraries have been reported for Synechocystis sp. PCC 6803, Synechococcus elongatus and Anabaena. The expression system used for Synechococcus elongatus was a bacteriophage lambda based system that utilizes the lac promoter to identify carotenoid binding proteins by isolating immunopositive clones63. This library was designed to express Synechococcus elongatus proteins in E. coli and thus cannot be utilized to elucidate genes which confer complex phenotypes in the endogenous host. A cosmid based expression library has been reported in Synechocystis for identification of enzymes involved in light dependent chlorophyll biosynthesis by complementing a defect in a R. capsulatus strain64. In Anabaena, an expression library was created by ligating (2-10kb) genomic fragments into a plasmid containing the highly expressed endogenous rbcL promoter65. These reported expression systems are insufficient for understanding complex phenotypes because they do not control gene expression over a dynamic range and the genomic fragments inserted into the expression vectors are not defined. Ideally, genomic libraries would be constructed in shuttle vectors with interchangeable pieces (promoters, RBS, and terminators) so that different levels of gene expression could be tested in the native host to aide in identifying gene targets for further engineering. Defined expression library collections using inducible promoters and optimized ribosome binding site sequences are an important tool for the development of increased production of bioproducts and to facilitate the understanding of tolerance mechanisms. One advantage of expression libraries is that protein levels can be controlled over a much greater dynamic range than a simple genomic library. A current limitation of expression libraries in cyanobacteria is the lack of an effective and easy to use cyanobacteria-specific inducible promoter that does not

ACS Paragon Plus Environment

Page 8 of 25

Page 9 of 25

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

affect the overall physiology of cyanobacteria (see 20 for a comprehensive list of promoters used in cyanobacteria). As discussed previously, the majority of inducible promoters native to cyanobacteria are trace metal responsive; these promoters require the removal of inducer metal ions which have deleterious effects to the overall physiology of the cell and growth rate. There are a number of E. coli based promoters which have been engineered to function in cyanobacteria which can be used for this purpose but their range of expression can be limited or problematic. For example, IPTG inducible promoters are known to be leaky in the absence of the IPTG. Advanced metabolic engineering requires the ability to fine tune entire metabolic pathways; to test different levels of gene expression in different genetic backgrounds, genomic or expression libraries require the availability of promoters whose strength can be tuned easily. Expression library construction should allow for easy shuffling of vector backbones controlling plasmid copy number and promoters controlling transcript levels. The construction of an expression library could be accomplished through traditional means or take advantage of homologous recombination cloning methods in cyanobacteria or Saccharomyces cerevisiae67, 68. Important considerations for the development of expression libraries are the following: plasmid copy number, promoter strength, translation efficiency, ribosome binding sequences, terminators, selectable markers available and downstream techniques for introducing the overexpression into the genome. Multiple methods have been described for the construction of knockout libraries in cyanobacteria 20, 69, 70 including: homology based recombination of selectable markers, markerless knockout using selection/counter selection strategies, transposon mutagenesis of genomes and transposon mutagenesis of plasmid libraries in E. coli that are then integrated back into the host genome71. Cyanobacteria are an attractive model for genetic manipulation because they are naturally competent, easily taking up foreign DNA and integrating it into the genome through homologous recombination pathways. An important limitation for the construction of deletion libraries is the polyploid genome of cyanobacteria. This is overcome via continued selection over multiple generations, thus greatly increasing the time needed to create a single gene knockout. By traditional methods of constructing a knockout library, each gene knockout takes approximately three weeks. One caveat to this approach is that obtaining homogeneous strains that contain only the gene knockout and not the wild type gene is selected against if the gene knockout negatively affects the cell. Using a plasmid shuffle approach can solve this problem. A covering plasmid containing the wild type gene with a counter selectable maker allows for all the genomic copies of the gene to be deleted while the gene function is covered by the plasmid copy of the gene, followed by counter selection against the plasmid. Another important consideration when constructing a knockout library in cyanobacteria is species/strain selection. For example, if the genetic target or molecular process being targeted is involved in photosynthesis, it is important to determine if your species is an obligate photoautotroph; if it is, this would preclude any efforts to knockout or reduce activity of those essential genes. This is one reason why Synechocystis sp. PCC 6803 is used extensively as a model organism – it can be grown purely autotrophically, mixotrophically or heterotrophically. It is important to note that it has been recently shown that the introduction of sugar transport systems allow for Synechococcus elongatus PCC7942, an obligate photoautotroph, to grow using saccharides such as glucose, xylose and sucrose in the absence of light72. We believe that constructing a knock-out library in cyanobacteria is a worthwhile effort that the community should work on in a massive parallel fashion while next generation knockout approaches such as Crisper/Cas9 and TALEN systems are developed. Examples of collaborative efforts for the construction of a deletion collection in Saccharomyces cerevisiae and E. coli are available to model an approach to making a deletion

ACS Paragon Plus Environment

ACS Synthetic Biology

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 10 of 25

collection in cyanobacteria73, 74 67. The construction of a knockout library is an excellent opportunity to integrate research into undergraduate education. Current knockout libraries in species and strains of cyanobacteria are listed in Table 1. Multiplexing As discussed above, the generation of a single gene knockout in a fast-growing cyanobacterium can take approximately three weeks using conventional methods. Therefore, in order to design and create production strains in a timely and cost-effective manner, it is imperative to develop methods which allow the introduction of several genetic mutations at once, thus streamlining the process (Figure 1). The first step required for multiplexing in cyanobacteria is to identify the genes to target for a given phenotype. In E. coli, genes associated with a given phenotype (for example, solvent tolerance) can be identified using genomic library enrichment strategies. For example, MultiScalar Analysis Library Enrichment (SCALE)75 uses clones from a plasmid-based genomic library to identify individual genes or group of genes responsible for the desired phenotype 76-78. The enrichment of clones which are able to grow in the given condition is quantitatively evaluated by hybridization or next generation sequencing. A similar method that uses barcodes to track genetic modifications is TRackable Multiplex Recombineering (TRMR)79. This approach merges the yeast barcoding method80 with homologous recombination processes in order to integrate and track mutations in the chromosome. These types of genome-wide searching tools have not be developed for use in cyanobacteria, but they are necessary in order to rationally engineer production strains by targeting genes which are known to increase or improve growth in the desired conditions. It has been shown that tuning the expression of genes in an entire pathway can lead to higher productivities when compared to just overexpressing genes encoding rate limiting enzymes81. In order to tune the expression level of several genes simultaneously, methods such as Multiplex Automated Genome Engineering (MAGE) need to be developed for cyanobacteria. MAGE, which was developed in E. coli, enables the simultaneous generation of mutants with multiple genetic modifications. Based on evolutionary processes within a cell population, MAGE is able to generate up to 15 billion genetic variants per day in E. coli81. This approach involves a large-scale introduction of synthetic single stranded DNA (ssDNA) oligos which encode desired mutations into a genome using a short cycling automation process (2 - 2.5 h). With the longer doubling times of even the fastest growing cyanobacteria (Table 1), achieving similar library sizes would take much longer (1 week to 1 month); however, would still be faster than creating mutants with desired gene expression levels one at a time. One of the most challenging aspects of creating a MAGE-like method for creating mutant libraries is how to design an effective screen or selection to identify mutants with the desired phenotype. This is exacerbated by the high chromosomal copy number in cyanobacteria; it is a critical step to reduce the heterogeneity of the population and to minimize the number of clones which need to be tested further in order to identify the ‘winner’. Multiplex genome engineering methods, such as MAGE and TRMR, would enable researchers to design, build and test production strains (Figure 1) faster and more efficiently than current methods allow. This expanded ability to generate mutants would also have the additional benefit of generating increased knowledge of processes within the cell which differ from model bacteria (such as E. coli). CRISPR-Cas systems

ACS Paragon Plus Environment

Page 11 of 25

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

Clustered regularly interspaced short palindromic repeats (CRISPR) along with CRISPR associated proteins (Cas proteins) have been described in Eubacteria and Archaea as an adaptive immune system against foreign genetic elements86, 87. This immune systems has been used as a basis to design genome editing tools in a variety of organisms, from bacteria88 and yeast89 to plants90 and human cells90, 91. The three major CRISPR/Cas systems described in prokaryotes92 share a common mechanistic process in order to acquire immunity against foreign genetic material (viruses and plasmids). The first stage is the incorporation of external nucleic acids into the host genome CRISPR/Cas regions. The next step involves the transcription of DNA into small pieces of RNA (crRNA) that will be further used as an interference element for targeting and degrading the complementary foreign genetic material. All of the processes are performed by the Cas proteins which function as helicases and nucleases. Unlike other previous genome editing technologies, such as zinc finger nucleases and transcription activator like effector nucleases (TALENs), CRISPR/Cas systems do not rely on DNA-protein interaction but simply work with RNA-DNA complementarity rules. This feature simplifies the protocol by avoiding the complex steps required for protein engineering. Type II CRISPR/Cas system can make insertions and/or deletions (indels) as well as site directed point mutations. This mechanism involves various elements for the maturation of a pre-crRNA into crRNA: a short trans-encoded RNA known as transactivating crRNA (tracrRNA), the nuclease Cas9 (Csn1) and host RNA polIII. Additionally, multiple efforts to improve the efficiency of CRISPR/Cas systems have been performed to optimize the specificity of Cas nucleases93 and prevent reported off-target effects in eukaryotic cells94. CRISPR/Cas systems are becoming a preferred tool as a highly efficient and reliable method for genome editing applications. The presence of CRISPR/Cas repeat sequences have been reported in different cyanobacteria. Analysis of 126 cyanobacterial genomes found that only marine cyanobacterial strains lack CRISPR/Cas sequences95. Genomic studies in the cyanobacterium Synechocystis sp. PCC680396 have described three different types of CRISPR/Cas systems in the endogenous pSYSA plasmid. Each CRISPR system bears different Cas proteins and uses independent processing pathways. Authors found high transcript abundance of CRISPR/Cas sequences and a strong dependency between crRNA accumulation and Cas6 protein activity. Further analysis in the same strain revealed that crRNA accumulation was affected by environmental conditions and the presence of genes encoding putative transcriptional regulators97. These results offer a promising genome editing strategy to improve the design and creation of cyanobacterial production strains. For example, CRISPR/Cas systems can be designed to cut all wild type sequences, therefore eliminating the need for a screen or selection to identify mutants; this makes it an incredibly powerful tool for overcoming the presence of multiple chromosomes in cyanobacteria. CRISPR/Cas systems designed to cut wild type DNA also eliminates the need for long periods of segregation to ensure homogeneous populations. By way of illustration, the CRISPR/Cas9 nuclease system has been successfully used for genome editing polyploid yeast Sacharomyces cerevisiae98. In a single step of mutagenesis, authors obtained a mutant which increased cellobiose fermentation rates by over 10-fold. CRISPR/Cas can be designed to cut any nonmutant chromosomes, CRISPR/Cas also presents an additional advantage – that of immunity to bacteriophages. At the industrial level, bacteriophages can cause huge economic losses if they are unwittingly introduced to a production fermentation processes. Phages are usually considered one of the most significant threats in the bioproduct industry and the application of genome editing

ACS Paragon Plus Environment

ACS Synthetic Biology

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 12 of 25

technologies based on CRISPR/Cas can prevent bacteriophage infections86, 99, 100. Cells can also be engineered to have immunity against cyanobacteria specific phages (cyanophages)101, which addresses one concern of large scale production: crop-protection. CRISPR/Cas genome editing systems are a promising tool to overcome many of the challenges of working with cyanobacteria; it has the potential to reduce the time needed to produce mutants or mutant libraries and can overcome the limitations associated with multiple chromosomes. Conclusion Cyanobacteria are a promising group of organisms to engineer as platform strains using synthetic biology techniques for the production of biochemicals and biofuels. Because cyanobacteria can convert light energy and CO2 to high value chemical products they are a truly sustainable platform for bioproduction. Current limitations for the development of cyanobacteria into industrial productions strains are: increasing CO2 fixation, increasing light capture, reducing intracellular oxidative stress, and the improvement and development of sophisticated genetic tools. In this review, we have focused on the current state of synthetic biology tools and areas where the development of new genetic tools can greatly increase our understanding of the physiology of cyanobacteria as well as the development of strains with increased production and tolerance of bioproducts. It has already been demonstrated that cyanobacteria are capable of producing high value bioproducts; the development of sophisticated genomic tools will help make these bioproducts commercially viable. The development of well-defined genetic libraries such as: genomic libraries, expression libraries and knockout libraries will facilitate a better understanding of complex phenotypes in cyanobacteria. Genetic libraries will also provide the basis for more advanced genome searching techniques such as SCALES and TRMR. In addition to genetic library based techniques there are new exciting synthetic biology techniques that need to be utilized such as Crisper/Cas9 and MAGE. One important limitation of genetic analysis in cyanobacteria is their polyploidy nature. Depending on the species and strain DNA copy number can vary between 2 and more than 200, making it difficult of obtain homozygous strain. Recently the Crisper/Cas9 system has been utilized to obtain homologous mutants in a polyploid industrial production strain of yeast, indicating that Crisper/Cas9 genomic editing technique has promise in ployploid strains of cyanobacteria. The leverage of new synthetic biology techniques and application of knowledge discovered in model organisms will allow for the development of cyanobacteria into an important green production platform for bioproducts.

ACS Paragon Plus Environment

Page 13 of 25

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

Table 1. Cyanobacterial strains for synthetic biology

Strains

Synechococcus sp. PCC 7002

Synechococcus elongatus PCC 7942

Synechocystis sp. PCC 6803

Leptolyngbia sp. BL0902

Representative shuttle vectors

Representative promotersa

pAQE17102 pAQ1Ex-Pcpc 22

cpc22 (E,NI) isiAB103 (E,I)

pUC303105 pCB4106 pSG111107

idiA108 (E,I) smt109 (E,I) trp-lac110 (F,I) trc26 (F,I)

pFC1114 pSL1211115

pAM5052/5057/5059122

isiAB+GFP116 (E,I) trc2023 (F,I) A1lacO-127 (F,I)

trc122 (F,I)

tac126 (F,I) petE127 (E,I) psbA1128 (E,NI) nir129 (E,I) a Endogenous(E)/ Foreign (F)/ Inducible (I)/ Non Inducible (NI) b GeneBank Assembly Accession number

Anabaena (Nostoc) sp. PCC7120

pRL1125 pPMQAK123

Polyploidy (chromosomes/cell)

622

3-6111

60-22556

-----------

8-10130

Doubling time (h)

3.522

12112

624

Genomic, Expression or Knockout Libraries

Sequenced annotated Genomeb

Genomic library104 RBS library21

GCA_000019485.1

Genomic and knock out libraries113 Expression library63 Genomic library117, 118 Customized amplification library119 Random insertion DNA library of mutagenesis120 cDNA library121 Expression library64

~20123

Genomic library124

>24131

Genomic library132 Expression library65

ACS Paragon Plus Environment

GCA_000012525.1

GCA_000009725.1

------------

GCA_000009705.1

ACS Synthetic Biology

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 14 of 25

Figure 1. Development and use of genome engineering tools in cyanobacteria. In order to design and rewrite genomes in cyanobacteria, genome-wide search tools (such as SCALEs, MAGE, TRMR and CRSIPR/Cas) need to be developed to aide in gene discovery. Once genes associated with particular phenotypes are known, their gene expression can be re-designed using standardized parts. The design can then be tested in vivo and the strain can be improved using iterative cycles. It is important to note here that molecular tools depicted in each step of the cycle are not necessarily restricted to the step depicted but can be used in any step of the design-buildtest cycle.

ACS Paragon Plus Environment

Page 15 of 25

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

REFERENCES [1] Temme, K., Zhao, D., and Voigt, C. A. (2012) Refactoring the nitrogen fixation gene cluster from Klebsiella oxytoca, Proceedings of the National Academy of Sciences of the United States of America 109, 7085-7090. [2] Camsund, D., Heidorn, T., and Lindblad, P. (2014) Design and analysis of Lacl-repressed promoters and DNA-looping in a cyanobacterium, Journal of Biological Engineering 8. [3] Gunnelius, L., Hakkila, K., Kurkela, J., Wada, H., Tyystjaervi, E., and Tyystjaervi, T. (2014) The omega subunit of the RNA polymerase core directs transcription efficiency in cyanobacteria, Nucleic Acids Research 42, 4606-4614. [4] Khudyakov, I. Y., and Golden, J. W. (2001) Identification and inactivation of three group 2 sigma factor genes in Anabaena sp strain PCC 7120, Journal of Bacteriology 183, 66676675. [5] Fujisawa, T., Narikawa, R., Okamoto, S., Ehira, S., Yoshimura, H., Suzuki, I., Masuda, T., Mochimaru, M., Takaichi, S., Awai, K., Sekine, M., Horikawa, H., Yashiro, I., Omata, S., Takarada, H., Katano, Y., Kosugi, H., Tanikawa, S., Ohmori, K., Sato, N., Ikeuchi, M., Fujita, N., and Ohmori, M. (2010) Genomic structure of an economically important cyanobacterium, Arthrospira (Spirulina) platensis NIES-39, DNA Research 17, 85-103. [6] Imamura, S., and Asayama, M. (2009) Sigma factors for cyanobacterial transcription, Gene Regulation and Systems Biology 3, 65-87. [7] Imamura, S., Yoshihara, S., Nakano, S., Shiozaki, N., Yamada, A., Tanaka, K., Takahashi, H., Asayama, M., and Shirai, M. (2003) Purification, Characterization, and Gene Expression of All Sigma Factors of RNA Polymerase in a Cyanobacterium, Journal of Molecular Biology 325, 857-872. [8] Inaba, M., Suzuki, I., Szalontai, B., Kanesaki, Y., Los, D. A., Hayashi, H., and Murata, N. (2003) Gene-engineered Rigidification of Membrane Lipids Enhances the Cold Inducibility of Gene Expression in Synechocystis, Journal of Biological Chemistry 278, 12191-12198. [9] Tuominen, I., Tyystjärvi, E., and Tyystjärvi, T. (2003) Expression of Primary Sigma Factor (PSF) and PSF-Like Sigma Factors in the Cyanobacterium Synechocystis sp. Strain PCC 6803, Journal of Bacteriology 185, 1116-1119. [10] Marin, K., Suzuki, I., Yamaguchi, K., Ribbeck, K., Yamamoto, H., Kanesaki, Y., Hagemann, M., and Murata, N. (2003) Identification of histidine kinases that act as sensors in the perception of salt stress in Synechocystis sp. PCC 6803, Proceedings of the National Academy of Sciences 100, 9061-9066. [11] Kanesaki, Y., Yamamoto, H., Paithoonrangsarid, K., Shumskaya, M., Suzuki, I., Hayashi, H., and Murata, N. (2007) Histidine kinases play important roles in the perception and signal transduction of hydrogen peroxide in the cyanobacterium, Synechocystis sp. PCC 6803, The Plant Journal 49, 313-324. [12] Paithoonrangsarid, K., Shoumskaya, M. A., Kanesaki, Y., Satoh, S., Tabata, S., Los, D. A., Zinchenko, V. V., Hayashi, H., Tanticharoen, M., Suzuki, I., and Murata, N. (2004) Five Histidine Kinases Perceive Osmotic Stress and Regulate Distinct Sets of Genes in Synechocystis, Journal of Biological Chemistry 279, 53078-53086. [13] Caslake, L. F., Gruber, T. M., and Bryant, D. A. (1997) Expression of two alternative sigma factors of Synechococcus sp. strain PCC 7002 is modulated by carbon and nitrogen stress, Microbiology 143, 3807-3818.

ACS Paragon Plus Environment

ACS Synthetic Biology

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

[14] Kaczmarzyk, D., Anfelt, J., Sarnegrim, A., and Hudson, E. P. (2014) Overexpression of sigma factor SigB improves temperature and butanol tolerance of Synechocystis sp. PCC6803, Journal of Bacteriology 182-183, 54-60. [15] Alper, H., Moxley, J., Nevoigt, E., Fink, G. R., and Stephanopoulos, G. (2006) Engineering Yeast Transcription Machinery for Improved Ethanol Tolerance and Production, Science 314, 1565-1568. [16] Wang, B., Wang, J., Zhang, W., and Meldrum, D. R. (2012) Application of synthetic biology in cyanobacteria and algae, Frontiers in Microbiology 3, 344. [17] Zhou, Y. X., Zhu, S. Y., Cai, C. Z., Yuan, P. F., Li, C. M., Huang, Y. Y., and Wei, W. S. (2014) High-throughput screening of a CRISPR/Cas9 library for functional genomics in human cells, Nature 509, 487-491. [18] Weickert, M. J., Pagratis, M., Curry, S. R., and Blackmore, R. (1997) Stabilization of apoglobin by low temperature increases yield of soluble recombinant hemoglobin in Escherichia coli, Applied and Environmental Microbiology 63, 4313-4320. [19] Allen, S. J., and Holbrook, J. J. (1995) Isolation, sequence and overexpression of the gene encoding NAD-dependent formate dehydrogenase from the methylotrophic yeast Candida methylica, Gene 162, 99-104. [20] Berla, B. M., Saha, R., Immethun, C. M., Maranas, C. D., Moon, T. S., and Pakrasi, H. B. (2013) Synthetic biology of cyanobacteria: unique challenges and opportunities, Frontiers in Microbiology 4. [21] Markley, A. L., Begemann, M. B., Clarke, R. E., Gordon, G. C., and Pfleger, B. F. (2014) Synthetic Biology Toolbox for Controlling Gene Expression in the Cyanobacterium Synechococcus sp. strain PCC 7002, ACS Synthetic Biology. [22] Xu, Y., Alvey, R. M., Byrne, P. O., Graham, J. E., Shen, G., and Bryant, D. A. (2011) Expression of genes in cyanobacteria: adaptation of endogenous plasmids as platforms for high-level gene expression in Synechococcus sp. PCC 7002, Methods in Molecular Biology 684, 273-293. [23] Huang, H. H., Camsund, D., Lindblad, P., and Heidorn, T. (2010) Design and characterization of molecular tools for a Synthetic Biology approach towards developing cyanobacterial biotechnology, Nucleic Acids Research 38, 2577-2593. [24] Heidorn, T., Camsund, D., Huang, H. H., Lindberg, P., Oliveira, P., Stensjo, K., and Lindblad, P. (2011) Synthetic Biology in Cyanobacteria: Engineering and Analyzing Novel Functions, Methods in Enzymology, Vol 497: Synthetic Biology, Methods for Part/Device Characterization and Chassis Engineering, Pt A 497, 539-579. [25] Huang, H. H., and Lindblad, P. (2013) Wide-dynamic-range promoters engineered for cyanobacteria, Journal of Biological Engineering 7. [26] Geerts, D., Bovy, A., de Vrieze, G., Borrias, M., and Weisbeek, P. (1995) Inducible expression of heterologous genes targeted to a chromosomal platform in the cyanobacterium Synechococcus sp. PCC 7942, Microbiology 141 ( Pt 4), 831-841. [27] Guerrero, F., Carbonell, V., Cossu, M., Correddu, D., and Jones, P. R. (2012) Ethylene synthesis and regulated expression of recombinant protein in Synechocystis sp. PCC 6803, PLoS One 7, e50470. [28] Vijayan, V., Jain, I. H., and O'Shea, E. K. (2011) A high resolution map of a cyanobacterial transcriptome, Genome Biology 12, R47. [29] Takeshima, Y., Sugiura, M., and Hagiwara, H. (1994) A novel expression vector for the cyanobacterium, Synechococcus PCC 6301, DNA Research 1, 181-189.

ACS Paragon Plus Environment

Page 16 of 25

Page 17 of 25

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

[30] Atsumi, S., Higashide, W., and Liao, J. C. (2009) Direct photosynthetic recycling of carbon dioxide to isobutyraldehyde, Nature Biotechnology 27, 1177-1180. [31] Na, D., and Lee, D. (2010) RBSDesigner: software for designing synthetic ribosome binding sites that yields a desired level of protein expression, Bioinformatics 26, 26332634. [32] Seo, S. W., Yang, J. S., Kim, I., Yang, J., Min, B. E., Kim, S., and Jung, G. Y. (2013) Predictive design of mRNA translation initiation region to control prokaryotic translation efficiency, Metabolic Engineering 15, 67-74. [33] Salis, H. M., Mirsky, E. A., and Voigt, C. A. (2009) Automated design of synthetic ribosome binding sites to control protein expression, Nature Biotechnology 27, 946U112. [34] Salis, H. M. (2011) The Ribosome Binding Site Calculator, Synthetic Biology, Pt B 498, 1942. [35] Oliver, J. W., Machado, I. M., Yoneda, H., and Atsumi, S. (2014) Combinatorial optimization of cyanobacterial 2,3-butanediol production, Metabolic Engineering 22, 7682. [36] Cardinale, S., and Arkin, A. P. (2012) Contextualizing context for synthetic biology – identifying causes of failure of synthetic biological systems, Biotechnology Journal 7, 856-866. [37] Lu, T. K., Khalil, A. S., and Collins, J. J. (2009) Next-generation synthetic gene networks, Nat Biotech 27, 1139-1150. [38] Mutalik, V. K., Guimaraes, J. C., Cambray, G., Lam, C., Christoffersen, M. J., Mai, Q.-A., Tran, A. B., Paull, M., Keasling, J. D., Arkin, A. P., and Endy, D. (2013) Precise and reliable gene expression via standard transcription and translation initiation elements, Nat Meth 10, 354-360. [39] Steglich, C., Futschik, M. E., Lindell, D., Voss, B., Chisholm, S. W., and Hess, W. R. (2008) The Challenge of Regulation in a Minimal Photoautotroph: Non-Coding RNAs in Prochlorococcus, Plos Genetics 4. [40] Flaherty, B. L., Van Nieuwerburgh, F., Head, S. R., and Golden, J. W. (2011) Directional RNA deep sequencing sheds new light on the transcriptional response of Anabaena sp strain PCC 7120 to combined-nitrogen deprivation, BMC Genomics 12. [41] Eisenhut, M., Georg, J., Klahn, S., Sakurai, I., Mustila, H., Zhang, P. P., Hess, W. R., and Aro, E. M. (2012) The Antisense RNA As1_flv4 in the Cyanobacterium Synechocystis sp PCC 6803 Prevents Premature Expression of the flv4-2 Operon upon Shift in Inorganic Carbon Supply, Journal of Biological Chemistry 287, 33153-33162. [42] Georg, J., Voss, B., Scholz, I., Mitschke, J., Wilde, A., and Hess, W. R. (2009) Evidence for a major role of antisense RNAs in cyanobacterial gene regulation, Molecular Systems Biology 5. [43] Sakurai, I., Stazic, D., Eisenhut, M., Vuorio, E., Steglich, C., Hess, W. R., and Aro, E. M. (2012) Positive regulation of psbA gene expression by cis-encoded antisense RNAs in Synechocystis sp. PCC 6803, Plant Physiology 160, 1000-1010. [44] Mitschke, J., Georg, J., Scholz, I., Sharma, C. M., Dienst, D., Bantscheff, J., Voss, B., Steglich, C., Wilde, A., Vogel, J., and Hess, W. R. (2011) An experimentally anchored map of transcriptional start sites in the model cyanobacterium Synechocystis sp PCC6803, Proceedings of the National Academy of Sciences of the United States of America 108, 2124-2129.

ACS Paragon Plus Environment

ACS Synthetic Biology

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

[45] Isaacs, F. J., Dwyer, D. J., Ding, C. M., Pervouchine, D. D., Cantor, C. R., and Collins, J. J. (2004) Engineered riboregulators enable post-transcriptional control of gene expression, Nature Biotechnology 22, 841-847. [46] Abe, K., Sakai, Y., Nakashima, S., Araki, M., Yoshida, W., Sode, K., and Ikebukuro, K. (2014) Design of riboregulators for control of cyanobacterial (Synechocystis) protein expression, Biotechnology Letters 36, 287-294. [47] Nakahira, Y., Ogawa, A., Asano, H., Oyama, T., and Tozawa, Y. (2013) TheophyllineDependent Riboswitch as a Novel Genetic Tool for Strict Regulation of Protein Expression in Cyanobacterium Synechococcus elongatus PCC 7942, Plant and Cell Physiology 54, 1724-1735. [48] Elhai, J. (1993) Strong and regulated promoters in the cyanobacterium Anabaena PCC 7120, FEMS Microbiology Letters 114, 179-184. [49] Mutsuda, M., Michel, K.-P., Zhang, X., Montgomery, B. L., and Golden, S. S. (2003) Biochemical Properties of CikA, an Unusual Phytochrome-like Histidine Protein Kinase That Resets the Circadian Clock in Synechococcus elongatus PCC 7942, Journal of Biological Chemistry 278, 19102-19110. [50] Barker, M., de Vries, R., Nield, J., Komenda, J., and Nixon, P. J. (2006) The deg proteases protect Synechocystis sp. PCC 6803 during heat and light stresses but are not essential for removal of damaged D1 protein during the photosystem two repair cycle, The Journal of Biological Chemistry 281, 30347-30355. [51] Landry, B. P., Stockel, J., and Pakrasi, H. B. (2013) Use of Degradation Tags To Control Protein Levels in the Cyanobacterium Synechocystis sp Strain PCC 6803, Applied and Environmental Microbiology 79, 2833-2835. [52] Skarstad, K., Steen, H. B., and Boye, E. (1983) Cell cycle parameters of slowly growing Escherichia coli B/r studied by flow cytometry, Journal of Bacteriology 154, 656-662. [53] Fossum, S., Crooke, E., and Skarstad, K. (2007) Organization of sister origins and replisomes during multifork DNA replication in Escherichia coli, The EMBO Journal 26, 4514-4522. [54] Bremer, H., and Dennis, P. (1996) Modulation of chemical composition and other parameters of the cell by growth rate. Escherichia coli and Salmonella, In Cellular and Molecular Biology (Neidhart, F., Ed.), pp 1553-1569, University of Michigan Medical School Ann Arbor, MI. [55] Pecoraro, V., Zerulla, K., Lange, C., and Soppa, J. (2011) Quantification of Ploidy in Proteobacteria Revealed the Existence of Monoploid, (Mero-)Oligoploid and Polyploid Species, PLoS One 6, e16392. [56] Griese, M., Lange, C., and Soppa, J. (2011) Ploidy in cyanobacteria, FEMS Microbiology Letters 323, 124-131. [57] Mori, T., Binder, B., and Johnson, C. H. (1996) Circadian gating of cell division in cyanobacteria growing with average doubling times of less than 24 hours, Proceedings of the National Academy of Sciences of the United States of America 93, 10183-10188. [58] Boyle, N. R., Reynolds, T. S., Evans, R., Lynch, M., and Gill, R. T. (2013) Recombineering to homogeneity: extension of multiplex recombineering to large-scale genome editing, Biotechnology Journal 8, 515-522. [59] Reyes, L. H., Almario, M. P., and Kao, K. C. (2011) Genomic library screens for genes involved in n-butanol tolerance in Escherichia coli, PLoS One 6, e17678.

ACS Paragon Plus Environment

Page 18 of 25

Page 19 of 25

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

[60] Warnecke, T. E., Lynch, M. D., Karimpour-Fard, A., Lipscomb, M. L., Handke, P., Mills, T., Ramey, C. J., Hoang, T., and Gill, R. T. (2010) Rapid dissection of a complex phenotype through genomic-scale mapping of fitness altering genes, Metabolic Engineering 12, 241-250. [61] Oliver, J. W., Machado, I. M., Yoneda, H., and Atsumi, S. (2013) Cyanobacterial conversion of carbon dioxide to 2,3-butanediol, Proceedings of the National Academy of Sciences of the United States of America 110, 1249-1254. [62] Ducat, D. C., Avelar-Rivas, J. A., Way, J. C., and Silver, P. A. (2012) Rerouting carbon flux to enhance photosynthetic productivity, Applied and Environmental Microbiology 78, 2660-2668. [63] Reddy, K. J., Masamoto, K., Sherman, D. M., and Sherman, L. A. (1989) DNA sequence and regulation of the gene (cbpA) encoding the 42-kilodalton cytoplasmic membrane carotenoprotein of the cyanobacterium Synechococcus sp. strain PCC 7942, Journal of Bacteriology 171, 3486-3493. [64] Suzuki, J. Y., and Bauer, C. E. (1995) A prokaryotic origin for light-dependent chlorophyll biosynthesis of plants, Proceedings of the National Academy of Sciences of the United States of America 92, 3749-3753. [65] Liu, D., and Golden, J. W. (2002) hetL overexpression stimulates heterocyst formation in Anabaena sp. strain PCC 7120, Journal of Bacteriology 184, 6873-6881. [66] Allen, D., and Tanner, K. (2002) Approaches in cell biology teaching, Cell Biol Educ 1, 3-5. [67] Baba, T., Ara, T., Hasegawa, M., Takai, Y., Okumura, Y., Baba, M., Datsenko, K. A., Tomita, M., Wanner, B. L., and Mori, H. (2006) Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection, Molecular Systems Biology 2, 2006 0008. [68] Joska, T. M., Mashruwala, A., Boyd, J. M., and Belden, W. J. (2014) A universal cloning method based on yeast homologous recombination that is simple, efficient, and versatile, Journal of Microbiological Methods 100, 46-51. [69] Viola, S., Ruhle, T., and Leister, D. (2014) A single vector-based strategy for marker-less gene replacement in Synechocystis sp. PCC 6803, Microbial Cell Factories 13, 4. [70] Tan, X., Liang, F., Cai, K., and Lu, X. (2013) Application of the FLP/FRT recombination system in cyanobacteria for construction of markerless mutants, Applied Microbiology and Biotechnology 97, 6373-6382. [71] Watabe, K., Mimuro, M., and Tsuchiya, T. (2014) Development of a high-frequency in vivo transposon mutagenesis system for Synechocystis sp. PCC 6803 and Synechococcus elongatus PCC 7942, Plant & Cell Physiology 55, 2017-2026. [72] McEwen, J. T., Machado, I. M., Connor, M. R., and Atsumi, S. (2013) Engineering Synechococcus elongatus PCC 7942 for continuous growth under diurnal conditions, Applied and Environmental Microbiology 79, 1668-1675. [73] Wach, A., Brachat, A., Pohlmann, R., and Philippsen, P. (1994) New heterologous modules for classical or PCR-based gene disruptions in Saccharomyces cerevisiae, Yeast 10, 17931808. [74] Baudin, A., Ozier-Kalogeropoulos, O., Denouel, A., Lacroute, F., and Cullin, C. (1993) A simple and efficient method for direct gene deletion in Saccharomyces cerevisiae, Nucleic Acids Research 21, 3329-3330. [75] Lynch, M. D., Warnecke, T., and Gill, R. T. (2007) SCALEs: multiscale analysis of library enrichment, Nature Methods 4, 87-93.

ACS Paragon Plus Environment

ACS Synthetic Biology

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

[76] Singh, A., Lynch, M. D., and Gill, R. T. (2009) Genes restoring redox balance in fermentation-deficient E. coli NZN111, Metabolic Engineering 11, 347-354. [77] Sandoval, N. R., Mills, T. Y., Zhang, M., and Gill, R. T. (2011) Elucidating acetate tolerance in E. coli using a genome-wide approach, Metabolic Engineering 13, 214-224. [78] Woodruff, L. B. A., May, B. L., Warner, J. R., and Gill, R. T. (2013) Towards a metabolic engineering strain "commons": An Escherichia coli platform strain for ethanol production, Biotechnology and Bioengineering 110, 1520-1526. [79] Warner, J. R., Reeder, P. J., Karimpour-Fard, A., Woodruff, L. B. A., and Gill, R. T. (2010) Rapid profiling of a microbial genome using mixtures of barcoded oligonucleotides, Nature Biotechnology 28, 856-U138. [80] Shoemaker, D. D., Lashkari, D. A., Morris, D., Mittmann, M., and Davis, R. W. (1996) Quantitative phenotypic analysis of yeast deletion mutants using a highly parallel molecular bar-coding strategy, Nature Genetics 14, 450-456. [81] Wang, H. H., Isaacs, F. J., Carr, P. A., Sun, Z. Z., Xu, G., Forest, C. R., and Church, G. M. (2009) Programming cells by multiplex genome engineering and accelerated evolution, Nature 460, 894-U133. [82] Wang, H. H., and Church, G. M. (2011) Multiplexed genome engineering and genotyping methods applications for synthetic biology and metabolic engineering, Methods in Enzymology 498, 409-426. [83] Wang, H. H., Kim, H., Cong, L., Jeong, J., Bang, D., and Church, G. M. (2012) Genomescale promoter engineering by coselection MAGE, Nature Methods 9, 591-+. [84] Isaacs, F. J., Carr, P. A., Wang, H. H., Lajoie, M. J., Sterling, B., Kraal, L., Tolonen, A. C., Gianoulis, T. A., Goodman, D. B., Reppas, N. B., Emig, C. J., Bang, D., Hwang, S. J., Jewett, M. C., Jacobson, J. M., and Church, G. M. (2011) Precise Manipulation of Chromosomes in Vivo Enables Genome-Wide Codon Replacement, Science 333, 348353. [85] DiCarlo, J. E., Conley, A. J., Penttila, M., Jantti, J., Wang, H. H., and Church, G. M. (2013) Yeast Oligo-Mediated Genome Engineering (YOGE), ACS Synthetic Biology 2, 741-749. [86] Barrangou, R., Fremaux, C., Deveau, H., Richards, M., Boyaval, P., Moineau, S., Romero, D. A., and Horvath, P. (2007) CRISPR provides acquired resistance against viruses in prokaryotes, Science 315, 1709-1712. [87] Makarova, K. S., Haft, D. H., Barrangou, R., Brouns, S. J. J., Charpentier, E., Horvath, P., Moineau, S., Mojica, F. J. M., Wolf, Y. I., Yakunin, A. F., van der Oost, J., and Koonin, E. V. (2011) Evolution and classification of the CRISPR-Cas systems, Nature Reviews Microbiology 9, 467-477. [88] Jiang, W. Y., Bikard, D., Cox, D., Zhang, F., and Marraffini, L. A. (2013) RNA-guided editing of bacterial genomes using CRISPR-Cas systems, Nature Biotechnology 31, 233239. [89] DiCarlo, J. E., Norville, J. E., Mali, P., Rios, X., Aach, J., and Church, G. M. (2013) Genome engineering in Saccharomyces cerevisiae using CRISPR-Cas systems, Nucleic Acids Research 41, 4336-4343. [90] Feng, Z. Y., Zhang, B. T., Ding, W. N., Liu, X. D., Yang, D. L., Wei, P. L., Cao, F. Q., Zhu, S. H., Zhang, F., Mao, Y. F., and Zhu, J. K. (2013) Efficient genome editing in plants using a CRISPR/Cas system, Cell Research 23, 1229-1232. [91] Yang, L., Mali, P., Kim-Kiselak, C., and Church, G. (2014) CRISPR-Cas-mediated targeted genome editing in human cells, Methods in Molecular Biology 1114, 245-267.

ACS Paragon Plus Environment

Page 20 of 25

Page 21 of 25

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

[92] Richter, C., Chang, J. T., and Fineran, P. C. (2012) Function and Regulation of Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) / CRISPR Associated (Cas) Systems, Viruses-Basel 4, 2291-2311. [93] Fu, Y. F., Sander, J. D., Reyon, D., Cascio, V. M., and Joung, J. K. (2014) Improving CRISPR-Cas nuclease specificity using truncated guide RNAs, Nature Biotechnology 32, 279-284. [94] Cho, S. W., Kim, S., Kim, Y., Kweon, J., Kim, H. S., Bae, S., and Kim, J. S. (2014) Analysis of off-target effects of CRISPR/Cas-derived RNA-guided endonucleases and nickases, Genome Research 24, 132-141. [95] Cai, F., Axen, S. D., and Kerfeld, C. A. (2013) Evidence for the widespread distribution of CRISPR-Cas system in the Phylum Cyanobacteria, RNA Biology 10, 687-693. [96] Scholz, I., Lange, S. J., Hein, S., Hess, W. R., and Backofen, R. (2013) CRISPR-Cas Systems in the Cyanobacterium Synechocystis sp PCC6803 Exhibit Distinct Processing Pathways Involving at Least Two Cas6 and a Cmr2 Protein, Plos One 8. [97] Hein, S., Scholz, I., Voss, B., and Hess, W. R. (2013) Adaptation and modification of three CRISPR loci in two closely related cyanobacteria, RNA Biology 10, 852-864. [98] Ryan, O. W., Skerker, J. M., Maurer, M. J., Li, X., Tsai, J. C., Poddar, S., Lee, M. E., DeLoache, W., Dueber, J. E., Arkin, A. P., and Cate, J. H. (2014) Selection of chromosomal DNA libraries using a multiplex CRISPR system, eLife 3. [99] Barrangou, R., and Horvath, P. (2012) CRISPR. New Horizons in Phage Resistance and Strain identification, Annual Review of Food Science and Technology, Vol 3 3, 143-162. [100] Barrangou, R., and Marraffini, L. A. (2014) CRISPR-Cas Systems: Prokaryotes Upgrade to Adaptive Immunity, Molecular Cell 54, 234-244. [101] Xia, H., Li, T., Deng, F., and Hu, Z. (2013) Freshwater cyanophages, Virologica Sinica 28, 253-259. [102] Buzby, J. S., Porter, R. D., and Stevens, S. E., Jr. (1985) Expression of the Escherichia coli lacZ gene on a plasmid vector in a cyanobacterium, Science 230, 805-807. [103] Boyanapalli, R., Bullerjahn, G. S., Pohl, C., Croot, P. L., Boyd, P. W., and McKay, R. M. (2007) Luminescent whole-cell cyanobacterial bioreporter for measuring Fe availability in diverse marine environments, Applied and Environmental Microbiology 73, 10191024. [104] Kimura, A., Hamada, T., Morita, E. H., and Hayashi, H. (2002) A high temperaturesensitive mutant of Synechococcus sp. PCC 7002 with modifications in the endogenous plasmid, pAQ1, Plant & Cell Physiology 43, 217-223. [105] Iwaki, T., Haranoh, K., Inoue, N., Kojima, K., Satoh, R., Nishino, T., Wada, S., Ihara, H., Tsuyama, S., Kobayashi, H., and Wadano, A. (2006) Expression of foreign type I ribulose-1,5-bisphosphate carboxylase/ oxygenase (EC 4.1.1.39) stimulates photosynthesis in cyanobacterium Synechococcus PCC7942 cells, Photosynthesis Research 88, 287-297. [106] Gendel, S., Straus, N., Pulleyblank, D., and Williams, J. (1983) Shuttle cloning vectors for the cyanobacterium Anacystis nidulans, Journal of Bacteriology 156, 148-154. [107] Golden, S. S., and Sherman, L. A. (1984) Optimal conditions for genetic transformation of the cyanobacterium Anacystis nidulans R2, Journal of Bacteriology 158, 36-42. [108] Michel, K. P., Pistorius, E. K., and Golden, S. S. (2001) Unusual regulatory elements for iron deficiency induction of the idiA gene of Synechococcus elongatus PCC 7942, Journal of Bacteriology 183, 5015-5024.

ACS Paragon Plus Environment

ACS Synthetic Biology

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

[109] Erbe, J. L., Adams, A. C., Taylor, K. B., and Hall, L. M. (1996) Cyanobacteria carrying an smt-lux transcriptional fusion as biosensors for the detection of heavy metal cations, Journal of Industrial Microbiology 17, 80-83. [110] Niederholtmeyer, H., Wolfstadter, B. T., Savage, D. F., Silver, P. A., and Way, J. C. (2010) Engineering cyanobacteria to synthesize and export hydrophilic products, Applied and Environmental Microbiology 76, 3462-3466. [111] Binder, B. J., and Chisholm, S. W. (1990) Relationship between DNA cycle and growth rate in Synechococcus sp. strain PCC 6301, Journal of Bacteriology 172, 2313-2319. [112] Chen, A. H., Afonso, B., Silver, P. A., and Savage, D. F. (2012) Spatial and temporal organization of chromosome duplication and segregation in the cyanobacterium Synechococcus elongatus PCC 7942, PLoS One 7, e47837. [113] Holtman, C. K., Chen, Y., Sandoval, P., Gonzales, A., Nalty, M. S., Thomas, T. L., Youderian, P., and Golden, S. S. (2005) High-throughput functional analysis of the Synechococcus elongatus PCC 7942 genome, DNA Research 12, 103-115. [114] Mermet-Bouvier, P., and Chauvat, F. (1994) A conditional expression vector for the cyanobacteria Synechocystis sp. strains PCC6803 and PCC6714 or Synechococcus sp. strains PCC7942 and PCC6301, Current Microbiology 28, 145-148. [115] Ng, W. O., Zentella, R., Wang, Y., Taylor, J. S., and Pakrasi, H. B. (2000) PhrA, the major photoreactivating factor in the cyanobacterium Synechocystis sp. strain PCC 6803 codes for a cyclobutane-pyrimidine-dimer-specific DNA photolyase, Archives of Microbiology 173, 412-417. [116] Kunert, A., Vinnemeier, J., Erdmann, N., and Hagemann, M. (2003) Repression by Fur is not the main mechanism controlling the iron-inducible isiAB operon in the cyanobacterium Synechocystis sp. PCC 6803, FEMS Microbiology Letters 227, 255-262. [117] Ogawa, T., Marco, E., and Orus, M. I. (1994) A gene (ccmA) required for carboxysome formation in the cyanobacterium Synechocystis sp. strain PCC6803, Journal of Bacteriology 176, 2374-2378. [118] Voshol, G. P., Meyer, V., and Hondel, C. A. M. J. v. d. (2015) GTP_Binding protein Era: a novel gene target for biofuel production, BMC Biochemistry 15. [119] Singh, A. K., and Sherman, L. A. (2000) Identification of iron-responsive, differential gene expression in the cyanobacterium Synechocystis sp. strain PCC 6803 with a customized amplification library, Journal of Bacteriology 182, 3536-3543. [120] Yin, C., Li, W., Du, Y., Kong, R., and Xu, X. (2007) Identification of a gene, ccr-1 (sll1242), required for chill-light tolerance and growth at 15° C in Synechocystis sp. PCC 6803, Microbiology 153, 1261-1267. [121] Zhu, H., Ren, X., Wang, J., Song, Z., Shi, M., Qiao, J., Tian, X., Liu, J., Chen, L., and Zhang, W. (2013) Integrated OMICS guided engineering of biofuel butanol-tolerance in photosynthetic Synechocystis sp. PCC 6803, Biotechnology for Biofuels 6, 106. [122] Ma, A. T., Schmidt, C. M., and Golden, J. W. (2014) Regulation of Gene Expression in Diverse Cyanobacterial Species by Using Theophylline-Responsive Riboswitches, Applied and Environmental Microbiology 80, 6704-6713. [123] Taton, A., Lis, E., Adin, D. M., Dong, G., Cookson, S., Kay, S. A., Golden, S. S., and Golden, J. W. (2012) Gene transfer in Leptolyngbya sp. strain BL0902, a cyanobacterium suitable for production of biomass and bioproducts, PLoS One 7, e30901. [124] Clancy, K., Laptewicz, M., Tang, P., Liang, X., Kay, S., Zhang, W., Taton, A., Bishop, J., Lis, E., Taliana, A., Golden, S., Peterson, T., Golden, J., and Costa, G. (2012) De novo

ACS Paragon Plus Environment

Page 22 of 25

Page 23 of 25

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

Assembly of a Novel Filamentous Blue-green Algal Genome for Leptolyngbya sp. Strain BL0902 Enabled by a Novel, Extra-long Read Sequencing Protocol, In Plant and Animal Genome XX, San Diego, CA. [125] Wolk, C. P., Vonshak, A., Kehoe, P., and Elhai, J. (1984) Construction of shuttle vectors capable of conjugative transfer from Escherichia coli to nitrogen-fixing filamentous cyanobacteria, Proceedings of the National Academy of Sciences of the United States of America 81, 1561-1565. [126] Desplancq, D., Kieffer, B., Schmidt, K., Posten, C., Forster, A., Oudet, P., Strub, J. M., Van Dorsselaer, A., and Weiss, E. (2001) Cost-effective and uniform 13C- and 15Nlabeling of the 24-kDa N-terminal domain of the Escherichia coli gyrase B by overexpression in the photoautotrophic cyanobacterium Anabaena sp. PCC 7120, Protein Expression and Purification 23, 207-217. [127] Higa, K. C., and Callahan, S. M. (2010) Ectopic expression of hetP can partially bypass the need for hetR in heterocyst differentiation by Anabaena sp. strain PCC 7120, Molecular Microbiology 77, 562-574. [128] Chaurasia, A. K., and Apte, S. K. (2011) Improved eco-friendly recombinant Anabaena sp. strain PCC7120 with enhanced nitrogen biofertilizer potential, Applied and Environmental Microbiology 77, 395-399. [129] Desplancq, D., Bernard, C., Sibler, A. P., Kieffer, B., Miguet, L., Potier, N., Van Dorsselaer, A., and Weiss, E. (2005) Combining inducible protein overexpression with NMR-grade triple isotope labeling in the cyanobacterium Anabaena sp. PCC 7120, Biotechniques 39, 405-411. [130] Wolk, C. P., Cai, Y., Cardemil, L., Flores, E., Hohn, B., Murry, M., Schmetterer, G., Schrautemeier, B., and Wilson, R. (1988) Isolation and complementation of mutants of Anabaena sp. strain PCC 7120 unable to grow aerobically on dinitrogen, Journal of Bacteriology 170, 1239-1244. [131] Zhang, W., Du, Y., Khudyakov, I., Fan, Q., Gao, H., Ning, D., Wolk, C. P., and Xu, X. (2007) A gene cluster that regulates both heterocyst differentiation and pattern formation in Anabaena sp. strain PCC 7120, Molecular Microbiology 66, 1429-1443. [132] Kaneko, T., Nakamura, Y., Wolk, C. P., Kuritz, T., Sasamoto, S., Watanabe, A., Iriguchi, M., Ishikawa, A., Kawashima, K., Kimura, T., Kishida, Y., Kohara, M., Matsumoto, M., Matsuno, A., Muraki, A., Nakazaki, N., Shimpo, S., Sugimoto, M., Takazawa, M., Yamada, M., Yasuda, M., and Tabata, S. (2001) Complete genomic sequence of the filamentous nitrogen-fixing cyanobacterium Anabaena sp. strain PCC 7120, DNA Research 8, 205-213; 227-253.

ACS Paragon Plus Environment

ACS Synthetic Biology

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

This figure is for the abstract 482x482mm (96 x 96 DPI)

ACS Paragon Plus Environment

Page 24 of 25

Page 25 of 25

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

ACS Synthetic Biology

Figure 1. Development and use of genome engineering tools in cyanobacteria. In order to design and rewrite genomes in cyanobacteria, genome-wide search tools (such as SCALEs, MAGE, TRMR and CRSIPR/Cas) need to be developed to aide in gene discovery. Once genes associated with particular phenotypes are known, their gene expression can be re-designed using standardized parts. The design can then be tested in vivo and the strain can be improved using iterative cycles. It is important to note here that molecular tools depicted in each step of the cycle are not necessarily restricted to the step depicted but can be used in any step of the design-build-test cycle. 1422x1422mm (96 x 96 DPI)

ACS Paragon Plus Environment