Subscriber access provided by MT SINAI SCH OF MED
Article
Multiplexed sgRNA Expression Allows Versatile Single Nonrepetitive DNA Labeling and Endogenous Gene Regulation Shipeng Shao, Lei Chang, Yuao Sun, Yingping Hou, Xiaoying Fan, and Yujie Sun ACS Synth. Biol., Just Accepted Manuscript • DOI: 10.1021/acssynbio.7b00268 • Publication Date (Web): 29 Aug 2017 Downloaded from http://pubs.acs.org on August 30, 2017
Just Accepted “Just Accepted” manuscripts have been peer-reviewed and accepted for publication. They are posted online prior to technical editing, formatting for publication and author proofing. The American Chemical Society provides “Just Accepted” as a free service to the research community to expedite the dissemination of scientific material as soon as possible after acceptance. “Just Accepted” manuscripts appear in full in PDF format accompanied by an HTML abstract. “Just Accepted” manuscripts have been fully peer reviewed, but should not be considered the official version of record. They are accessible to all readers and citable by the Digital Object Identifier (DOI®). “Just Accepted” is an optional service offered to authors. Therefore, the “Just Accepted” Web site may not include all articles that will be published in the journal. After a manuscript is technically edited and formatted, it will be removed from the “Just Accepted” Web site and published as an ASAP article. Note that technical editing may introduce minor changes to the manuscript text and/or graphics which could affect content, and all legal disclaimers and ethical guidelines that apply to the journal pertain. ACS cannot be held responsible for errors or consequences arising from the use of information contained in these “Just Accepted” manuscripts.
ACS Synthetic Biology is published by the American Chemical Society. 1155 Sixteenth Street N.W., Washington, DC 20036 Published by American Chemical Society. Copyright © American Chemical Society. However, no copyright claim is made to original U.S. Government works, or works produced by employees of any Commonwealth realm Crown government in the course of their duties.
Page 1 of 32
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
1
Multiplexed sgRNA Expression Allows Versatile Single Non-
2
repetitive DNA Labeling and Endogenous Gene Regulation
3
Shipeng Shao1, Lei Chang1, Yuao Sun1, Yingping Hou 1, Xiaoying Fan1, Yujie Sun1,*
4
1
5
(BIOPIC), School of Life Sciences, Peking University, Beijing 100871, China.
State Key Laboratory of Membrane Biology, Biodynamic Optical Imaging Center
6
*E-mail:
[email protected] 7
*Phone#: 86-10-62744060
8
1
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
9
ABSTRACT: :
10
The CRISPR/Cas9 system has made significant contribution to genome
11
editing, gene regulation and chromatin studies in recent years. High-throughput
12
and systematic investigations into the multiplexed biological systems require
13
simultaneous expression and coordinated functioning of multiple sgRNAs.
14
However, current co-transfection based sgRNA co-expression systems remain
15
inefficient and virus-based transfection approaches are relatively costly and labor
16
intensive. Here we established a vector-independent method allowing multiple
17
sgRNA expression cassettes to be assembled in series into a single plasmid. This
18
synthetic biology-based strategy excels in its efficiency, controllability and
19
scalability. Taking the flexibility advantage of this all-in-one sgRNA expressing
20
system, we further explored its applications in single non-repetitive genomic locus
21
imaging as well as coordinated gene regulation in live cells. With its full potency,
22
our method will facilitate the research in understanding genome structure,
23
function and dynamics.
24 25
Keywords: CRISPR/Cas9, Single-guide RNA, Chromosomal dynamics, Single
26
non-repetitive gene, Golden Gate cloning, Transcription regulation
27
2
ACS Paragon Plus Environment
Page 2 of 32
Page 3 of 32
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
28
ACS Synthetic Biology
INTRODUCTION
29
Since its discovery, the clustered regularly inter-spaced short palindromic
30
repeats (CRISPR)-associated Cas9 nuclease has largely revolutionized the fields
31
in genome editing, gene regulation and chromatin studies in live cells
32
Compared to protein-based recognition systems such as ZFN and TALE,
33
CRISPR recognizes target DNA via Watson-Crick base pairing, allowing it to
34
perform high-throughput chromatin targeting in an effective and efficient way 2.
35
Emerging new versions of the CRISPR/Cas9 system have been adopted to
36
expand our capacity for precise genome manipulation
37
carry tremendous potentials in both biomedical researches and therapeutic
38
genomic targeting 8.
39
1
.
3-7
. These powerful toolkits
Simultaneous expression and coordinated functioning of multiple sgRNAs in 9
40
live cells are often necessary in circumstances such as simultaneous knockout
41
and regulation of multiple endogenous genes 10. Notably, CRISPR-based labeling
42
and imaging of genomic loci with non-repetitive sequences in live cells also
43
require to tile the loci with multiple sgRNAs 11.
44
There are several approaches available for simultaneous delivery of multiple
45
sgRNAs to scale up the targeting capacity of the CRISPR/Cas9 system. For
46
instance, cells can be transfected with a mixture of multiple sgRNA plasmids by
47
chemi-transfection or electro-transfection
48
transfection of different plasmids is relatively inefficient and also results in
49
variable expression levels of each sgRNA in single cells. Similarly, delivery of
50
multiple sgRNAs using lentivirus infection
12, 13
. However, such transient co-
11
can also lead to variable efficacy in
3
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Page 4 of 32
51
delivery and expression. Alternatively, although microinjection can deliver multiple
52
sgRNAs into a single cell without the limitation of plasmid type and number
53
suffers from low throughput and requires special equipment and superb skills.
54
With the evolution of synthetic biology, new cloning methods have emerged in the
55
last two years to achieve high speed, efficiency and flexibility in assembly of
56
multiple DNA fragments
57
used for fast assembly of TALE repeats
58
sgRNAs
59
reaction is quite limited following current protocols as the assembly efficiency
60
decreases dramatically as the number of cassettes increases. Even though a
61
recently developed hierarchical assembly method could assemble multiple
62
sgRNAs, as it relies on receptor plasmids for every step, the whole procedure
63
requires several rounds of transformation, verification, plasmid extraction, and
64
thus on average takes more than 20 days to assembly 20 sgRNA expression
65
cassettes into a single plasmid
66
sgRNA expression system needs to balance in ease of implementation, delivery
67
efficiency, precise stoichiometry control, and cost of time.
14
, it
15-17
. For instance, the Golden Gate cloning method was 18
and for assembly of short serial
9, 19
. However, the number of sgRNA cassettes assembled in one
20
. Collectively, a well-rounded multiplexing
68
In this work, we have developed a hierarchical cloning strategy to assemble
69
multiple sgRNA expression cassettes into a single vector. We demonstrated that
70
this new approach is not only robust and time-saving in assembly of different
71
numbers of sgRNA expression cassettes at will but also able to assure precise
72
control of stoichiometry of different sgRNAs. Using this all-in-one sgRNA
73
expressing system, we successfully labeled and imaged non-repetitive genomic
4
ACS Paragon Plus Environment
Page 5 of 32
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
74
loci, MUC4 and HER2 genes, in living human cells. Additionally, we demonstrated
75
that the all-in-one sgRNA expressing system is highly efficient in gene activation
76
and repression. Using multiple sgRNAs to tile the promoter of endogenous gene
77
SOX2 to recruit multiple transcriptional regulators, we found that the regulation
78
efficiency by this synergistic gene regulation approach was superior to that by
79
single sgRNA and co-transfection of multiple sgRNAs. We also showed that the
80
all-in-one sgRNA expressing system allows parallel expression of different
81
sgRNAs with relatively stable stoichiometry in their expression level, leading to
82
reliable simultaneous activation and/or repression of multiple endogenous genes
83
in the same cell. In summary, our modular sgRNA assembly system can be
84
readily automated and will be highly useful for applications in cell biology and
85
genetics, such as gene editing, gene regulation, pathway engineering and live cell
86
chromatin labeling.
87
5
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
88
RESULTS
89
Plasmid-independent hierarchical assembly of multiple sgRNA expression
90
cassettes
91
To achieve high-throughput assembly of multiple sgRNA cassettes, here we
92
have developed a plasmid-independent, PCR-based hierarchical assembly
93
strategy. Using a series of three one-pot Golden Gate cloning with type II
94
restriction endonuclease, we demonstrated the robustness and high efficiency of
95
our method in assembly of repeated units (Figure 1A). Firstly, PCR-amplified
96
conventional or functionally modified sgRNA scaffolds and U6 promoter were
97
ligated with respectively synthesized targeting oligos by Golden Gate cloning.
98
Then the reaction mix was amplified by PCR with U6-forward and sgRNA-
99
scaffold-reverse primers without purification (Figure 1B). Thirdly, we designed
100
junctions allowing ordered ligation to form the intermediate fragments. We used
101
these junctions to assemble five sgRNA cassettes into one by a second round of
102
Golden Gate cloning (Figure 1C). Lastly, a third Golden Gate reaction was
103
performed to assemble the five-cassette-intermediates into the final construct
104
carrying all 20 sgRNAs. The whole assembly procedure takes about 4-5 days.
105
Random selection of end clones were verified (Figure 1D). Aside from high-
106
efficiency, high-speed and high-throughput in generating a multi-sgRNA cluster,
107
our approach also provides the flexibility for assembling different numbers of
108
sgRNAs using varied combinations of the forward and reverse primers. To see
109
whether the multiplexing sgRNA expression system will influence the expression
110
level of each sgRNA, we performed RT-PCR to quantify the expression level of
6
ACS Paragon Plus Environment
Page 6 of 32
Page 7 of 32
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
111
individual sgRNA (Supplementary Figure 1). The results indicated that all sgRNAs
112
were expressed at a similar expression level.
113
Establishment of a stable cell line expressing the SunTag system
114
We fused the SunTag signal amplification system
21
to dCas9 for the
115
enhancement of fluorescence signal that one sgRNA can recruit. A previous
116
study has shown that telomeres labeled with dCas9-SunTag were nearly 20-fold
117
brighter compared to those labeled by dCas9 without alteration of the telomere
118
mobility
119
SunTag has not been realized. We reasoned that with dCas9-SunTag,
120
simultaneous expression of multiple sgRNAs in a single cell can realize non-
121
repetitive chromosomal locus labeling with high quality and efficiency (Figure 2A).
122
Transient expression of two components of the SunTag system usually
123
results in either incomplete labeling or excessive background from unbound
124
molecules. In order to minimize those effects, we integrated the SunTag
125
expression cassette into the genome using the PiggyBac transposon system
126
and generated a stable cell line with minimal expression level of dCas9-
127
(GCN4)X24-P2A-BFP and scFV-sfGFP-GB1 (Figure 2B and Supplementary
128
Figures 2 & 3). With the low expression level, diffusing dCas9 single molecules
129
were able to be observed in the GFP channel (Supplementary Figure 4A) and
130
repetitive chromosomal loci, such as telomeres or the 2rd exon of MUC4 gene,
131
were successfully labeled with dCas9-SunTag in live MDA-MB-231 cells
132
(Supplementary Figure 4B & C). In order to compare the signal-to-noise ratio
133
(SNR) between dCas9-GFP and the dCas9-SunTag system, the tandem array of
21
. However, labeling of non-repetitive chromosomal loci using dCas9-
7
ACS Paragon Plus Environment
22
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
134
5S ribosomal DNA (5S rDNA) on chromosome 1, which consists of 17 repeats,
135
was labeled and quantified (Supplementary Figure 5). The results showed that
136
the SNR was improved nearly 6 folds with dCas9-SunTag. The significant
137
improvement in SNR allows imaging with low laser power and short exposure
138
time, enabling long-term tracking of chromosomal loci.
139
Single non-repetitive sequence labeling by multiplexed sgRNA expression
140
We next tried the multiple sgRNA expressing strategy for non-repetitive
141
sequence labeling in living cells. We first chose MUC4 gene as a demonstration
142
as it has tandem repeat regions advantageous for the validation by FISH.
143
Specifically, 20 sgRNAs were deigned against a stretch of DNA (about 1.5 kb)
144
and cloned into one plasmid by the hierarchical assembly method. The SunTag
145
cell line was transfected with the all-in-one plasmid and imaged 24 h after
146
transfection.
147
To verify the specificity and efficiency of non-repetitive MUC4 gene labeling,
148
we performed two-color imaging with CRISPR probes targeting the non-repetitive
149
sequence in the first intron of MUC4 and FISH probes targeting the polymorphic
150
tandem repeats in the third exon. Because the harsh conditions of FISH sample
151
preparation disrupted the fluorescence signal of GFP, we performed immuno-
152
fluorescence against GFP. Colocalization of the two channels indicated that the
153
CRISPR probes indeed labeled the MUC4 locus (Figure 2C). In order to quantify
154
the robustness of our method, we also measured the cell labeling efficiency and
155
loci number per cell (Supplementary Figure 6).
8
ACS Paragon Plus Environment
Page 8 of 32
Page 9 of 32
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
156
By labeling specific non-repetitive genomic loci with CRISPR, we could
157
investigate chromosome reorganization at different cell cycle stages (Figure 2D &
158
Supplementary Figure 7). Similar results were also obtained for another non-
159
repetitive gene HER2 (Supplementary Figure 8). These results indicate that
160
combination of the all-in-one sgRNA expression plasmid and SunTag signal
161
amplification system can be used to label arbitrary sequences in the genome,
162
visualize their positions in the nucleus, and study the relationship between the
163
gene position and its expression 23.
164
We further tracked the dynamics of the MUC4 locus labeled with dCas9-
165
SugTag (Supplementary Movie 1). Two typical trajectories in the same cell were
166
observed (Figure 3A). No obvious correlation was found between the two allelic
167
loci, indicating the heterogeneity of the surroundings of allelic genes. We
168
calculated the average mean square displacement of 17 different loci and
169
extracted the anomalous diffusion scaling exponents (α = 0.34) (Figure 3B),
170
indicating that the movement of chromosomal loci is subdiffusive. We also ruled
171
out the possibility that the observed movement of chromatin locus was caused by
172
localization error (Supplementary Figure 9) or stage drift (Supplementary Figure
173
10).
174
To obtain a more robust measurement of the subdiffusive behavior of the δ v(τ).
175
MUC4 locus, we calculated the average velocity autocorrelation function C
176
This function describes what degree the average velocity over a time interval δ is
177
correlated with the average velocity over another time interval δ that is separated
178
by τ from the first one
24
. The correlation function C
9
ACS Paragon Plus Environment
δ v(τ)
was calculated for
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Page 10 of 32
δ v(τ)
179
different values of δ and plotted as a function of the time lag τ. For all δ, C
180
dropped into negative values before finally decaying to zero, indicating the
181
negative correlation of MUC4 loci (Figure 3C, left). We then analyzed the
182
behavior of C
183
shown (Figure 3C, right). This indicates that the motion of MUC4 segments
184
possessed the same negative correlation property at different temporal scales.
185
Taken together, the results indicate that the MUC4 locus underwent a ‘back and
186
forward’ movement, i.e. motion in one direction is likely to be followed by motion
187
in the opposite direction, which may arise from restoring forces of its surrounding
188
chromatin fiber.
189
Synergistic regulation of endogenous genes by multiplexed sgRNA
190
expression
δ v(τ)
as a function of the ratio (τ/δ), and a single master curve was
191
Besides the single non-repetitive DNA sequence imaging, the multiplexed
192
sgRNA expression strategy can also be used for synergistic regulation of
193
endogenous genes as multiple sgRNAs can recruit multiple copies of proteins to
194
function synergistically. We designed sgRNAs targeting the human SOX2 gene in
195
HEK293T cells (Figure 4A). We found that the efficiency of synergistic activation
196
of endogenous genes was much higher than that of single sgRNA-mediated
197
activation or cotransfection of multiple sgRNAs. Furthermore, similar results were
198
obtained for Cas9-KRAB mediated SOX2 repression experiments (Figure 4B).
199
Coordinated activation and repression of endogenous genes by multiplexed
200
sgRNA
10
ACS Paragon Plus Environment
Page 11 of 32
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
201
To examine the capability of coordinated gene regulation of our multiplexed
202
sgRNA expression method, we designed 20 sgRNAs each targeting a randomly
203
chosen gene. HEK293T cells were transfected with the all-in-one sgRNA
204
expressing plasmid with dCas9-SunTag-VP64 or dCas9-KRAB for activation or
205
repression, respectively. The results indicate that all 20 randomly chosen genes
206
were up-regulated or down-regulated simultaneously (Supplementary Figure 11 &
207
Supplementary Figure 12).
208
At last, we tested the all-in-one plasmid for simultaneous activation and
209
repression of different genes in the same cell. Simultaneous up-regulation of a
210
subset of endogenous genes while down-regulation of another subset of
211
endogenous genes in a defined manner can open up novel opportunities for
212
systems biology as it raises the possibility of manipulating transcriptional
213
networks. We used functional modified sgRNA for this application as different
214
sgRNAs with different aptamers insertion can recruit different transcriptional
215
regulators (Figure 5A). The sgRNAs targeting the first group of 10 genes were
216
inserted with PP7 at their tetraloop and loop2, while sgRNAs targeting the second
217
group of 10 genes were inserted with MS2 at the same position 6. With co-
218
transfection of tdPCP-VPR
219
expressing plasmid, regulation of different genes was as expected and the
220
regulation result was orthogonal for the corresponding genes (Figure 5B). These
221
results indicate that the multiplexed sgRNA expression strategy can be expanded
222
with modified sgRNA for multi-function applications.
25
, tdMCP-KRAB, dCas9, and the all-in-one sgRNA
223
11
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
224
Page 12 of 32
DISCUSSION
225
CRISPR based technologies have revolutionized the fields of genome editing
226
and gene regulation. Simultaneous expression and coordinated functioning of
227
multiple sgRNAs will add a new dimension to the evolving CRISPR system,
228
further facilitating our understanding of the physiological and pathological
229
conditions.
230
Here we have reported a hierarchical assembly strategy for rapid assembly
231
of multiple sgRNA expression cassettes into a single vector. Compared to the
232
current methods, this system ensures high speed, efficiency, capacity and
233
flexibility. Based on Golden Gate cloning, our hierarchical assembly strategy only
234
takes 4-5 days to assemble 20 sgRNA expression cassettes into one plasmid with
235
high fidelity.
236
Moreover, the hierarchical assembly strategy also offers the flexibility in
237
tuning the number of sgRNAs assembled simply by choosing different
238
combination of amplification primers. Meanwhile, the expression level of different
239
sgRNAs in the all-in-one vector could also be differentiated by using different
240
promoters
241
sgRNA expression system can also be used in expressing sgRNAs for other
242
orthogonal CRISPR systems, including SA Cas9
243
single all-in-one vector can be used for multiplexing operations on the same
244
genomic region, such as imaging, transactivation, repression and chromatin
245
epigenetic modification 29.
19
. Additionally, besides the SP CRISPR system, the multiplexed
26
, Cpf1
12
ACS Paragon Plus Environment
27
and C2c2
28
. Lastly, a
Page 13 of 32
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
246
In this work, we have demonstrated assembly of 20 sgRNAs in one single
247
vector. Regarding the capacity of the all-in-one sgRNA expression vector, as the
248
upper limit of the sgRNA number that can be assembled is determined by the
249
carrying capacity of the plasmid, there are several possible ways to enhance the
250
system. Firstly, generating multiple sgRNAs from mRNA
251
sgRNAs
252
needed for the transcription of multiple sgRNAs. Secondly, since the longest
253
fragment in our all-in-one construct is U6 promoter, using other expressing
254
systems will further increase the capacity of our cloning methods. With the
255
improvement of the DNA assembly methods, it is possible that a library of
256
sgRNAs targeting a long DNA sequence can be assembled into one vector, which
257
will greatly aid on the study of structure and dynamics of whole chromosomes or
258
large chromatin domains in living cells. So far, we have only demonstrated the
259
functionality of our all-in-one multiple sgRNA expression vectors in cultured cells.
260
Due to the excessive length and repetition of the multiple sgRNA expression
261
vector, it would be quite challenging to pack it in virus and apply it in in vivo
262
studies.
263
METHODS
264
Hierarchical assembly of multiple sgRNA expression cassettes
31
30
or tRNA-flanked
can make the construct much shorter because only one promoter is
265
All primers were ordered from Ruibo Biotech (Beijing). The U6 promoter,
266
conventional and modified sgRNA expression cassettes were amplified from
267
plasmids
268
psgRNA2.02-MS2-ccdB (Addgene #: 99911, Supplementary Map 2), and
psgRNA-ccdB
(Addgene
#:
99915,
Supplementary
13
ACS Paragon Plus Environment
Map
1),
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
269
psgRNA2.02-PP7-ccdB (Addgene #: 99912, Supplementary Map 3). The oligos
270
(F + R, 100 µM each) for each targeting sgRNA were phosphorylated by T4 PNK
271
(NEB) and annealed in a thermocycler by using the following parameters: 37 °C
272
for 30 min; 95 °C for 5 min; ramp down to 25 °C at 5 °C min-1. The
273
phosphorylated and annealed oligos were then diluted to 1:100 by adding 1 µl
274
oligos to 99 µl ddH2O.
275
For the first assembly step, U6 promoter (10 ng), annealed oligos (1 µM) and
276
sgRNA scaffold (5 ng) were mixed with BsmBI (1 µl, NEB) and T4 ligase (0.5 µl,
277
NEB) together for Golden Gate reaction in 20 separate tubes by using the
278
following parameters: 37 °C for 10 min; 16 °C for 10 min; 10 cycle; 55 °C for 10
279
min; 85 °C for 10 min. The reaction mix was then diluted by 200 folds in ddH2O. 1
280
µl diluted mix was used as template for PCR reaction. The primer sequence and
281
allocation have been described in Supplementary Tables 1 and 2. After the PCR
282
reaction, gel electrophoresis was used to check the reaction. After gel extraction,
283
a QIAquick gel extraction kit was used to ensure that the product contains only
284
the intended U6+oligos+sgRNA ligation product.
285
For the second assembly step, the 20 ligation fragments (10 ng each) were
286
mixed in 4 separate tubes (1-5 in tube #1, 6-10 in tube #2, 11-15 in tube #3, 16-
287
20 in tube #4) with BsaI (1 µl, NEB) and T4 ligase (0.5 µl, NEB) together for
288
Golden Gate reaction by using the following parameters: 37 °C for 10 min; 16 °C
289
for 10 min; 10 cycle; 37 °C for 10 min; 85 °C for 10 min. The reaction mix was
290
then diluted by 200 folds in ddH2O. 1 µl diluted mix was used as template for the
291
PCR reaction. The primer sequences and allocation were described in
14
ACS Paragon Plus Environment
Page 14 of 32
Page 15 of 32
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
292
Supplementary Tables 1 and 2. After the PCR reaction, gel electrophoresis was
293
used to check the reaction. After gel extraction, a QIAquick gel extraction kit was
294
used to ensure that the product contains only the intended 5 ligation length
295
product. This PCR step needs to be optimized to ensure the maximum
296
productivity due to the repetitive sequence of the 5 sgRNA expression cassettes.
297
If the intermediates (5 sgRNA expression plasmids) were needed, the Golden
298
Gate reaction should be performed with the corresponding receptor plasmids.
299
For the third assembly step, the 4 ligated fragments (100 ng each) and 50 ng
300
receptor plasmid pMulti-sgRNA-LacZα (Addegene #: 99913, Supplementary Map
301
4) or pMulti-sgRNA-LacZα-DsRed (Addegene #: 99914, Supplementary Map 5)
302
were mixed with BpiI (1 µl, Thermo Fisher Scientific) and T4 ligase (0.5 µl, NEB)
303
together for Golden Gate reaction by using the following parameters: 37 °C for 10
304
min; 16 °C for 10 min; 10 cycle; 37 °C for 10 min; 85 °C for 10 min. The
305
competent E. coli strain Stbl3 (TransGen Biotech) was transformed with the final
306
reaction mix and cultured in 30 °C to minimize the opportunity of losing the repeat
307
units. The plasmids were linearized and checked by gel electrophoresis to ensure
308
that the final plasmid contains the proper length of repeats. The more detailed
309
information about the protocol to assemble 20 sgRNAs was included in
310
Supplementary Information as Supplementary Protocol 1.
311
Construction of other plasmids used in this work
312
The NLSSV40-dCas9-(NLSSV40)X3-(GCN4-V4)X24-NLSSV40-P2A-BFP fragment
313
was amplified by PCR from plasmid pHRdSV40-NLS-dCas9-(GCN4-V4)X24-NLS-
314
P2A-BFP-dWPRE (Addgene #: 60910) and then ligated into PiggyBac plasmid
15
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
(home-made)
Page 16 of 32
315
pB-TRE3G-BsmBI-EF1α-PuroR-P2A-rtTA
by
Golden
Gate
316
assembly with BsmBI and T4 ligase (NEB). The scFV-sfGFP-GB1-NLSSV40
317
fragment was amplified from plasmid pHR-scFV-GCN4-sfGFP-GB1-NLS-dWPRE
318
(Addgene #: 60906) and then ligated into PiggyBac plasmid pB-TRE3G-BsmBI-
319
EF1α-HygroR-P2A-rtTA (home-made) by Golden Gate assembly with BsmBI and
320
T4 ligase (NEB). The tdMCP fragment was amplified from phage-ubc-nls-ha-
321
tdMCP-gfp (Addgene #: 40649), and the KRAB was amplified from pHAGE-EF1α-
322
dCas9-KRAB (Addgene #: 40649) and then ligated into plasmid pB-CMV (home-
323
made). The tdPCP fragment was amplified from phage-ubc-nls-ha-tdPCP-gfp (A
324
Addgene #: 40650), and the VPR was amplified from SP-dCas9-VPR (Addgene
325
#: 63798) and then ligated into plasmid pB-CMV (home-made). The telomere,
326
rDNA10, MUC4-E2 targeting sgRNAs and other sgRNA expression plasmids
327
were made according to procedures described previously 6. All primers and
328
sequences used in this work were listed in Supplementary Tables 1-9.
329
Cell Culture, Transfection, Stable Cell Line Construction
330
Human breast cancer cell line MDA-MB-231 and HEK293T were maintained
331
in Dulbecco’s modified Eagle medium (DMEM) with high glucose (Lifetech), 10%
332
FBS (Lifetech), 1X penicillin/streptomycin (Lifetech). All cells were maintained at
333
37 °C and 5% CO2 in a humidified incubator. All plasmids were transiently
334
transfected with Lipofectamine 2000 (Lifetech) in accordance with the
335
manufacturer’s protocol. To construct a stable cell line, MDA-MB-231 cells were
336
spread onto a 6-well plate one day before transfection. On the next day, the cells
337
were transfected with 500 ng pB-dCas9-(GCN4)X24, 500 ng pB-scFV-sfGFP, and
16
ACS Paragon Plus Environment
Page 17 of 32
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
338
200 ng pCAG-hyPBase using Lipo 2000. Hygromycin (200 µg/ml) and puromycin
339
(5 µg/ml) were used for the selection of positive clone for about 2 weeks. Cells
340
with proper expression level of dCas9-(GCN4)X24 and scFV-sfGFP were selected
341
using fluorescence-activated cell sorting analysis (FACS). Cells were trypsinized
342
and centrifuged at 300xg for 2 min at RT. The supernatant was removed, and the
343
cells were resuspended in 1× PBS. Fortessa flow analyzers (BD Biosciences)
344
were used for FACS with the following settings. BFP was measured using a 405
345
nm laser and a 450/50 filter and sfGFP was measured using a 488 nm laser and
346
a 530/30 filter. The FACS results were analyzed by FlowJo.
347
To choose a “minimal expression level” of dCas9-SunTag stable clone, a
348
window (as indicated by the yellow window in Supplementary Figure 2A) which is
349
far lower than the averaged transient expression level (several hundred times
350
lower) and slightly higher than the untransfected cells was applied while gating
351
the cells. After the expansion of each clone, the expression level should be
352
compared with the untransfected cells (Supplementary Figure 2B). A locus with
353
low repetitive sequence or non-repetitive sequence was labeled to quantified the
354
labeling results. Based on the labeling performance, optimization can be achieved
355
by choosing a fluorescence intensity gradient. The clone with the best
356
performance was chosen as the stable cell line in the following experiments.
357
FISH and Immunofluorescence
358
The FISH probe targeting the third exon of MUC4 was directly synthesized
359
by Invitrogen with Cy5 labeled at the 5’ end. The sequence of the FISH probe
360
was as following: 5’-Cy5-CTTCCTGTCACCGAC-3’. The transfected cells were 17
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
361
washed with PBS briefly once and incubated with extraction buffer (0.1 M PIPS, 1
362
mM EGTA, 1 mM MgCl2, 0.2% TritonX-100) for 1 min to extract cytoplasm
363
diffusive proteins. Cells were then washed once with PBS and fixed with 4% PFA
364
for 15 minutes. Cells were permeabilized with 0.1% saponin/0.1% Triton X-100 in
365
PBS for 10 min at room temperature (RT) and then washed twice in PBS for 5
366
min at RT. Cells were incubated for 1 h in 20% glycerol/ PBS at RT and then
367
frozen/ thawed 3 times in liquid nitrogen. Cells were washed twice in PBS for 5
368
min at RT, incubated in 0.1 M HCl for 30 min at RT and then washed in PBS for 5
369
min at RT. Cells were permeabilized with 0.5% saponin/0.5% Triton X-100 in PBS
370
for 30 min at RT and washed twice in PBS for 5 min at RT. Cells were
371
equilibrated in 50% formamide/2x SSC for 20 min at RT. The probe was added in
372
hybridization buffer and the dish was warmed to 78 °C for precisely 2 min. Cells
373
were incubated overnight at 37 °C in a dark humidified chamber. The next day,
374
cells were washed in the following buffer sequentially, 50% formamide/2X SSC
375
for 15 min at 45 °C, 0.2X SSC for 15 min at 63 °C, 2X SSC for 5 min at 45 °C.
376
and 2X SSC for 5 min at RT.
377
For immunofluorescence, cells were incubated in 0.5% Triton followed by 30
378
min 5% BSA blocking at RT. The cells were then stained with human anti-GFP
379
antibodies (proteintech #66002-1-Ig) in blocking buffer for 60 min, washed with
380
PBS, and then stained with Cy3-labeled secondary antibodies in blocking buffer
381
for 1 h. The labeled cells were washed with PBS, post-fixed with 4% PFA for 10
382
min at RT. Lastly, the cells were stained with DAPI (5 µg/ml in PBS) for 2 min in
383
at RT and then wash 3 times with PBS.
18
ACS Paragon Plus Environment
Page 18 of 32
Page 19 of 32
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
384 385
ACS Synthetic Biology
qRT–PCR About 48 h post transfection in six-well plates with proper amount of plasmids,
386
total RNA were extracted by TRIzol (Invitrogen). Complementary DNAs (cDNAs)
387
were synthesized using RevertAid First Strand cDNA synthesis Kit (Thermo
388
Fisher Scientific) and qPCR was performed using SYBR Green PCR Master Mix
389
(Roche). GADPH was used as the internal reference. All primers used in the
390
qPCR step were designed by primer-BLSAT on NCBI website. We used the
391
“semi-quantitative PCR” to analysis the data. Briefly, in each group (experimental
392
and control), the GADPH and the gene to be examined (Sample) were amplified
393
using corresponding primers in two different tubes with the same genomic DNA
394
input (10 ng). Each group has three biological replicates. We performed a two-
395
step qRT- PCR to quantify the expression level of the genes of interest. The
396
program is comprised of two reactions: one reaction for the conversion of RNA to
397
cDNA and a separate second reaction for qPCR amplification of the gene of
398
interest. The set of thermal profiles for two-step qRT-PCR is given as following.
399
Profile 1, directs cDNA synthesis by reverse transcriptase using a 30-minute,
400
42°C synthesis protocol. Profile 2, Segment 1 directs a 95°C incubation, used for
401
qPCR enzyme activation, and Segment 2 directs 40 cycles of qPCR amplification.
402
To quantify the expression level of the genes of interest, the Ct values of
403
both the sample and GAPDH were calculated by setting a cutoff of the relative
404
fluorescence intensity. The threshold value has to be set at a point where all
405
samples being analyzed display the same rate of increase in the fluorescence
406
intensity, and ideally this increase responds to an exponential function. We used
19
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
407
the amplification-based threshold algorithm (Default Method in the MxPro QPCR
408
Software) to determine the Ct value automatically. Then the change of RNA
409
expression level is calculated as follows:
410
CtSample-CtGAPDH=∆Ct
411
Relative sample RNA expression level = 2-∆Ct
412
RNA expression level change = 2-∆Ct(experimental)/2-∆Ct(control)
413
Image acquisition and analysis
414
Briefly, all images were taken on a custom Olympus IX83 inverted
415
microscope equipped with a 100X UPlanSApo, N.A. = 1.49, oil-immersion phase
416
objective and Andor iXon Ultra897 EMCCD. The details about the configurations
417
of the custom microscope was listed in Supplementary Table 10. The microscope
418
stage incubation chamber was maintained at 37 °C and 5% CO2. A 405-nm laser
419
was used to excite BFP and DAPI; a 488-nm laser was used to excite sfGFP; a
420
561-nm laser was used to excite Cy3; a 647-nm laser was used to excite Cy5.
421
The laser power was modulated by an acousto-optic tunable filter (AOTF). Cells
422
were washed three times by PBS and imaged in phenol red-free DMEM (Thermo
423
Fisher, 21063029) for live cell imaging. Movies of short time scale chromatin loci
424
dynamics in living MDA-MB-231 cells were acquired at 50 ms exposure time (20
425
Hz) with only one layer. We used highly inclined thin illumination (HILO) to avoid
426
stray-light reflection and to reduce background from cell auto-fluorescence in the
427
live cell tracking experiment. For the experiment to count the number of loci in
428
each cell and the percentage of positive cell, cells were fixed by 4% PFA and
429
stained by DAPI for 2 min. Then images were acquired in PBS by 30 layers with a
20
ACS Paragon Plus Environment
Page 20 of 32
Page 21 of 32
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
430
step size of 0.5 µm, each layer with a 500 ms exposure time in wide-field
431
illumination mode.
432
Analysis of MSD and velocity autocorrelation was carried out using custom
433
MATLAB scripts. The MSD analysis method has been described previously 6. To
434
quantify the influence of localization error on the accuracy of MSD analysis, we
435
compared the localization error with displacement of successive frame. The
436
localization error is calculated as followed: A , = B + × 2πσ σ
437
s = (σx + σy)/2
438
Where B is the background level, A is the height of the Gaussian peak, x0 is
439
the x center of the Gaussian peak, y0 is the y center of the Gaussian peak, σx is
440
the x standard deviation, σy is the y standard deviation, and s is the localization
441
error.
442 443
We then calculated all displacements between of two successive frames in the trajectories as followed: = ∆0 2 + ∆0 2 #$
444
where and ∆% and ∆% were the difference between two successive frames.
445
Average velocity autocorrelation function was calculated as described
446
previously by Joseph S. Lucas et al. 24. The function can be expressed as follows: & ' ( ) = < +, + ) ∙ +, > +, =
/, + 0 − /, 0
21
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
447
Where /, is the two-dimensional coordinate at time point ,, δ is the time interval
448
over which the average velocity is calculated and ) is the lag time over which the
449
correlation in velocities is calculated.
22
ACS Paragon Plus Environment
Page 22 of 32
Page 23 of 32
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
450
AUTHOR INFORMATION
451
Corresponding Author
452
*Phone#: 86-10-62744060
453
Author Contributions
454
S.S. and Y.S. conceived and designed the experiments. S.S. and YA.S.
455
performed the cloning experiments. S.S., L.C. and Y.H. performed the FISH
456
experiments. S.S. and X.F. performed the qPCR experiments. S.S. analyzed all
457
the data. S.S. and Y.S. wrote the manuscript.
458
Notes
459
The authors declare no competing financial interests.
460
ACKNOWLEDGMENTS
E-Mail:
[email protected] 461
We thank Dr. Ronald D. Vale (University of California, San Francisco) for
462
providing SunTag plasmids, Dr. Wensheng Wei (School of life sciences, Peking
463
University) for providing sgRNA expression plasmids, Dr. Feng Zhang (Broad
464
Institute) for providing plasmid px330 (Addgene Plasmid #42230), Dr. Scot Wolfe
465
(University of Massachusetts Medical School) for pHAGE-EF1α-dCas9-KRAB
466
(Addgene Plasmid #50919), Dr. George Church (Harvard University) for SP-
467
dCas9-VPR (Addgene Plasmid #63798) and Dr. Wei Guo (Department of Biology,
468
University of Pennsylvania) for providing the MDA-MB-231 cell line. We also
469
thank Ms Xuefang Zhang from the National Center for Protein Sciences Beijing
470
(Peking University) for her excellent assistance with FACS. This work is
471
supported by grants from the 863 Program SS2015AA020406 and National
23
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
472
Science Foundation of China 21573013, 21390412, 31271423 and 31327901, for
473
Y.S.
24
ACS Paragon Plus Environment
Page 24 of 32
Page 25 of 32
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
474
SUPPORTING INFORMATION
475
Supplementary protocol 1, Supplementary Figures 1-12, Supplementary Tables
476
1-10, and Supplementary Plamid Maps 1-5.
477
REFERENCES
478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520 521
[1] Barrangou, R., and Doudna, J. A. (2016) Applications of CRISPR technologies in research and beyond, Nat. Biotechnol., 933-941. [2] Gaj, T., Gersbach, C. A., and Barbas, C. F., 3rd. (2013) ZFN, TALEN, and CRISPR/Cas-based methods for genome engineering, Trends Biotechnol. 31, 397-405. [3] Dominguez, A. A., Lim, W. A., and Qi, L. S. (2016) Beyond editing: repurposing CRISPR-Cas9 for precision genome regulation and interrogation, Nat. Rev. Mol. Cell Biol. 17, 5-15. [4] Fu, Y., Rocha, P. P., Luo, V. M., Raviram, R., Deng, Y., Mazzoni, E. O., and Skok, J. A. (2016) CRISPR-dCas9 and sgRNA scaffolds enable dual-colour live imaging of satellite sequences and repeat-enriched individual loci, Nat Commun 7, 11707. [5] Chen, B., Hu, J., Almeida, R., Liu, H., Balakrishnan, S., Covill-Cooke, C., Lim, W. A., and Huang, B. (2016) Expanding the CRISPR imaging toolset with Staphylococcus aureus Cas9 for simultaneous imaging of multiple genomic loci, Nucleic Acids Res. 44, e75. [6] Shao, S., Zhang, W., Hu, H., Xue, B., Qin, J., Sun, C., Sun, Y., Wei, W., and Sun, Y. (2016) Long-term dualcolor tracking of genomic loci by modified sgRNAs of the CRISPR/Cas9 system, Nucleic Acids Res. 44, e86. [7] Ma, H., Tu, L. C., Naseri, A., Huisman, M., Zhang, S., Grunwald, D., and Pederson, T. (2016) Multiplexed labeling of genomic loci with dCas9 and engineered sgRNAs using CRISPRainbow, Nat. Biotechnol. 34, 528-530. [8] Xiao-Jie, L., Hui-Ying, X., Zun-Ping, K., Jin-Lian, C., and Li-Juan, J. (2015) CRISPR-Cas9: a new and promising player in gene therapy, J. Med. Genet. 52, 289-296. [9] Sakuma, T., Nishikawa, A., Kume, S., Chayama, K., and Yamamoto, T. (2014) Multiplex genome engineering in human cells using all-in-one CRISPR/Cas9 vector system, Scientific reports 4, 5400. [10] Jusiak, B., Cleto, S., Perez-Pinera, P., and Lu, T. K. (2016) Engineering Synthetic Gene Circuits in Living Cells with CRISPR Technology, Trends Biotechnol. 34, 535-547. [11] Chen, B., Gilbert, L. A., Cimini, B. A., Schnitzbauer, J., Zhang, W., Li, G. W., Park, J., Blackburn, E. H., Weissman, J. S., Qi, L. S., and Huang, B. (2013) Dynamic Imaging of Genomic Loci in Living Human Cells by an Optimized CRISPR/Cas System, Cell 155, 1479-1491. [12] Cheng, A. W., Wang, H., Yang, H., Shi, L., Katz, Y., Theunissen, T. W., Rangarajan, S., Shivalila, C. S., Dadon, D. B., and Jaenisch, R. (2013) Multiplexed activation of endogenous genes by CRISPR-on, an RNA-guided transcriptional activator system, Cell Res. 23, 1163-1171. [13] Zalatan, J. G., Lee, M. E., Almeida, R., Gilbert, L. A., Whitehead, E. H., La Russa, M., Tsai, J. C., Weissman, J. S., Dueber, J. E., Qi, L. S., and Lim, W. A. (2015) Engineering complex synthetic transcriptional programs with CRISPR RNA scaffolds, Cell 160, 339-350. [14] Niu, Y., Shen, B., Cui, Y., Chen, Y., Wang, J., Wang, L., Kang, Y., Zhao, X., Si, W., Li, W., Xiang, A. P., Zhou, J., Guo, X., Bi, Y., Si, C., Hu, B., Dong, G., Wang, H., Zhou, Z., Li, T., Tan, T., Pu, X., Wang, F., Ji, S., Zhou, Q., Huang, X., Ji, W., and Sha, J. (2014) Generation of gene-modified cynomolgus monkey via Cas9/RNA-mediated gene targeting in one-cell embryos, Cell 156, 836-843. [15] Weninger, A., Killinger, M., and Vogl, T. (2016) Key Methods for Synthetic Biology: Genome Engineering and DNA Assembly, In Synthetic Biology, pp 101-141, Springer. [16] Engler, C., Gruetzner, R., Kandzia, R., and Marillonnet, S. (2009) Golden gate shuffling: a one-pot DNA shuffling method based on type IIs restriction enzymes, PLoS One 4, e5553. [17] Peccoud, J., Weber, E., Engler, C., Gruetzner, R., Werner, S., and Marillonnet, S. (2011) A Modular Cloning System for Standardized Assembly of Multigene Constructs, PLoS One 6, e16765. [18] Zhang, F., Cong, L., Lodato, S., Kosuri, S., Church, G. M., and Arlotta, P. (2011) Efficient construction of 25
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
522 523 524 525 526 527 528 529 530 531 532 533 534 535 536 537 538 539 540 541 542 543 544 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559
sequence-specific TAL effectors for modulating mammalian transcription, Nat. Biotechnol. 29, 149-153. [19] Kabadi, A. M., Ousterout, D. G., Hilton, I. B., and Gersbach, C. A. (2014) Multiplex CRISPR/Cas9-based genome engineering from a single lentiviral vector, Nucleic Acids Res. 42, e147. [20] Vad-Nielsen, J., Lin, L., Bolund, L., Nielsen, A. L., and Luo, Y. (2016) Golden Gate Assembly of CRISPR gRNA expression array for simultaneously targeting multiple genes, Cell. Mol. Life Sci. 73, 43154325. [21] Tanenbaum, M. E., Gilbert, L. A., Qi, L. S., Weissman, J. S., and Vale, R. D. (2014) A protein-tagging system for signal amplification in gene expression and fluorescence imaging, Cell 159, 635-646. [22] Ding, S., Wu, X., Li, G., Han, M., Zhuang, Y., and Xu, T. (2005) Efficient transposition of the piggyBac (PB) transposon in mammalian cells and mice, Cell 122, 473-483. [23] Kumaran, R. I., Thakar, R., and Spector, D. L. (2008) Chromatin dynamics and gene positioning, Cell 132, 929-934. [24] Lucas, J. S., Zhang, Y., Dudko, O. K., and Murre, C. (2014) 3D trajectories adopted by coding and regulatory DNA elements: first-passage times for genomic interactions, Cell 158, 339-352. [25] Chavez, A., Scheiman, J., Vora, S., Pruitt, B. W., Tuttle, M., E, P. R. I., Lin, S., Kiani, S., Guzman, C. D., Wiegand, D. J., Ter-Ovanesyan, D., Braff, J. L., Davidsohn, N., Housden, B. E., Perrimon, N., Weiss, R., Aach, J., Collins, J. J., and Church, G. M. (2015) Highly efficient Cas9-mediated transcriptional programming, Nat. Methods 12, 326-328. [26] Kleinstiver, B. P., Prew, M. S., Tsai, S. Q., Topkar, V. V., Nguyen, N. T., Zheng, Z., Gonzales, A. P., Li, Z., Peterson, R. T., Yeh, J. R., Aryee, M. J., and Joung, J. K. (2015) Engineered CRISPR-Cas9 nucleases with altered PAM specificities, Nature 523, 481-485. [27] Zetsche, B., Gootenberg, J. S., Abudayyeh, O. O., Slaymaker, I. M., Makarova, K. S., Essletzbichler, P., Volz, S. E., Joung, J., van der Oost, J., Regev, A., Koonin, E. V., and Zhang, F. (2015) Cpf1 is a single RNA-guided endonuclease of a class 2 CRISPR-Cas system, Cell 163, 759-771. [28] Abudayyeh, O. O., Gootenberg, J. S., Konermann, S., Joung, J., Slaymaker, I. M., Cox, D. B., Shmakov, S., Makarova, K. S., Semenova, E., Minakhin, L., Severinov, K., Regev, A., Lander, E. S., Koonin, E. V., and Zhang, F. (2016) C2c2 is a single-component programmable RNA-guided RNA-targeting CRISPR effector, Science 353, aaf5573. [29] Morita, S., Noguchi, H., Horii, T., Nakabayashi, K., Kimura, M., Okamura, K., Sakai, A., Nakashima, H., Hata, K., Nakashima, K., and Hatada, I. (2016) Targeted DNA demethylation in vivo using dCas9peptide repeat and scFv-TET1 catalytic domain fusions, Nat. Biotechnol. 34, 1060-1065. [30] Angelici, B., Mailand, E., Haefliger, B., and Benenson, Y. (2016) Synthetic Biology Platform for Sensing and Integrating Endogenous Transcriptional Inputs in Mammalian Cells, Cell reports 16, 25252537. [31] Port, F., and Bullock, S. L. (2016) Augmenting CRISPR applications in Drosophila with tRNA-flanked sgRNAs, Nat. Methods 13, 852-854.
26
ACS Paragon Plus Environment
Page 26 of 32
Page 27 of 32
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
Figure 1. Hierarchical assembly of multiplexed sgRNA expression plasmid. (A). Workflow for the hierarchical assembly of 20 sgRNAs. U6 promoter and sgRNA scaffold are PCR amplified and then ligated with each annealed oligo in 20 separated Golden Gate reactions with a type IIs restriction endonuclease BsmBI (Step 1). Each Golden Gate reaction mix was then used as template for amplification without purification by 20 pair of primers, which can be grouped into 4 categories, each member in a group with different adaptors (Step 2). The purified 20 sgRNAs expression cassettes were then separated into 4 group and implemented with the second round Golden Gate reaction using BsaI (Step 3). The four fragments each carried 5 sgRNAs expression cassettes were purified and then ligated together into a third round Golden Gate reaction with a receptor plasmid using BpiI (Step 4). Due to the repetitive sequence of each sgRNA expression cassette, each step should be verified by agar gel electrophoresis to guarantee that each reaction was carried out successfully. (B). A typical gel of the first round PCR result for the successful ligation of U6, annealed oligos, sgRNA scaffold. Trans2K plus II DNA ladder (TransGen Biotech) was shown in the left. The 20 individual sgRNA
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
expression cassettes all show the expected length on the gel, indicating that this cloning method is quite robust. A detail to note is that the first and fifth sgRNA in each group are slightly longer than the remaining one. This is because two handles are added to the 5’ of the first sgRNA and 3’ of the fifth sgRNA to facilitate the PCR of the five sgRNAs in the following step. (C). A typical gel of the second round PCR result for the successful ligation of five sgRNA expression cassettes into one fragment. Trans2K plus II DNA ladder was shown in the left. Due to the repetitive nature of the five sgRNAs fragment, laddering effect was observed. The indicated repeat number was shown in the right. (D). A typical gel of the verification of the final all-in-one plasmid. Trans15K DNA ladder (TransGen Biotech) was shown in the left. All the plasmids were linearized by endonuclease enzyme BsaI. The empty vector was also shown as a control. All the tested six plasmids have the right insertion size, suggesting the high efficiency of the assembly process.
174x237mm (300 x 300 DPI)
ACS Paragon Plus Environment
Page 28 of 32
Page 29 of 32
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
Figure 2. Single non-repetitive sequence labeling by multiplexed sgRNAs expression. (A). Schematic diagram of a non-repetitive sequence labeling of the genome. With the cell line that expressed proper level of SunTag system and 20 different sgRNAs tiling a short range sequence, arbitrary target loci can be labeled. The all-in-one plasmid carries 20 sgRNAs expression cassette each with a U6 promoter and poly U terminator so that individual sgRNA can be expressed independently. Six sgRNAs are shown as a demo here. The background fluorescent signal arises from the unbound dCas9 with multiple sfGFP molecules and the free sfGFP single molecules. Minimal level and proper stoichiometry (1:24) of SunTag system is the key point for the successful labeling. In principle, the 20 sgRNAs can direct 20 dCas9sunTag molecule, that is 480 sfGFP molecules, to the target loci to gather the fluorescent signal. (B). Workflow for the establishment of live imaging system with proper expression level. Piggybac plasmids dCas9-(GCN4)X24-P2A-BFP and scFV-sfGFP were used to construct the stable cell line. The MDA-MB-231 cells were co-transfected with SunTag expression plasmids in PiggyBac backbone and transposase expression plasmid, followed by puromycin and hygromycin selection for about 2 weeks. sfGFP and BFP
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
double positive single clones with minimal background expression and high labeling efficiency were selected and transient transfected with multiple sgRNA expression plasmid to test the performance of each clone in subsequent live imaging experiment. (C). The labeling result of non-repetitive sequence in human cells. By transfection of 20 sgRNAs from an allin-one plasmid targeting the first intron of human MUC4 gene, bright fluorescent puncta (indicated by arrow) can be visualized. A Cy5-labeled probe was used to perform DNA fluorescence in situ hybridization against the third exon of MUC4 to verify the labeling result of CRISPR. The nucleus was stained with DAPI. The merged image showed the colocalization of CRISPR signal and FISH signal. The region targeted by CRISPR is about 1.5 kb which has 20 binding sites, while the region targeted by FISH is about 9 kb which has 67 binding sites. Because the FISH probes target region is a repetitive sequence, so we only need one probe to get high signal-to-noise ratio image. All scale bars are 5 µm. (D). Single non-repetitive sequence can be labeled in different cell cycle stages. In G1 of interphase, two loci can be visualized. In G2 of interphase, four loci appeared as closely located pairs. During mitosis, the duplicated loci were allocated into two cells. In anaphase and telophase, the position of MUC4 loci nearly mirrored each other. All scale bars are 10 µm.
164x230mm (300 x 300 DPI)
ACS Paragon Plus Environment
Page 30 of 32
Page 31 of 32
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
Figure 3 Live cell tracking of single non-repetitive genomic locus. (A). Typical image and trajectories of the MUC4 locus in live cell. Trajectories were displayed by the colorcoded time points with green for the starting, blue for the intermediate and red for the ending frames. The time lookup table was in the right. Scale bar: 5 µm. (B). Time-averaged MSD plotted as a function of time lag (delay) for the MUC4 region in live cells. MSD was shown for delay up to 10 % of total imaging time. 17 trajectories were analyzed individually (left) or averaged among all the trajectories (right). Dashed lines indicate a subdiffusive scaling exponent (α) of 0.34, demonstrating the locus was undergoing constraint diffusion. (C). Velocity autocorrelation analysis of MUC4 locus in live cells. Velocity was calculated over discretization intervals (δ) ranging from 0.05 s to 5 s in 0.05 s intervals (left). Velocity autocorrelation curves for different values of δ, plotted against a rescaled timelag (t/δ) (right). The color scheme represents the values of δ from small (blue) to large (red).
194x224mm (300 x 300 DPI)
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 ACS Paragon Plus Environment
Page 32 of 32
Page 33 of 32
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
Figure 4 Synergy activation of the endogenous genes by multiplexed sgRNAs targeting one region. (A). Synergy gene activation by 20 sgRNAs targeting the promoter region of SOX2 gene. HEK293T cells were transfected with either individual plasmid (single or all-in-one) or cocktail of five plasmids. dCas9SunTag-VP64 are used for the maximum activation efficiency. qRT-PCR were used for the quantification of the activation results. (B). Synergy gene repression by 20 sgRNAs targeting the promoter region of SOX2 gene in HEK293T cells. The same plasmids are used as in (A) except dCas9-KRAB are used instead. Cells without transfection were used as control. All values are mean ± s.e.m. with n = 3 biological replicates.
176x183mm (300 x 300 DPI)
ACS Paragon Plus Environment
ACS Synthetic Biology
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Figure 5 Multiplexed modified sgRNA expression activate and repress multiple endogenous genes simultaneously. (A). 10 sgRNA-MS2 (2.02) and 10 sgRNA-PP7 (2.02) were assembled into one plasmid. dCas9, tdPCP-VPR, tdMCP-KRAB and all-in-one sgRNA expression plasmid were cotransfected to HEK293T cells. 10 genes were activated and 10 other genes were repressed simultaneously in one cell. (B). HEK293T Cells were analyzed by qRT-PCR 2 days after transfection. The genes that are targeted with sgRNA-PP7 (2.02) were upregulated while the gene that are targeted with 10 sgRNA-MS2 (2.02) were downregulated without crosstalk. All values are mean ± s.e.m. with n = 3 biological replicates.
184x152mm (300 x 300 DPI)
ACS Paragon Plus Environment
Page 34 of 32
Page 35 of 32
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
ACS Synthetic Biology
80x39mm (300 x 300 DPI)
ACS Paragon Plus Environment