Subscriber access provided by University of Florida | Smathers Libraries
Article
Small RNA profiles from virus-infected fresh market vegetables Alessandra Frizzi, Yuanji Zhang, John Kao, Charles Hagen, and Shihshieh Huang J. Agric. Food Chem., Just Accepted Manuscript • DOI: 10.1021/jf503756v • Publication Date (Web): 11 Nov 2014 Downloaded from http://pubs.acs.org on November 16, 2014
Just Accepted “Just Accepted” manuscripts have been peer-reviewed and accepted for publication. They are posted online prior to technical editing, formatting for publication and author proofing. The American Chemical Society provides “Just Accepted” as a free service to the research community to expedite the dissemination of scientific material as soon as possible after acceptance. “Just Accepted” manuscripts appear in full in PDF format accompanied by an HTML abstract. “Just Accepted” manuscripts have been fully peer reviewed, but should not be considered the official version of record. They are accessible to all readers and citable by the Digital Object Identifier (DOI®). “Just Accepted” is an optional service offered to authors. Therefore, the “Just Accepted” Web site may not include all articles that will be published in the journal. After a manuscript is technically edited and formatted, it will be removed from the “Just Accepted” Web site and published as an ASAP article. Note that technical editing may introduce minor changes to the manuscript text and/or graphics which could affect content, and all legal disclaimers and ethical guidelines that apply to the journal pertain. ACS cannot be held responsible for errors or consequences arising from the use of information contained in these “Just Accepted” manuscripts.
Journal of Agricultural and Food Chemistry is published by the American Chemical Society. 1155 Sixteenth Street N.W., Washington, DC 20036 Published by American Chemical Society. Copyright © American Chemical Society. However, no copyright claim is made to original U.S. Government works, or works produced by employees of any Commonwealth realm Crown government in the course of their duties.
Page 1 of 31
1 2
Journal of Agricultural and Food Chemistry
Small RNA profiles from virus-infected fresh market vegetables
3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18
Alessandra Frizzi1,#, Yuanji Zhang2,#, John Kao3, Charles Hagen1 and Shihshieh Huang1,* 1
Calgene Campus, Monsanto Company, 1920 Fifth St, Davis, CA 95616, USA Chesterfield Campus, Monsanto Company, 700 Chesterfield Parkway West, Chesterfield, MO 63017, USA 3 Monsanto Vegetable Seeds - Woodland, 37437 State Hwy 16, Woodland, CA 95695, USA 2
#
These authors contributed equally to this work *Correspondence (fax +1 530 792-2005; email
[email protected])
Keywords: Vegetable, small RNA, siRNA, miRNA, RNAi, RNA silencing, genetic
19
engineering, Tomato spotted wilt virus (TSWV), Potato virus Y (PVY), Watermelon
20
mosaic virus (WMV), Iris yellow spot virus (IYSV)
1
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
21
Abstract
22
Functional small RNAs, such as short interfering RNAs (siRNAs) and microRNAs
23
(miRNAs), exist in freshly consumed fruits and vegetables. These siRNAs can be
24
derived from either endogenous sequences or from viruses that infect them.
25
Symptomatic tomatoes, watermelons, zucchini and onions were purchased from
26
grocery stores and investigated by small RNA sequencing. By aligning the obtained
27
small RNA sequences to sequences of known viruses, four different viruses were
28
identified as infecting these fruits and vegetables. Many of these virally-derived small
29
RNAs along with endogenous small RNAs were found to be highly complementary to
30
human genes. However, the established history of safe consumption of these
31
vegetables suggests that this sequence homology has little biological relevance. By
32
extension, these results provide evidence for the safe use by humans and animals of
33
genetically engineered crops using RNA-based suppression technologies, especially
34
vegetable crops with virus resistance conferred by expression of siRNAs or miRNAs
35
derived from viral sequences.
36 37
INTRODUCTION
38
RNA interference (RNAi) or RNA silencing is a broadly used gene regulation
39
mechanism in eukaryotes. It has been implicated in many aspects of biological
40
processes such as regulation of growth and development, defense against pathogens
41
and response to stress (1-3). Active research over the last decade has revealed the
42
molecular basis of RNAi to a great extent (4-7). Most of the known RNAi components,
43
some conserved across the kingdoms and some unique among few species, can be 2
ACS Paragon Plus Environment
Page 2 of 31
Page 3 of 31
Journal of Agricultural and Food Chemistry
44
divided into two groups. They are either involved in generation of the small RNAs, or
45
part of various RNAi complexes directly or indirectly associated with the small RNAs. At
46
the core of the RNAi mechanism lies the small RNA whose primary sequence is used to
47
guide the RNAi machinery to its specific gene target. The DNA or mRNA of the targeted
48
genes, which bear a complementary sequence to the small RNA, is then modified by
49
the RNAi machinery. These modifications, including DNA methylation, mRNA
50
degradation or translational inhibition, eventually lead to the down-regulation of target
51
genes and result in biological changes. Furthermore, these studies also found that
52
exogenous small RNAs, whether they are either synthesized chemically or generated
53
through transgenic expression can trigger RNAi with a similar biological effect as long
54
as the required sequence signature is present.
55
The characteristic of eukaryotic RNAi whereby the sequence signature of an
56
exogenous small RNA can trigger a gene specific down-regulation in living organisms
57
has immense implications in medicine and plant biotechnology. RNAi-based drugs
58
treating diseases such as macular degeneration (AMD), cancer, asthma and glaucoma,
59
or combating viruses including Respiratory syncytial virus (RSV), Hepatitis C virus
60
(HCV) and Human immunodeficiency virus (HIV), are under clinical development (8, 9).
61
In the case of human therapeutics, RNAi-based drugs have met with tremendous
62
challenges presented by the need to achieve delivery in the face of a myriad of
63
biological and physicochemical barriers (10, 11). These barriers have limited the
64
efficacy, and thus the commercial success of this promising technology in therapeutics
65
to date (9, 11). However, RNA-based suppression technology has been successfully
66
applied in agriculture for more than a decade. The first transgenic food crop, Flavr Savr
3
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
67
tomato, commercialized in 1994, employed antisense technology (now known as a part
68
of the RNA silencing mechanism) to maintain fruit firmness for easy handling (12, 13).
69
Subsequently, virus resistant squash and papaya were introduced to the market in the
70
late 1990’s. Although it was once thought to be mediated by transgenically expressed
71
viral coat protein, the virus resistance in these plants is more likely to result from
72
transgenic siRNAs derived from the viral coat protein gene. A list of RNA silencing
73
based biotech crops that are under development for commercial use was previously
74
summarized by Frizzi and Huang (14).
75
Although theoretically the 20+ nt size of small RNAs allow them to specifically
76
identify their intended mRNA targets, mismatches and gaps in base-pairing between
77
miRNAs or siRNAs and their target mRNAs are commonly found. This suggests that not
78
all the bases of a small RNA are necessary for target recognition. In plants, mRNA
79
slicing is the principal mode of post-transcriptional RNA silencing regulation; a guiding
80
small RNA requires strong base-pairings with its target at nucleotides 2-13 but
81
sequence identity is less critical at nucleotides 15-20 (15, 16). Because the RNAi
82
mechanism effectively parses sequence mismatches and ambiguities, it has been
83
shown through in vitro studies conducted with transfected small RNAs that small RNAs
84
can cause gene suppression of unintended gene targets (17). This outcome, commonly
85
referred to as the “off-target” effect has led to increased scrutiny regarding the
86
specificity of RNAi-based drugs and also been proposed as a putative concern for food
87
crops that are engineered to produce small RNAs. However, this “off-target” effect has
88
been characterized using in vitro systems and is orders of magnitude less potent than
4
ACS Paragon Plus Environment
Page 4 of 31
Page 5 of 31
Journal of Agricultural and Food Chemistry
89
“on-target” suppression (9, 11). Such suppression also requires special sequence
90
contexts, thermodynamic criteria, target accessibility and other conditions (18, 19).
91
To investigate the level of significance of regularly-consumed small RNAs that
92
share sequence homology with human genes, the small RNA profiles of fresh market
93
fruits and vegetables were examined. These were locally purchased tomatoes,
94
watermelons, zucchini and onions that were selected because their appearance
95
suggested they were likely to be infected with viral disease(s). Using a next-generation
96
high throughput sequencing method, between 1.6 and 3.6 million small RNA sequences
97
were obtained from each sample. Although most of the small RNAs found in these
98
vegetables were likely derived from endogenous genome, up to 9.5% were identified as
99
sequences originating from viruses. Many of these small RNAs were found to share
100
sequence similarity with human genes. Small RNA profiles such as these resemble
101
those of plants genetically modified to express fragments of virus sequences for the
102
purposes of conferring virus resistance (14). The analysis of the small RNAs presented
103
here provides insights into the plant-virus interaction and also evidence of a history of
104
safe consumption for dietary small RNAs. By extension, this research supports the
105
safety of biotechnology derived crops employing RNA-based gene suppression when
106
consumed by humans or animals.
107 108
MATERIALS AND METHODS
109
Plant materials. Vegetables suspected of virus diseases based on physical
110
characteristics were selectively purchased at local farmers’ markets or supermarkets in
111
the region of Yolo County, California. The tomato fruits displayed lesions with a chlorotic
5
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
112
concentric ring pattern typical of those caused by Tomato spotted wilt virus, (TSWV,
113
family Bunyaviridae, genus tospovirus). The onion bulbs with attached green leaves
114
exhibited elongated lesions typical of those caused by the Iris yellow spot virus (INSV,
115
family Bunyaviridae, genus tospovirus). The squash fruits had mosaic symptoms
116
consistent with those caused by viruses such as the Watermelon mosaic virus (WMV),
117
Zucchini yellow mosaic virus (ZYMV) or Papaya ring spot virus (PRSV) (all viruses in
118
the family Potyviridae, genus potyvirus). The melon fruit was chosen based on mottling
119
symptoms that might be caused by a number of different potyviruses.
120
Plant RNA isolation and small RNA sequencing. Symptomatic portions of the
121
fruits/bulbs were harvested and ground in liquid nitrogen. Total RNA was extracted
122
using TRIzol® (Invitrogen, Carlsbad, CA) following the manufacturer’s recommended
123
protocol. RNA was quantified by Nanodrop® (Thermo scientific, Wilmington, DE), and
124
its integrity was verified by gel electrophoresis. Small RNA library construction and
125
sequencing was performed with Illumina technology (Illumina Inc, San Diego, CA) as
126
described previously (20). Four to six million raw reads were generated from each
127
library. After the 5’ and 3’ adaptors were identified and removed from the raw reads,
128
reads with sequence length from 18 to 26 nt were parsed out for further analysis (Figure
129
1). The sequences of these raw reads can be found in NCBI under BioProject accession
130
PRJNA265505.
131
Small RNA sequence analysis. The parsed small RNAs were mapped to plant
132
virus sequences downloaded from www.dpvweb.net on 03/02/2012. Small RNA
133
matching was performed using “SHRiMP” v2 (21). Only perfectly mapped small RNAs
134
were counted. RNAi plays an important anti-viral role in plants. Upon virus infection,
6
ACS Paragon Plus Environment
Page 6 of 31
Page 7 of 31
Journal of Agricultural and Food Chemistry
135
small RNAs are generated with the help of host plant Dicer-like protein from viral
136
dsRNAs either as viral replicative intermediates or resulted from host RNA-dependent
137
RNA polymerases on viral templates. The viral siRNAs are from both strands of the
138
dsRNA, and predominantly 21 and 22 nt in length. To remove noise from the virus
139
mapping results such as non-virus sequence contamination, we applied three filters: 1)
140
A virus is called to have infested the plant if only its sequence has at least ten perfectly
141
matched and sequence distinct small RNAs, 2) 21 and 22 nt small RNAs account for at
142
least 60% of the small RNAs that mapped to a virus sequence, and 3) the abundance
143
of small RNAs mapping to either strand of a virus sequences reflects a minimum of 5%
144
of overall mapped small RNA abundance to the virus sequence. Virus-mapped small
145
RNAs were then compared to the human reference RNA set downloaded from
146
genome.ucsc.edu on 02/07/2012. For this analysis, up to two mismatches in 21 nt small
147
RNAs and three mismatches in 22 nt small RNAs were allowed. Similarly, all small
148
RNAs not mapped to virus sequences (‘non-virus-mapped’), which were presumably
149
derived from endogenous plant sequences were also compared to the human reference
150
RNA set. To match this subset of plant small RNAs against human reference genes,
151
human miRNAs, ribosomal RNAs, small nucleolar RNAs and small nuclear RNAs were
152
excluded from the reference RNA set and up to two mismatches were allowed for each
153
match. Detailed lists of the virus-mapped small RNAs and small RNAs homologous to
154
human genes can be found in the supplemental tables.
155 156
RESULTS
7
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
157
Deep sequencing of vegetable small RNA populations. RNA extracted from
158
tomato (fruit), melon (fruit), zucchini (fruit) and onion (bulb and stem) was sequenced
159
using Illumina technology. Small RNA sequences ranging in length from 18 to 26 nt
160
were parsed out for subsequent analyses (Figure 1 and Table 1). The peaks shown
161
here are typical of plant small RNA distribution, showing major peaks at 21 and 24 nt,
162
consistent with plant dicer-like protein (DCL) cleavage products (22). The 21 nt class
163
was more predominant than the 24 nt for all the samples but melon. The total number of
164
reads varied from 1,636,524 to 3,609,317, with unique reads well represented in all the
165
libraries (Table 1).
166
Size class distribution of virus-mapped small RNAs. Several filters were
167
applied to map the parsed plant-derived small RNA sequences to the virus genome
168
collections obtained from www.dpvweb.net (see Small RNA sequence analysis in
169
Materials and Methods). The 21-22 nt size class siRNAs are significantly more
170
abundant than the 24 nt in all four samples (Figure 2), consistent with the size of small
171
RNA products of post transcriptional gene silencing usually associated with viral
172
infection. However, the number of reads of the virus-mapped melon small RNA
173
population is over 100 fold less than in the other vegetable libraries (Table 1).
174
Validation of virus-mapped matches. The genomic sequences of the top two
175
virus matches generated from mapping the siRNA libraries to the virus sequence
176
collection (except for the melon library where only one matching virus was identified)
177
were plotted against the sequences of their respective small RNA matches to illustrate
178
their distribution (Figure 3). In addition to applying the filters described earlier, the
179
presence of small RNAs across the entire length of the putative virus hit serves as a
8
ACS Paragon Plus Environment
Page 8 of 31
Page 9 of 31
Journal of Agricultural and Food Chemistry
180
validation criterion suggesting the legitimate presence of the virus in the sample (Figure
181
3A, 3B, 3C, 3D and 3F). In the case of the melon sample, although the number of small
182
RNA matches is smaller in number (possibly reflecting low virus titer in the tissue) the
183
virus hit seems to be authentic. Conversely, a sparsely distributed match pattern
184
(Figure 3E and 3G) is an indication that the matching small RNAs likely originated from
185
sources other than the matched virus. Due to sequence similarities between viruses,
186
some small RNAs match to one virus and also cross-match to another: for example, in
187
the zucchini sample, most of the small RNAs mapped to Soybean mosaic virus, SMV
188
(Figure 3E) were also mapped to WMV (Figure 3D) and in the onion sample, all the
189
small RNAs mapped to Tomato yellow fruit ring virus, TYFRV (Figure 3G) also mapped
190
to INSV (Figure 3F).
191
In the case of the Tomato spotted wilt virus, segment S matched in the tomato
192
sample (Figure 3A), the small RNA reads in both the sense and antisense orientations
193
come predominantly from the regions corresponding to the non structural and
194
nucleocapsid protein coding sequences, and noticeably less from the intergenic region
195
that is not actively transcribed. For PVY (Figure 3B), WMV (Figure 3D), and INSV
196
segment L (Figure 3F) all regions are evenly represented (notice the change of scale),
197
consistent with the generation of a single transcript over the genomic segment. The
198
relative abundant of these virus-derived small RNAs across the virus genome and their
199
sequence signatures could also reveal the biochemical properties of plant small RNA
200
processing. For example, using virus-derived small RNA sequences from TSWV
201
infected tomato and Nicotiana benthamiana, details of small RNA differential processing
9
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
202
were examined previously (23). Our small RNA sequencing results which include three
203
additional viruses could be applied to such studies in the future.
Page 10 of 31
204
Characterization of tomato library reads. The tomato small RNA library was
205
further investigated and mapped to the Monsanto in-house assembled tomato reference
206
genome. Only sequences with perfect matches to the tomato and/or viral genomes are
207
included in the analysis and these matches accounted for 78% of all reads (Table 3).
208
The remaining, non-matching reads could be mostly sequencing errors and some
209
representing single nucleotide polymorphisms (SNPs) since the sample is sourced from
210
a different tomato variety than the reference sequence. As previously observed, the
211
abundance of 21-22 nt classes appears to have been enhanced by the virus derived
212
small RNAs (Figure 4A). The 24-nt class small RNAs, mostly derived from the
213
endogenous tomato genome, are both abundant and rich in unique sequences (Figure
214
4A and 4B). This is probably due to their roles in genome-wide transcriptional regulation
215
and chromatin modification (22).
216
Comparison of vegetable small RNAs to the human transcriptome. To
217
evaluate if small RNAs derived from food have sequence similarity to human genes, we
218
compared our libraries of small RNA sequences to small RNAs using a database of
219
human reference RNAs. A recent study suggested that ingestion of a rice-derived
220
miRNA matching a human gene over its open reading frame could cause transcript
221
suppression in humans (24), claims which have been strongly challenged in the peer-
222
reviewed literature by several independent research groups collectively (25-27).
223
Nevertheless, similar matching criteria were used in this evaluation.
10
ACS Paragon Plus Environment
Page 11 of 31
Journal of Agricultural and Food Chemistry
224
We divided the small RNAs in each library into virus-mapped and non-virus-
225
mapped reads, representing exogenous and endogenous sources for the sequences,
226
respectively. To simplify this bioinfomatic evaluation, the positions of the mismatches,
227
which sometimes can have various degree of impact in target gene suppression efficacy
228
(15, 16), were not given special considerations. The results in Table 4 summarize that,
229
when allowing up to 2 mismatches in the 21 nt size class and 3 mismatches in 22 size
230
class, many human genes are homologous to virally-derived small RNAs. A total of
231
1580, 43, 884 and 1399 human genes have at least one sub-region sharing sequence
232
similarity to virus-mapped small RNAs in the tomato, melon, zucchini and onion small
233
RNA libraries, respectively (Table S1, S2, S3 and S4). Except for the melon library,
234
where fewer virus-mapped small RNAs were found, virus-derived small RNAs that
235
match to human gene transcripts appear abundant. For example, in the tomato library,
236
21816 reads or 11.1% of the virus-mapped reads are highly homologous to 1580
237
human genes (Table 4). In some instances, thousands of virus-derived small RNAs can
238
match a single human gene. The top three human genes matched by the most virus-
239
mapped small RNAs in the tomato, zucchini and onion are listed in Table 5. However,
240
there is no human gene match shared by the virus-derived small RNAs in all four
241
libraries.
242
Beyond the virus-derived small RNAs, the remaining small RNAs in these fruit or
243
vegetable small RNA libraries are probably derived from endogenous plant sequences.
244
These small RNAs can also share sequence homology to human genes. To eliminate
245
trivial and low-complexity matches, the miRNAs, ribosomal RNAs, small nucleolar RNAs
246
and small nuclear RNAs in the human reference RNA set that could be matched from
11
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Page 12 of 31
247
the corresponding fruit or vegetable ribosomal RNA sequences were excluded from the
248
sequence search. Up to two mismatches were allowed for each small RNA and the
249
results are shown in Table 6. These results imply that up to 10.9% of the total small
250
RNAs in regularly consumed vegetables share high degree of sequence similarity with
251
human genes. Furthermore, 2774 human gene matches are shared by all four libraries
252
(Table S5, S6, S7 and S8). This suggests that some human genes can have frequent
253
matches by small RNAs derived from different fruits and vegetables. Although most of
254
these matched sequences are few in numbers, numbers of endogenous small RNA with
255
high copy numbers appear to have significant sequence similarity with human genes.
256
For example, miRNA159a that is abundant in all four libraries can be mapped to a
257
human gene encoding polycystic kidney and hepatic disease 1, PKHD1 protein (Table
258
S5, S6, S7 and S8).
259 260
DISCUSSION
261
A typical plant small RNA size distribution is characterized by abundant 21 and
262
24 nt size classes; the 24 nt class is commonly present at an abundance similar to, or
263
more than that of the 21 nt size class (22). The 24 nt class small RNA is responsible for
264
transcriptional regulation and chromatin modification (28), while the 21 nt class small
265
RNA is mostly involved in post-transcriptional (29) and antiviral defense (30, 31). Except
266
for the melon sample, the small RNA size profiles of all other fruit and vegetable
267
samples displayed a higher abundance of 21 nt-sized small RNAs (Figure 1). This is
268
likely due to the presence of viruses in these tissues that induced an antiviral RNA
269
silencing response, resulting in the accumulation of 21 nt small RNAs. Indeed,
12
ACS Paragon Plus Environment
Page 13 of 31
Journal of Agricultural and Food Chemistry
270
significant amounts of total small RNA populations are found to have originated from
271
viral sequences, which partially contributed to the greater abundance of the 21 nt class
272
seen in tomato, zucchini and onion samples (Figure 2). However, directly comparing
273
the small RNA populations between known infected and uninfected plants would be
274
necessary to confirm these results.
275
Previously, we have shown that it is possible to reassemble significant portions of
276
TSWV genome from overlapping small RNA sequences obtained by small RNA
277
sequencing of tomato infected with the virus (20). Here we report that several viruses
278
can be easily and precisely detected in fresh market vegetables using small RNA
279
sequencing. Furthermore, except for the WMV in the melon sample where the virus
280
accumulation was likely to be low, the contigs assembled from the small RNA
281
sequences from other samples embody large portions of the virus genomes. For
282
example, as much as 89% of TSWV, 84% of PVY, 76% of WMV and 96% of INSV total
283
genomic sequences are covered by the small RNAs in their respective samples (data
284
not shown). Therefore, it is possible to use small RNA sequencing to assemble the
285
genome of a novel virus especially when the endogenous small RNAs of host plant can
286
be identified and excluded. With gradual reduction in cost since its invention, small RNA
287
sequencing has the potential to be used as a one-step diagnostic and discovery tool for
288
plant viruses in the future.
289
Since RNA-mediated gene suppression in plants and animals operate through
290
fundamentally similar mechanisms and synthetic dsRNA or siRNA/miRNA are known to
291
work in plant or animal cells in vitro in a sequence-specific manner, some have
292
cautioned the use of this RNA-based gene suppression in plant biotechnology.
13
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Page 14 of 31
293
Recently, a study claiming that a miRNA, miR168a, from rice can target a mouse gene
294
for suppression through oral ingestion has intensified the debate (24). The experimental
295
design and assumptions of this study have been disputed (11, 27, 32). This
296
phenomenon of significant miRNA uptake in mice, nonhuman primates and humans has
297
not been reproduced by others (25-27) and the reported physiological impacts by Zhang
298
and colleagues were not reproduced in a properly controlled feeding study with higher
299
miR168a exposures (27). In addition, these observations are inconsistent with the
300
weight of the evidence for the safety of ingested nucleic acids, as evidenced by
301
biological barriers to their uptake and/or activity in higher organisms, their extensive
302
history of safe consumption and the delivery challenges faced by pharmaceutical
303
companies developing oligonucleotide based therapeutics (11, 33). The human diet is
304
made up of largely eukaryotic sources which contain an abundance of small RNAs.
305
Therefore, it is not surprising that such small RNAs were found, intact, in food through
306
small RNA sequencing (24-27, 34). Studies have also demonstrated that some
307
endogenous long dsRNAs in crop plants share sequence complementarity of at least 21
308
nucleotides with human genes (33, 34).
309
In the current study, we have sequenced the small RNAs in fruits and vegetables
310
readily available for purchase, and likely to be ingested by consumers in an
311
unprocessed or raw state. Sequences highly homologous to human genes were
312
abundant in these fruits and vegetables, and we found that small RNAs originating
313
within the plant and from infecting viruses can have matches to human genes. However,
314
the consumption of fresh fruits and vegetables like these is not associated with an
315
obvious heath risk - indeed the opposite is true (35, 36).
14
ACS Paragon Plus Environment
Page 15 of 31
Journal of Agricultural and Food Chemistry
316
It is logical to assume that small RNAs produced by transgenes are
317
fundamentally no different than endogenously-derived plant small RNAs. The data
318
presented here provide additional support for the assumption that small RNAs derived
319
from transgenic sources are no more impactful to human health than the multitudes of
320
endogenous plant small RNAs present in foods regularly consumed every day. This
321
opinion is shared by the independent statutory agency Food Standards Australia New
322
Zealand (FSANZ) who have noted that “there is no scientific basis for suggesting that
323
small dsRNAs present in some GM foods have different properties or pose a greater
324
risk than those already naturally abundant in conventional foods” (37). Coupled with the
325
long history of safe consumption of these dietary small RNAs, the results described
326
here provide further evidence to suggest that RNA-based gene suppression in biotech
327
crops is unlikely to present a direct safety hazard to humans or animals.
328 329 330
ABBREVIATIONS USED
331
TSWV, Tomato spotted wilt virus; PVY, Potato virus Y; WMV, Watermelon mosaic virus;
332
SMV, Soybean mosaic virus; INSV, Iris yellow spot virus, TYFRV, Tomato yellow fruit
333
ring virus; ZYMV, Zucchini yellow mosaic virus; PRSV, Papaya ring spot virus.
334 335
ACKNOWLEDGEMENTS
336
We thank Daniel Ader, Mingya Huang and Mitchell Sudkamp for constructing the small
337
RNA libraries and performing the small RNA sequencing, and Jay Petrick for his
15
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
338
valuable comments on the manuscript. We also thank Sofia Castiglioni for her
339
contribution to the artwork in the Table of Contents/Abstract Graphic.
340 341
REFERENCES
342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387
1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11.
12. 13. 14. 15.
16. 17. 18. 19. 20.
21. 22.
23. 24.
Hannon, G. J., RNA interference. Nature 2002, 418, 244-51. Baulcombe, D., RNA silencing in plants. Nature 2004, 431, 356-63. Mello, C. C.; Conte, D., Jr., Revealing the world of RNA interference. Nature 2004, 431, 338-42. Kim, V. N., Small RNAs: classification, biogenesis, and function. Mol Cells 2005, 19, 1-15. Brodersen, P.; Voinnet, O., The diversity of RNA silencing pathways in plants. Trends Genet 2006, 22, 268-80. Mallory, A. C.; Vaucheret, H., Functions of microRNAs and related small RNAs in plants. Nature genetics 2006, 38 Suppl, S31-6. Siomi, H.; Siomi, M. C., On the road to reading the RNA-interference code. Nature 2009, 457, 396-404. Tiemann, K.; Rossi, J. J., RNAi-based therapeutics-current status, challenges and prospects. EMBO Mol Med 2009, 1, 142-51. Vaishnaw, A. K.; Gollob, J.; Gamba-Vitalo, C.; Hutabarat, R.; Sah, D.; Meyers, R.; de Fougerolles, T.; Maraganore, J., A status report on RNAi therapeutics. Silence 2010, 1, 14. O'Neill, M. J.; Bourre, L.; Melgar, S.; O'Driscoll, C. M., Intestinal delivery of non-viral gene therapeutics: physiological barriers and preclinical models. Drug Discov Today 2011, 16, 203-218. Petrick, J. S.; Brower-Toland, B.; Jackson, A. L.; Kier, L. D., Safety assessment of food and feed from biotechnology-derived crops employing RNA-mediated gene regulation to achieve desired traits: A scientific review. Regulatory toxicology and pharmacology 2013, 66, 167-76. Sanders, R. A.; Hiatt, W., Tomato transgene structure and silencing. Nat Biotechnol 2005, 23, 287-9. Krieger, E. K.; Allen, E.; Gilbertson, L. A.; Roberts, J. K.; Hiatt, W.; Sanders, R. A., The Flavr Savr Tomato, an early example of RNAi technology. HortScience 2008, 43, 962-964. Frizzi, A.; Huang, S., Tapping RNA silencing pathways for plant biotechnology. Plant Biotechnol J 2010, 8, 655-77. Parizotto, E. A.; Dunoyer, P.; Rahm, N.; Himber, C.; Voinnet, O., In vivo investigation of the transcription, processing, endonucleolytic activity, and functional relevance of the spatial distribution of a plant miRNA. Genes Dev 2004, 18, 2237-42. Debernardi, J. M.; Rodriguez, R. E.; Mecchia, M. A.; Palatnik, J. F., Functional specialization of the plant miR396 regulatory network through distinct microRNA-target interactions. PLoS Genet 2012, 8, e1002419. Jackson, A. L.; Burchard, J.; Schelter, J.; Chau, B. N.; Cleary, M.; Lim, L.; Linsley, P. S., Widespread siRNA "off-target" transcript silencing mediated by seed region sequence complementarity. RNA 2006, 12, 1179-87. Liu, L.; Li, Q. Z.; Lin, H.; Zuo, Y. C., The effect of regions flanking target site on siRNA potency. Genomics 2013, 102, 215-222. Naito, Y.; Jun Yoshimura, J.; Morishita, S.; Ui-Tei, K., siDirect 2.0: updated software for designing functional siRNA with reduced seed-dependent off-target effect. BMC Bioinformatics 2009, 10, 392. Hagen, C.; Frizzi, A.; Kao, J.; Jia, L.; Huang, M.; Zhang, Y.; Huang, S., Using small RNA sequences to diagnose, sequence, and investigate the infectivity characteristics of vegetable-infecting viruses. Archives of Virology 2011, 156, 1209-1216. Rumble, S. M.; Lacroute, P.; Dalca, A. V.; Fiume, M.; Sidow, A.; Brudno, M., SHRiMP: accurate mapping of short color-space reads. PLoS Comput Biol 2009, 5, e1000386. Nobuta, K.; Lu, C.; Shrivastava, R.; Pillay, M.; De Paoli, E.; Accerbi, M.; Arteaga-Vazquez, M.; Sidorenko, L.; Jeong, D.-H.; Yen, Y.; Green, P. J.; Chandler, V. L.; Meyers, B. C., Distinct size distribution of endogeneous siRNAs in maize: Evidence from deep sequencing in the mop1-1 mutant. Proc Natl Acad Sci U S A 2008, 105, 14958-63. Mitter, N.; Koundal, V.; Williams, S.; Pappu, H., Differential expression of tomato spotted wilt virus-derived viral small RNAs in infected commercial and experimental host plants. PLoS One 2013, 8, e76276. Zhang, L.; Hou, D.; Chen, X.; Li, D.; Zhu, L.; Zhang, Y.; Li, J.; Bian, Z.; Liang, X.; Cai, X.; Yin, Y.; Wang, C.; Zhang, T.; Zhu, D.; Zhang, D.; Xu, J.; Chen, Q.; Ba, Y.; Liu, J.; Wang, Q.; Chen, J.; Wang, J.; Wang, M.; 16
ACS Paragon Plus Environment
Page 16 of 31
Page 17 of 31
388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419
Journal of Agricultural and Food Chemistry
25. 26.
27. 28. 29. 30. 31. 32. 33.
34.
35. 36. 37.
Zhang, Q.; Zhang, J.; Zen, K.; Zhang, C.-Y., Exogenous plant MIR168a specifically targets mammalian LDLRAP1: evidence of cross-kingdom regulation by microRNA. Cell research 2011, 22, 107-26. Snow, J. W.; Hale, A.; Isaacs, S. K.; Baggish, A. L.; Chan, S. Y., Ineffective delivery of diet-derived microRNAs to recipient animal organisms. RNA Biology 2013, 10, 1-10. Witwer, K. W.; McAlexander, M. A.; Queen, S. E.; Adams, R. J., Real-time quantitative PCR and droplet digital PCR for plant miRNAs in mammalian blood provide little evidence for general uptake of dietary miRNAs. RNA Biology 2013, 10, 1-7. Dickinson, B.; Zhang, Y. J.; Petrick, J. S.; Heck, G.; Ivashuta, S.; Marshall, W. S., Lack of detectable oral bioavailability of plant microRNAs after feeding in mice. Nature Biotechnology 2013, 31, 965-967. Matzke, M. A.; Birchler, J. A., RNAi-mediated pathways in the nucleus. Nat Rev Genet 2005, 6, 24-35. Vaucheret, H., Post-transcriptional small RNA pathways in plants: mechanisms and regulations. Genes Dev 2006, 20, 759-71. Voinnet, O., Induction and suppression of RNA silencing: insights from viral infections. Nat Rev Genet 2005, 6, 206-20. Wang, M. B.; Metzlaff, M., RNA silencing and antiviral defense in plants. Curr Opin Plant Biol 2005, 8, 21622. Zhang, Y.; Wiggins, B. E.; Lawrence, C.; Petrick, J.; Ivashuta, S.; Heck, G., Analysis of plant-derived miRNAs in animal small RNA datasets. BMC Genomics 2012, 13, 381. Jensen, P. D.; Zhang, Y.; Wiggins, B. E.; Petrick, J. S.; Zhu, J.; Kerstetter, R. A.; Heck, G. R.; Ivashuta, S. I., Computational sequence analysis of predicted long dsRNA transcriptomes of major crops reveals sequence complementarity with human genes. GM Crops and Food: Biotechnology in Agriculture and the Food Chain 2013, 4, 1-8. Ivashuta, S. I.; Petrick, J. S.; Heisel, S. E.; Zhang, Y.; Guo, L.; Reynolds, T. L.; Rice, J. F.; Allen, E.; Roberts, J. K., Endogenous small RNAs in grain: semi-quantification and sequence homology to human and animal genes. Food Chem Toxicol 2009, 47, 353-60. Bazzano, L. A., The High Cost of Not Consuming Fruits and Vegetables. Journal of the American Dietetic Association 2006, 106, 1364-1368. Steffen, L. M., Eat your fruit and vegetables. The Lancet 2006, 367, 278-279. FSANZ Response to Heinemann et al on the regulation of GM crops and foods developed using gene silencing; May, 2013.
17
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 463 464 465
Figure/Table legends Figure 1. Relative abundance of small RNA size classes. The relative abundance of small RNA with length ranging from 18-26 nt is shown in the graph. All the samples had the typical plant small RNA size distribution with peaks at 21 and 24 nt. With the exception of the melon sample, the 21 nt class was more abundant than the 24 nt class. Figure 2. Relative abundance of small RNA size classes from small RNAs mapped to the virus sequence database. For all samples, most of the virus-mapped reads belong to the 21 and 22 nt size classes. These small RNAs, usually associated with posttranscriptional gene silencing, are likely generated upon viral infection. Figure 3. Validation of virus-mapped small RNA reads to their respective matching viral sequences. The top two virus matches for each sample were plotted against small RNAs mapped to them. Except for E and G, the small RNAs are mapped across the entire virus sequences. Therefore, there is a high probability that such virus is present in the samples. On the contrary, only sparse matches were found on E and G, which suggests that these mapped small RNAs were derived from different sources. Figure 4. Categories of reads and unique reads in small RNA size classes from the tomato sample. The total number of reads (A) and the unique number of reads (B) of tomato small RNAs that matched perfectly to the in-house assembled reference tomato genome (red line), virus genome database (green line) or both (purple line). Together they account for 78% (Table 3) of the all reads (blue line). Table 1. Total numbers of small RNA reads obtained from each sample library and the numbers of reads that mapped to the virus sequences. Only parsed (18-26 nt) reads were used in the mapping. Table 2. The top two virus matches and the numbers of small RNA reads matched in each sample library. The melon sample had only one virus match. For viruses that have more than one genome segments, only one segment is shown in the table, though all other segments had similar significant matches. Table 3. The number of total and unique small RNA reads in each category in the tomato sample. Table 4. Virus-mapped small RNA reads sharing sequence homology to human reference RNA set. The virus-mapped reads, 21- and 22-nt size classes, in each sample (Figure 2) were matched against the human reference RNA set downloaded from genome.ucsc.edu (02/07/2012), allowing up to two or three mismatches for each 21- and 22-nt small RNA read, respectively. Table 5. Examples of human RNAs matching the sequence of virus-mapped small RNA reads. Except for the melon sample where only few matches were found, the top three human RNA matches in each sample are listed. The majority of the reads matched to a 18
ACS Paragon Plus Environment
Page 18 of 31
Page 19 of 31
466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504
Journal of Agricultural and Food Chemistry
single sub-region of the transcript as shown using the base number referenced in GenBank. Table 6. Non-virus-mapped small RNA reads sharing sequence homology to human reference RNA set. The virus-mapped small RNA reads were removed from each small RNA library. The rest of the small RNA reads, presumably derived from endogenous sequences, were matched against the human reference RNA set downloaded from genome.ucsc.edu. The miRNAs, ribosomal RNAs, small nucleolar RNAs and small nuclear RNAs were excluded from the human reference RNA set and allowed up to two mismatches for each plant small RNA. The number of human genes matched as well as the numbers of unique and total small RNA reads whose sequences matched these human genes are shown. Supplemental tables (all in an Excel file) Table S1. List of virus-mapped small RNA reads in the tomato sample that share sequence homology to human reference RNA set. Table S2. List of virus-mapped small RNA reads in the melon sample that share sequence homology to human reference RNA set. Table S3. List of virus-mapped small RNA reads in the zucchini sample that share sequence homology to human reference RNA set. Table S4. List of virus-mapped small RNA reads in the onion sample that share sequence homology to human reference RNA set. Table S5. List of non-virus-mapped small RNA reads in the tomato sample that share sequence homology to human reference RNA set. Table S6. List of non-virus-mapped small RNA reads in the melon sample that share sequence homology to human reference RNA set. Table S7. List of non-virus-mapped small RNA reads in the zucchini sample that share sequence homology to human reference RNA set. Table S8. List of non-virus-mapped small RNA reads in the onion sample that share sequence homology to human reference RNA set.
19
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Page 20 of 31
Small RNA profiles from virus-infected fresh market vegetables, Frizzi et al. – Table of Contents/Abstract Graphics
Plant virus
Small RNAs
Endogenous
Virus derived
? Human mRNAs
1 ACS Paragon Plus Environment
Page 21 of 31
Journal of Agricultural and Food Chemistry
Small RNA profiles from virus-infected fresh market vegetables, Frizzi et al. – Figure 1
Relative abundance
50%
Tomato Melon
40%
Zucchini Onion
30% 20% 10% 0% 18
19
20
21
22
23
24
25
26
Length (nt) Figure 1. Relative abundance of small RNA size classes. The relative abundance of small RNA with length ranging from 18-26 nt is shown in the graph. All the samples had the typical plant small RNA size distribution with peaks at 21 and 24 nt. With the exception of the melon sample, the 21 nt class was more abundant than the 24 nt class.
2 ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Page 22 of 31
Small RNA profiles from virus-infected fresh market vegetables, Frizzi et al. – Figure 2
Relative abundance
60%
Tomato
50%
Melon Zucchini
40%
Onion 30% 20% 10% 0% 18
19
20
21
22
23
24
25
26
Length (nt) Figure 2. Relative abundance of small RNA size classes from small RNAs mapped to the virus sequence database. For all samples, most of the virus-mapped reads belong to the 21 and 22 nt size classes. These small RNAs, usually associated with post-transcriptional gene silencing, are likely generated upon viral infection.
3 ACS Paragon Plus Environment
1 316 631 946 1261 1576 1891 2206 2521 2836 3151 3466 3781 4096 4411 4726 5041 5356 5671 5986 6301 6616 6931 7246 7561 7876 8191 8506 8821 9136 9451 9766
Read abundance (blue: positive strand; red: negative strand)
0 0
-2000 - 50
-4000 - 100
-6000 - 150
-8000 - 200 1 314 627 940 1253 1566 1879 2192 2505 2818 3131 3444 3757 4070 4383 4696 5009 5322 5635 5948 6261 6574 6887 7200 7513 7826 8139 8452 8765 9078 9391
1 97 193 289 385 481 577 673 769 865 961 1057 1153 1249 1345 1441 1537 1633 1729 1825 1921 2017 2113 2209 2305 2401 2497 2593 2689 2785 2881
Page 23 of 31 Journal of Agricultural and Food Chemistry
Small RNA profiles from virus-infected fresh market vegetables, Frizzi et al. – Figure 3
A Tomato/Tomato spotted wilt virus, segment S B Tomato/Potato virus Y
8000 200
6000 150
4000 100
2000 50
C Melon/Watermelon mosaic virus
4
3
2
1
-1 0
-2
-3
-4
4
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Page 24 of 31
Small RNA profiles from virus-infected fresh market vegetables, Frizzi et al. – Figure 3 continued
1 326 651 976 1301 1626 1951 2276 2601 2926 3251 3576 3901 4226 4551 4876 5201 5526 5851 6176 6501 6826 7151 7476 7801 8126 8451 8776 9101 9426 9751
F Onion/Iris yellow spot virus, segment L
400 300 200 100 0 -100 -200 -300 -400 -500 -600
G Onion/Tomato yellow fruit ring virus, partial L gene
4000
400
3000
300
2000
200
1000
100
0
0
-1000
1 - 00
-3000
2 - 00
1 288 575 862 1149 1436 1723 2010 2297 2584 2871 3158 3445 3732 4019 4306 4593 4880 5167 5454 5741 6028 6315 6602 6889 7176 7463 7750 8037 8324 8611
-2000
1 28 55 82 109 136 163 190 217 244 271 298 325 352 379 406 433 460 487 514 541 568 595 622 649 676 703 730 757 784 811
3000 2000 1000 0 -1000 -2000 -3000 -4000 -5000 -6000 -7000
1 311 621 931 1241 1551 1861 2171 2481 2791 3101 3411 3721 4031 4341 4651 4961 5271 5581 5891 6201 6511 6821 7131 7441 7751 8061 8371 8681 8991 9301
E Zucchini/Soybean mosaic virus
D Zucchini/Watermelon mosaic virus
Genome/gene base coverage Figure 3. Validation of virus-mapped small RNA reads to their respective matching viral sequences. The top two virus matches for each sample were plotted against small RNAs mapped to them. Except for E and G, the small RNAs are mapped across the entire virus sequences. Therefore, there is a high probability that such virus is present in the samples. On the contrary, only sparse matches were found on E and G, which suggests that these mapped small RNAs were derived from different sources. 5 ACS Paragon Plus Environment
Page 25 of 31
Journal of Agricultural and Food Chemistry
Small RNA profiles from virus-infected fresh market vegetables, Frizzi et al. – Figure 4
B
A 1048576 262144 65536 16384 4096 1024 256 64 16 4 1
Number of reads
Number of unique reads
262144
18 19 20 21 22 23 24 25 26
65536 16384 4096 1024 256 64 16 4 1 18 19 20 21 22 23 24 25 26
Length (nt)
Length (nt)
All Tomato genome-mapped Virus-mapped Overlap
Figure 4. Categories of reads and unique reads in small RNA size classes from the tomato sample. The total number of reads (A) and the unique number of reads (B) of tomato small RNAs that matched perfectly to the inhouse assembled reference tomato genome (red line), virus genome database (green line) or both (purple line). Together they account for 78% (Table 3) of the all reads (blue line). 6 ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Page 26 of 31
Small RNA profiles from virus-infected fresh market vegetables, Frizzi et al. – Table 1
Parsed reads (18-26 nt)
Solanum lycopersicum Tomato RNA (fruit)
Virus-mapped reads Unique Total reads Unique reads Total reads reads 5315393 3609317 1209206 196069 23929
Melon
Cucumis melo
Melon RNA (fruit)
4825754
1636524
536672
185
175
Zucchini
Cucurbita pepo
Zucchini RNA (fruit)
5184745
3344051
838271
191085
22949
Allium cepa
Onion RNA (bulb + stem) 6038349
3145549
915603
301913
33144
Sample ID Tomato
Onion
Species
Tissue
Raw reads
Table 1. Total numbers of small RNA reads obtained from each sample library and the numbers of reads that mapped to the virus sequences. Only parsed (18-26 nt) reads were used in the mapping.
7 ACS Paragon Plus Environment
Page 27 of 31
Journal of Agricultural and Food Chemistry
Small RNA profiles from virus-infected fresh market vegetables, Frizzi et al. – Table 2
Sample Tomato Melon Zucchini Onion
Virus species matched
Mapped reads
GenBank accession
Tomato spotted wilt virus (TSWV), segment S
HQ402595
Total reads 100079
Potato virus Y (PVY)
HQ912865
12076
4040
Watermelon mosaic virus (WMV)
FJ623474
128
122
Watermelon mosaic virus (WMV)
EU660583
146333
17542
D00507
3607
419
Iris yellow spot virus (INSV), segment L
FJ623474
108325
15979
Tomato yellow fruit ring virus (TYFRV), partial L gene
AJ493271
802
91
Soybean mosaic virus (SMV)
Unique reads 6211
Table 2. The top two virus matches and the numbers of small RNA reads matched in each sample library. The melon sample had only one virus match. For viruses that have more than one genome segments, only one segment is shown in the table, though all other segments had similar significant matches.
8 ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Page 28 of 31
Small RNA profiles from virus-infected fresh market vegetables, Frizzi et al. – Table 3
Sample Tomato
All reads Unique Total Reads Reads 3609317 120206
Tomato genome-mapped reads Total reads 2596723
Virus-mapped reads
Overlap reads
Unique reads Total Reads Unique Reads Total reads 824996 196069 23929 2374
Unique reads 676
Table 3. The number of total and unique small RNA reads in each category in the tomato sample.
9 ACS Paragon Plus Environment
Page 29 of 31
Journal of Agricultural and Food Chemistry
Small RNA profiles from virus-infected fresh market vegetables, Frizzi et al. – Table 4
Sample
Number of human RNAs matched
Number of unique reads
Total reads
Tomato
1580
1786
21816
% of total virus-mapped reads sharing homology with human genes 11.1%
Melon
43
22
22
11.9%
Zucchini
884
1558
16634
8.7%
Onion
1399
1936
18005
6.0%
Table 4. Virus-mapped small RNA reads sharing sequence homology to human reference RNA set. The virusmapped reads, 21- and 22-nt size classes, in each sample (Figure 2) were matched against the human reference RNA set downloaded from genome.ucsc.edu (02/07/2012), allowing up to two or three mismatches for each 21- and 22-nt small RNA read, respectively.
10 ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Page 30 of 31
Small RNA profiles from virus-infected fresh market vegetables, Frizzi et al. – Table 5
Sample
Human RNA matched
GenBank ID
Region matched
Multiple EGF-like-domains 8 (MEGF8), mRNA NM_001410 10253-10274 Family with sequence similarity 190, member A (FAM190A), NM_001145065 5152-5170 Tomato transcript variant 1, mRNA Rho-associated, coiled-coil containing protein kinase 2 NM_004850 4525-4545, 997-1017 (ROCK2), mRNA WD repeat domain 3 (WDR3), mRNA NM_006784 3424-3444 Homo sapiens regulator of G-protein signaling 7 (RGS7), NM_002924 1273-1295 Zucchini mRNA Proline-rich transmembrane protein 2 (PRRT2), transcript NM_145239 1444-1464 variant 1, mRNA Triosephosphate isomerase 1 pseudogene 3 (TPI1P3), nonNR_027338 639-659 coding RNA Olfactory receptor, family 6, subfamily P, member 1 (OR6P1), Onion NM_001160325 635-654 mRNA DYX1C1-CCPG1 readthrough (non-protein coding) (DYX1C12962-2983, 3037NR_037923 CCPG1), non-coding RNA 3056
Number of Total reads unique reads 20 3451 1
2419
6
2337
6
1019
14
904
4
462
6
1043
10
941
11
689
Table 5. Examples of human RNAs matching the sequence of virus-mapped small RNA reads. Except for the melon sample where only few matches were found, the top three human RNA matches in each sample are listed. The majority of the reads matched to a single sub-region of the transcript as shown using the base number referenced in GenBank.
11 ACS Paragon Plus Environment
Page 31 of 31
Journal of Agricultural and Food Chemistry
Small RNA profiles from virus-infected fresh market vegetables, Frizzi et al. – Table 6
Sample Tomato Melon Zucchini Onion
Number of human RNAs Number of unique reads Total reads matched
% of total non-virus-mapped reads sharing homology with human genes
13683 8664
33189 14961
337610 157019
9.4% 9.6%
8286 13851
14633 34193
144654 342541
4.3% 10.9%
Table 6. Non-virus-mapped small RNA reads sharing sequence homology to human reference RNA set. The virusmapped small RNA reads were removed from each small RNA library. The rest of the small RNA reads, presumably derived from endogenous sequences, were matched against the human reference RNA set downloaded from genome.ucsc.edu. The miRNAs, ribosomal RNAs, small nucleolar RNAs and small nuclear RNAs were excluded from the human reference RNA set and allowed up to two mismatches for each plant small RNA. The number of human genes matched as well as the numbers of unique and total small RNA reads whose sequences matched these human genes are shown.
12 ACS Paragon Plus Environment