Small RNA Profiles from Virus-Infected Fresh Market Vegetables


Article

J. Agric. Food Chem., Just Accepted Manuscript • DOI: 10.1021/jf503756v • Publication Date (Web): 11 Nov 2014


Alessandra Frizzi1,#, Yuanji Zhang2,#, John Kao3, Charles Hagen1 and Shihshieh Huang1,*

1 Calgene Campus, Monsanto Company, 1920 Fifth St, Davis, CA 95616, USA
2 Chesterfield Campus, Monsanto Company, 700 Chesterfield Parkway West, Chesterfield, MO 63017, USA
3 Monsanto Vegetable Seeds - Woodland, 37437 State Hwy 16, Woodland, CA 95695, USA

# These authors contributed equally to this work
* Correspondence (fax +1 530 792-2005; email [email protected])

Keywords: Vegetable, small RNA, siRNA, miRNA, RNAi, RNA silencing, genetic engineering, Tomato spotted wilt virus (TSWV), Potato virus Y (PVY), Watermelon mosaic virus (WMV), Iris yellow spot virus (IYSV)

Abstract

Functional small RNAs, such as short interfering RNAs (siRNAs) and microRNAs (miRNAs), exist in freshly consumed fruits and vegetables. These siRNAs can be derived either from endogenous sequences or from viruses that infect the plants. Symptomatic tomatoes, watermelons, zucchini and onions were purchased from grocery stores and investigated by small RNA sequencing. By aligning the obtained small RNA sequences to sequences of known viruses, four different viruses were identified as infecting these fruits and vegetables. Many of these virally derived small RNAs, along with endogenous small RNAs, were found to be highly complementary to human genes. However, the established history of safe consumption of these vegetables suggests that this sequence homology has little biological relevance. By extension, these results provide evidence for the safe use by humans and animals of genetically engineered crops using RNA-based suppression technologies, especially vegetable crops with virus resistance conferred by expression of siRNAs or miRNAs derived from viral sequences.

INTRODUCTION

RNA interference (RNAi) or RNA silencing is a broadly used gene regulation mechanism in eukaryotes. It has been implicated in many biological processes such as regulation of growth and development, defense against pathogens and response to stress (1-3). Active research over the last decade has revealed the molecular basis of RNAi to a great extent (4-7). Most of the known RNAi components, some conserved across the kingdoms and some unique to a few species, can be divided into two groups. They are either involved in generation of the small RNAs, or part of various RNAi complexes directly or indirectly associated with the small RNAs. At the core of the RNAi mechanism lies the small RNA, whose primary sequence is used to guide the RNAi machinery to its specific gene target. The DNA or mRNA of the targeted genes, which bears a sequence complementary to the small RNA, is then modified by the RNAi machinery. These modifications, including DNA methylation, mRNA degradation or translational inhibition, eventually lead to the down-regulation of target genes and result in biological changes. Furthermore, these studies also found that exogenous small RNAs, whether synthesized chemically or generated through transgenic expression, can trigger RNAi with a similar biological effect as long as the required sequence signature is present.

The characteristic of eukaryotic RNAi whereby the sequence signature of an exogenous small RNA can trigger a gene-specific down-regulation in living organisms has immense implications in medicine and plant biotechnology. RNAi-based drugs treating diseases such as age-related macular degeneration (AMD), cancer, asthma and glaucoma, or combating viruses including Respiratory syncytial virus (RSV), Hepatitis C virus (HCV) and Human immunodeficiency virus (HIV), are under clinical development (8, 9). In the case of human therapeutics, RNAi-based drugs have met with tremendous challenges presented by the need to achieve delivery in the face of a myriad of biological and physicochemical barriers (10, 11). These barriers have limited the efficacy, and thus the commercial success, of this promising technology in therapeutics to date (9, 11). However, RNA-based suppression technology has been successfully applied in agriculture for more than a decade. The first transgenic food crop, the Flavr Savr tomato, commercialized in 1994, employed antisense technology (now known as a part of the RNA silencing mechanism) to maintain fruit firmness for easy handling (12, 13). Subsequently, virus-resistant squash and papaya were introduced to the market in the late 1990s. Although it was once thought to be mediated by transgenically expressed viral coat protein, the virus resistance in these plants is more likely to result from transgenic siRNAs derived from the viral coat protein gene. A list of RNA silencing-based biotech crops that are under development for commercial use was previously summarized by Frizzi and Huang (14).

Although theoretically the 20+ nt size of small RNAs allows them to specifically identify their intended mRNA targets, mismatches and gaps in base-pairing between miRNAs or siRNAs and their target mRNAs are commonly found. This suggests that not all the bases of a small RNA are necessary for target recognition. In plants, mRNA slicing is the principal mode of post-transcriptional RNA silencing regulation; a guiding small RNA requires strong base-pairing with its target at nucleotides 2-13, but sequence identity is less critical at nucleotides 15-20 (15, 16). Because the RNAi mechanism tolerates sequence mismatches and ambiguities, in vitro studies conducted with transfected small RNAs have shown that small RNAs can cause suppression of unintended gene targets (17). This outcome, commonly referred to as the “off-target” effect, has led to increased scrutiny regarding the specificity of RNAi-based drugs and has also been proposed as a putative concern for food crops that are engineered to produce small RNAs. However, this “off-target” effect has been characterized using in vitro systems and is orders of magnitude less potent than “on-target” suppression (9, 11). Such suppression also requires special sequence contexts, thermodynamic criteria, target accessibility and other conditions (18, 19).
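The position dependence of target recognition described above can be illustrated with a short sketch. It is not taken from the cited studies: the function names are hypothetical, G:U wobble pairs are counted as mismatches for simplicity, and both sequences are assumed to be written 5' to 3' and to have equal length.

```python
from typing import Dict, List

# Watson-Crick partners in the RNA alphabet (wobble pairs deliberately ignored).
_PAIR = {"A": "U", "U": "A", "G": "C", "C": "G"}


def mismatch_positions(small_rna: str, target_site: str) -> List[int]:
    """Return 1-based small RNA positions that are not Watson-Crick paired.

    Both sequences are given 5'->3'. The duplex is antiparallel, so small RNA
    position 1 pairs with the last base of the target site.
    """
    small_rna = small_rna.upper().replace("T", "U")
    target_site = target_site.upper().replace("T", "U")
    if len(small_rna) != len(target_site):
        raise ValueError("small RNA and target site must have equal length")
    partner = target_site[::-1]
    return [
        i + 1
        for i, (a, b) in enumerate(zip(small_rna, partner))
        if _PAIR.get(a) != b
    ]


def region_mismatches(small_rna: str, target_site: str) -> Dict[str, int]:
    """Count mismatches in the two regions named in the text (2-13 and 15-20)."""
    positions = mismatch_positions(small_rna, target_site)
    return {
        "2-13": sum(1 for p in positions if 2 <= p <= 13),
        "15-20": sum(1 for p in positions if 15 <= p <= 20),
    }
```

Under this simplified view, a candidate site with any mismatch in the 2-13 region would be expected to be a weaker slicing target than one whose mismatches fall only in the 15-20 region.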

To investigate the significance of regularly consumed small RNAs that share sequence homology with human genes, the small RNA profiles of fresh market fruits and vegetables were examined. These were locally purchased tomatoes, watermelons, zucchini and onions that were selected because their appearance suggested they were likely to be infected with viral disease(s). Using a next-generation high-throughput sequencing method, between 1.6 and 3.6 million small RNA sequences were obtained from each sample. Although most of the small RNAs found in these vegetables were likely derived from the endogenous genome, up to 9.5% were identified as sequences originating from viruses. Many of these small RNAs were found to share sequence similarity with human genes. Small RNA profiles such as these resemble those of plants genetically modified to express fragments of virus sequences for the purpose of conferring virus resistance (14). The analysis of the small RNAs presented here provides insights into the plant-virus interaction and also evidence of a history of safe consumption for dietary small RNAs. By extension, this research supports the safety of biotechnology-derived crops employing RNA-based gene suppression when consumed by humans or animals.

MATERIALS AND METHODS

Plant materials. Vegetables suspected of carrying virus diseases based on physical characteristics were selectively purchased at local farmers’ markets or supermarkets in the region of Yolo County, California. The tomato fruits displayed lesions with a chlorotic concentric ring pattern typical of those caused by Tomato spotted wilt virus (TSWV, family Bunyaviridae, genus Tospovirus). The onion bulbs with attached green leaves exhibited elongated lesions typical of those caused by Iris yellow spot virus (INSV, family Bunyaviridae, genus Tospovirus). The squash fruits had mosaic symptoms consistent with those caused by viruses such as Watermelon mosaic virus (WMV), Zucchini yellow mosaic virus (ZYMV) or Papaya ring spot virus (PRSV) (all viruses in the family Potyviridae, genus Potyvirus). The melon fruit was chosen based on mottling symptoms that might be caused by a number of different potyviruses.

Plant RNA isolation and small RNA sequencing. Symptomatic portions of the fruits/bulbs were harvested and ground in liquid nitrogen. Total RNA was extracted using TRIzol® (Invitrogen, Carlsbad, CA) following the manufacturer’s recommended protocol. RNA was quantified by NanoDrop® (Thermo Scientific, Wilmington, DE), and its integrity was verified by gel electrophoresis. Small RNA library construction and sequencing were performed with Illumina technology (Illumina Inc, San Diego, CA) as described previously (20). Four to six million raw reads were generated from each library. After the 5’ and 3’ adaptors were identified and removed from the raw reads, reads with sequence length from 18 to 26 nt were parsed out for further analysis (Figure 1). The sequences of these raw reads can be found in NCBI under BioProject accession PRJNA265505.
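The read-processing step described above (adaptor removal followed by selection of 18 to 26 nt inserts) might be sketched as follows. This is a minimal illustration and not the pipeline used in the study: only the 3' adaptor is handled, exact adaptor matching is assumed, the adaptor sequence is a placeholder (the actual adaptors are not given in the text), and the function names are hypothetical.

```python
from typing import Iterable, Iterator, Optional

# Placeholder 3' adaptor for illustration only; not taken from the manuscript.
EXAMPLE_ADAPTER_3P = "TGGAATTCTCGG"


def trim_3p_adapter(read: str, adapter: str) -> Optional[str]:
    """Return the insert preceding the first exact adaptor match, or None if absent."""
    pos = read.find(adapter)
    if pos == -1:
        return None
    return read[:pos]


def select_small_rnas(
    reads: Iterable[str],
    adapter: str = EXAMPLE_ADAPTER_3P,
    min_len: int = 18,
    max_len: int = 26,
) -> Iterator[str]:
    """Yield adaptor-trimmed inserts whose length lies within [min_len, max_len]."""
    for read in reads:
        insert = trim_3p_adapter(read.upper(), adapter)
        if insert is not None and min_len <= len(insert) <= max_len:
            yield insert
```

Reads without a detectable adaptor are discarded in this sketch, since their insert length cannot be determined; a production pipeline would typically also tolerate sequencing errors within the adaptor.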

Small RNA sequence analysis. The parsed small RNAs were mapped to plant virus sequences downloaded from www.dpvweb.net on 03/02/2012. Small RNA matching was performed using “SHRiMP” v2 (21). Only perfectly mapped small RNAs were counted. RNAi plays an important antiviral role in plants. Upon virus infection, small RNAs are generated with the help of host plant Dicer-like proteins from viral dsRNAs that are either viral replicative intermediates or products of host RNA-dependent RNA polymerases acting on viral templates. The viral siRNAs derive from both strands of the dsRNA and are predominantly 21 and 22 nt in length. To remove noise from the virus mapping results, such as non-virus sequence contamination, we applied three filters. A virus was called as infecting the plant only if 1) its sequence had at least ten perfectly matched, sequence-distinct small RNAs; 2) 21 and 22 nt small RNAs accounted for at least 60% of the small RNAs that mapped to the virus sequence; and 3) the abundance of small RNAs mapping to either strand of the virus sequence was at least 5% of the overall small RNA abundance mapped to that sequence. Virus-mapped small RNAs were then compared to the human reference RNA set downloaded from genome.ucsc.edu on 02/07/2012. For this analysis, up to two mismatches in 21 nt small RNAs and three mismatches in 22 nt small RNAs were allowed. Similarly, all small RNAs not mapped to virus sequences (‘non-virus-mapped’), which were presumably derived from endogenous plant sequences, were also compared to the human reference RNA set. To match this subset of plant small RNAs against human reference genes, human miRNAs, ribosomal RNAs, small nucleolar RNAs and small nuclear RNAs were excluded from the reference RNA set and up to two mismatches were allowed for each match. Detailed lists of the virus-mapped small RNAs and small RNAs homologous to human genes can be found in the supplemental tables.
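The three virus-calling filters described above could be expressed as a single predicate, sketched below. The record layout (`Hit`) and function name are hypothetical, and two readings of the text are assumptions: the 60% threshold is applied to read abundance rather than to the number of distinct sequences, and the 5% threshold is required of each strand.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class Hit:
    """One perfectly matched small RNA sequence (hypothetical record layout)."""
    sequence: str  # small RNA sequence
    strand: str    # "+" or "-" relative to the virus reference
    count: int     # number of reads carrying this sequence


def passes_filters(
    hits: Iterable[Hit],
    min_distinct: int = 10,
    min_frac_21_22: float = 0.60,
    min_strand_frac: float = 0.05,
) -> bool:
    """Return True if the hits to one virus sequence satisfy all three filters."""
    hits = list(hits)
    total = sum(h.count for h in hits)
    if total == 0:
        return False
    # Filter 1: at least ten distinct, perfectly matched sequences.
    if len({h.sequence for h in hits}) < min_distinct:
        return False
    # Filter 2: 21 and 22 nt reads make up at least 60% of mapped abundance
    # (assumption: abundance, not distinct-sequence count).
    frac_21_22 = sum(h.count for h in hits if len(h.sequence) in (21, 22)) / total
    if frac_21_22 < min_frac_21_22:
        return False
    # Filter 3: each strand carries at least 5% of mapped abundance
    # (assumption: the threshold applies to both strands).
    by_strand = Counter()
    for h in hits:
        by_strand[h.strand] += h.count
    return min(by_strand["+"], by_strand["-"]) / total >= min_strand_frac
```

The strand filter reflects the expectation stated in the text that genuine viral siRNAs arise from both strands of a dsRNA precursor, whereas contaminating sequences would tend to map to one strand only.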

RESULTS

Deep sequencing of vegetable small RNA populations. RNA extracted from tomato (fruit), melon (fruit), zucchini (fruit) and onion (bulb and stem) was sequenced using Illumina technology. Small RNA sequences ranging in length from 18 to 26 nt were parsed out for subsequent analyses (Figure 1 and Table 1). The size distributions are typical of plant small RNAs, with major peaks at 21 and 24 nt, consistent with plant Dicer-like protein (DCL) cleavage products (22). The 21 nt class was more abundant than the 24 nt class in all samples except melon. The total number of reads varied from 1,636,524 to 3,609,317, with unique reads well represented in all the libraries (Table 1).
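A size-class profile of the kind summarized in Figure 1 can be computed from a list of trimmed reads as sketched below. The function name and defaults are hypothetical; the sketch simply tabulates the fraction of reads at each length from 18 to 26 nt.

```python
from collections import Counter
from typing import Dict, Iterable


def size_distribution(
    reads: Iterable[str], min_len: int = 18, max_len: int = 26
) -> Dict[int, float]:
    """Return the fraction of in-range reads at each length from min_len to max_len."""
    counts = Counter(len(r) for r in reads if min_len <= len(r) <= max_len)
    total = sum(counts.values())
    return {
        n: (counts[n] / total if total else 0.0)
        for n in range(min_len, max_len + 1)
    }
```

Comparing the values at 21 and 24 nt in the returned dictionary gives the relative size-class abundance discussed in the text.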

Size class distribution of virus-mapped small RNAs. Several filters were applied when mapping the parsed plant-derived small RNA sequences to the virus genome collections obtained from www.dpvweb.net (see Small RNA sequence analysis in Materials and Methods). The 21-22 nt size class siRNAs are significantly more abundant than the 24 nt class in all four samples (Figure 2), consistent with the size of small RNA products of post-transcriptional gene silencing usually associated with viral infection. However, the number of reads in the virus-mapped melon small RNA population is more than 100-fold lower than in the other vegetable libraries (Table 1).

Validation of virus-mapped matches. The genomic sequences of the top two virus matches generated from mapping the siRNA libraries to the virus sequence collection (except for the melon library, where only one matching virus was identified) were plotted against the sequences of their respective small RNA matches to illustrate their distribution (Figure 3). In addition to the filters described earlier, the presence of small RNAs across the entire length of the putative virus hit serves as a validation criterion suggesting the legitimate presence of the virus in the sample (Figure 3A, 3B, 3C, 3D and 3F). In the case of the melon sample, although the number of small RNA matches is smaller (possibly reflecting low virus titer in the tissue), the virus hit seems to be authentic. Conversely, a sparsely distributed match pattern (Figure 3E and 3G) is an indication that the matching small RNAs likely originated from sources other than the matched virus. Due to sequence similarities between viruses, some small RNAs match one virus and also cross-match another. For example, in the zucchini sample, most of the small RNAs mapped to Soybean mosaic virus, SMV (Figure 3E), also mapped to WMV (Figure 3D), and in the onion sample, all the small RNAs mapped to Tomato yellow fruit ring virus, TYFRV (Figure 3G), also mapped to INSV (Figure 3F).

In the case of Tomato spotted wilt virus segment S, matched in the tomato sample (Figure 3A), the small RNA reads in both the sense and antisense orientations come predominantly from the regions corresponding to the nonstructural and nucleocapsid protein coding sequences, and noticeably less from the intergenic region that is not actively transcribed. For PVY (Figure 3B), WMV (Figure 3D), and INSV segment L (Figure 3F), all regions are evenly represented (notice the change of scale), consistent with the generation of a single transcript over the genomic segment. The relative abundance of these virus-derived small RNAs across the virus genome and their sequence signatures could also reveal the biochemical properties of plant small RNA processing. For example, using virus-derived small RNA sequences from TSWV-infected tomato and Nicotiana benthamiana, details of small RNA differential processing were examined previously (23). Our small RNA sequencing results, which include three additional viruses, could be applied to such studies in the future.

Characterization of tomato library reads. The tomato small RNA library was further investigated and mapped to the Monsanto in-house assembled tomato reference genome. Only sequences with perfect matches to the tomato and/or viral genomes are included in the analysis, and these matches accounted for 78% of all reads (Table 3). The remaining non-matching reads could be mostly sequencing errors, with some representing single nucleotide polymorphisms (SNPs), since the sample was sourced from a different tomato variety than the reference sequence. As previously observed, the abundance of the 21-22 nt classes appears to have been enhanced by the virus-derived small RNAs (Figure 4A). The 24 nt class small RNAs, mostly derived from the endogenous tomato genome, are both abundant and rich in unique sequences (Figure 4A and 4B). This is probably due to their roles in genome-wide transcriptional regulation and chromatin modification (22).

Comparison of vegetable small RNAs to the human transcriptome. To evaluate whether small RNAs derived from food have sequence similarity to human genes, we compared our libraries of small RNA sequences against a database of human reference RNAs. A recent study suggested that ingestion of a rice-derived miRNA matching a human gene over its open reading frame could cause transcript suppression in humans (24), a claim that has been strongly challenged in the peer-reviewed literature by several independent research groups (25-27). Nevertheless, similar matching criteria were used in this evaluation.

We divided the small RNAs in each library into virus-mapped and non-virus-mapped reads, representing exogenous and endogenous sources for the sequences, respectively. To simplify this bioinformatic evaluation, the positions of the mismatches, which can have varying degrees of impact on target gene suppression efficacy (15, 16), were not given special consideration. Table 4 shows that, when allowing up to 2 mismatches in the 21 nt size class and 3 mismatches in the 22 nt size class, many human genes are homologous to virally derived small RNAs. A total of 1580, 43, 884 and 1399 human genes have at least one sub-region sharing sequence similarity with virus-mapped small RNAs in the tomato, melon, zucchini and onion small RNA libraries, respectively (Table S1, S2, S3 and S4). Except for the melon library, where fewer virus-mapped small RNAs were found, virus-derived small RNAs that match human gene transcripts appear abundant. For example, in the tomato library, 21816 reads, or 11.1% of the virus-mapped reads, are highly homologous to 1580 human genes (Table 4). In some instances, thousands of virus-derived small RNAs can match a single human gene. The three human genes matched by the most virus-mapped small RNAs in the tomato, zucchini and onion libraries are listed in Table 5. However, there is no human gene match shared by the virus-derived small RNAs in all four libraries.
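The position-independent matching rule used here (up to 2 mismatches for 21 nt and up to 3 for 22 nt small RNAs) can be sketched with a brute-force sliding-window comparison. This is an illustration only and not the mapping software used in the study; the function names are hypothetical, and checking both the small RNA and its reverse complement against the transcript is an assumption.

```python
from typing import Optional

_COMPLEMENT = str.maketrans("ACGTU", "TGCAA")


def reverse_complement(seq: str) -> str:
    """Reverse complement in the DNA alphabet (U is treated like T)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def allowed_mismatches(length: int) -> Optional[int]:
    """Mismatch allowance stated in the text; None for other size classes."""
    return {21: 2, 22: 3}.get(length)


def min_mismatches(query: str, target: str) -> Optional[int]:
    """Smallest Hamming distance between query and any same-length window of target."""
    k = len(query)
    if k == 0 or len(target) < k:
        return None
    best = k
    for start in range(len(target) - k + 1):
        window = target[start:start + k]
        dist = sum(1 for a, b in zip(query, window) if a != b)
        best = min(best, dist)
    return best


def matches_transcript(small_rna: str, transcript: str) -> bool:
    """True if the small RNA (either orientation) is within the allowed mismatches."""
    limit = allowed_mismatches(len(small_rna))
    if limit is None:
        return False
    small_rna = small_rna.upper()
    transcript = transcript.upper()
    for query in (small_rna, reverse_complement(small_rna)):
        dist = min_mismatches(query, transcript)
        if dist is not None and dist <= limit:
            return True
    return False
```

Because mismatch positions are ignored, this rule is deliberately permissive; it counts sequence similarity and says nothing about whether a match would be functional.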

Beyond the virus-derived small RNAs, the remaining small RNAs in these fruit or vegetable small RNA libraries are probably derived from endogenous plant sequences. These small RNAs can also share sequence homology with human genes. To eliminate trivial and low-complexity matches, the miRNAs, ribosomal RNAs, small nucleolar RNAs and small nuclear RNAs in the human reference RNA set that could be matched by the corresponding fruit or vegetable ribosomal RNA sequences were excluded from the sequence search. Up to two mismatches were allowed for each small RNA, and the results are shown in Table 6. These results imply that up to 10.9% of the total small RNAs in regularly consumed vegetables share a high degree of sequence similarity with human genes. Furthermore, 2774 human gene matches are shared by all four libraries (Table S5, S6, S7 and S8). This suggests that some human genes are frequently matched by small RNAs derived from different fruits and vegetables. Although most of these matched sequences are present at low copy numbers, a number of endogenous small RNAs with high copy numbers appear to have significant sequence similarity with human genes. For example, miRNA159a, which is abundant in all four libraries, can be mapped to a human gene encoding the polycystic kidney and hepatic disease 1 (PKHD1) protein (Table S5, S6, S7 and S8).
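The count of human genes matched in all four libraries corresponds to a set intersection over the per-library gene lists. A minimal sketch follows; the function name and input layout are hypothetical, and any gene identifiers other than PKHD1 used with it would be placeholders rather than data from the supplemental tables.

```python
from typing import Dict, Iterable, Set


def shared_genes(matches_by_library: Dict[str, Iterable[str]]) -> Set[str]:
    """Return the genes matched by small RNAs in every library."""
    gene_sets = [set(genes) for genes in matches_by_library.values()]
    if not gene_sets:
        return set()
    return gene_sets[0].intersection(*gene_sets[1:])
```

Applied to the four gene lists in Tables S5 to S8, the size of the returned set would correspond to the number of shared gene matches reported in the text.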

DISCUSSION

A typical plant small RNA size distribution is characterized by abundant 21 and 24 nt size classes; the 24 nt class is commonly present at an abundance similar to, or greater than, that of the 21 nt size class (22). The 24 nt class small RNAs are responsible for transcriptional regulation and chromatin modification (28), while the 21 nt class small RNAs are mostly involved in post-transcriptional regulation (29) and antiviral defense (30, 31). Except for the melon sample, the small RNA size profiles of all fruit and vegetable samples displayed a higher abundance of 21 nt small RNAs (Figure 1). This is likely due to the presence of viruses in these tissues that induced an antiviral RNA silencing response, resulting in the accumulation of 21 nt small RNAs. Indeed, significant amounts of the total small RNA populations were found to have originated from viral sequences, which partially contributed to the greater abundance of the 21 nt class seen in the tomato, zucchini and onion samples (Figure 2). However, a direct comparison of the small RNA populations between known infected and uninfected plants would be necessary to confirm these results.

Previously, we have shown that it is possible to reassemble significant portions of the TSWV genome from overlapping small RNA sequences obtained by small RNA sequencing of tomato infected with the virus (20). Here we report that several viruses can be easily and precisely detected in fresh market vegetables using small RNA sequencing. Furthermore, except for the WMV in the melon sample, where virus accumulation was likely to be low, the contigs assembled from the small RNA sequences from the other samples cover large portions of the virus genomes. For example, as much as 89% of TSWV, 84% of PVY, 76% of WMV and 96% of INSV total genomic sequences are covered by the small RNAs in their respective samples (data not shown). Therefore, it is possible to use small RNA sequencing to assemble the genome of a novel virus, especially when the endogenous small RNAs of the host plant can be identified and excluded. With the gradual reduction in cost since its invention, small RNA sequencing has the potential to be used as a one-step diagnostic and discovery tool for plant viruses in the future.
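Genome coverage figures such as those quoted above can be derived from the alignment positions of the mapped small RNAs. The sketch below shows one way to do so; it is not the authors' procedure, the function name is hypothetical, and alignments are assumed to be given as 0-based start positions with read lengths.

```python
from typing import Iterable, Tuple


def covered_fraction(
    genome_length: int, alignments: Iterable[Tuple[int, int]]
) -> float:
    """Fraction of genome positions covered by at least one aligned small RNA.

    Each alignment is a (start, length) pair with a 0-based start position.
    """
    if genome_length <= 0:
        return 0.0
    covered = bytearray(genome_length)
    for start, length in alignments:
        begin = max(start, 0)
        end = min(start + length, genome_length)
        for pos in range(begin, end):
            covered[pos] = 1
    return sum(covered) / genome_length
```

For a segmented genome, the same calculation could be run per segment and combined by weighting each segment by its length.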

Since RNA-mediated gene suppression in plants and animals operates through fundamentally similar mechanisms, and synthetic dsRNA or siRNA/miRNA are known to work in plant or animal cells in vitro in a sequence-specific manner, some have cautioned against the use of RNA-based gene suppression in plant biotechnology. Recently, a study claiming that a miRNA, miR168a, from rice can target a mouse gene for suppression through oral ingestion has intensified the debate (24). The experimental design and assumptions of this study have been disputed (11, 27, 32). This phenomenon of significant miRNA uptake in mice, nonhuman primates and humans has not been reproduced by others (25-27), and the physiological impacts reported by Zhang and colleagues were not reproduced in a properly controlled feeding study with higher miR168a exposures (27). In addition, these observations are inconsistent with the weight of the evidence for the safety of ingested nucleic acids, as evidenced by biological barriers to their uptake and/or activity in higher organisms, their extensive history of safe consumption and the delivery challenges faced by pharmaceutical companies developing oligonucleotide-based therapeutics (11, 33). The human diet is made up largely of eukaryotic sources, which contain an abundance of small RNAs. Therefore, it is not surprising that such small RNAs were found, intact, in food through small RNA sequencing (24-27, 34). Studies have also demonstrated that some endogenous long dsRNAs in crop plants share sequence complementarity of at least 21 nucleotides with human genes (33, 34).

In the current study, we have sequenced the small RNAs in fruits and vegetables that are readily available for purchase and likely to be ingested by consumers in an unprocessed or raw state. Sequences highly homologous to human genes were abundant in these fruits and vegetables, and we found that small RNAs originating within the plant and from infecting viruses can have matches to human genes. However, the consumption of fresh fruits and vegetables like these is not associated with an obvious health risk; indeed, the opposite is true (35, 36).

It is logical to assume that small RNAs produced by transgenes are fundamentally no different from endogenously derived plant small RNAs. The data presented here provide additional support for the assumption that small RNAs derived from transgenic sources are no more impactful to human health than the multitudes of endogenous plant small RNAs present in foods consumed every day. This opinion is shared by the independent statutory agency Food Standards Australia New Zealand (FSANZ), which has noted that “there is no scientific basis for suggesting that small dsRNAs present in some GM foods have different properties or pose a greater risk than those already naturally abundant in conventional foods” (37). Coupled with the long history of safe consumption of these dietary small RNAs, the results described here provide further evidence to suggest that RNA-based gene suppression in biotech crops is unlikely to present a direct safety hazard to humans or animals.

ABBREVIATIONS USED

TSWV, Tomato spotted wilt virus; PVY, Potato virus Y; WMV, Watermelon mosaic virus; SMV, Soybean mosaic virus; INSV, Iris yellow spot virus; TYFRV, Tomato yellow fruit ring virus; ZYMV, Zucchini yellow mosaic virus; PRSV, Papaya ring spot virus.

ACKNOWLEDGEMENTS

We thank Daniel Ader, Mingya Huang and Mitchell Sudkamp for constructing the small RNA libraries and performing the small RNA sequencing, and Jay Petrick for his valuable comments on the manuscript. We also thank Sofia Castiglioni for her contribution to the artwork in the Table of Contents/Abstract Graphic.

REFERENCES

1. Hannon, G. J., RNA interference. Nature 2002, 418, 244-51.
2. Baulcombe, D., RNA silencing in plants. Nature 2004, 431, 356-63.
3. Mello, C. C.; Conte, D., Jr., Revealing the world of RNA interference. Nature 2004, 431, 338-42.
4. Kim, V. N., Small RNAs: classification, biogenesis, and function. Mol Cells 2005, 19, 1-15.
5. Brodersen, P.; Voinnet, O., The diversity of RNA silencing pathways in plants. Trends Genet 2006, 22, 268-80.
6. Mallory, A. C.; Vaucheret, H., Functions of microRNAs and related small RNAs in plants. Nature genetics 2006, 38 Suppl, S31-6.
7. Siomi, H.; Siomi, M. C., On the road to reading the RNA-interference code. Nature 2009, 457, 396-404.
8. Tiemann, K.; Rossi, J. J., RNAi-based therapeutics-current status, challenges and prospects. EMBO Mol Med 2009, 1, 142-51.
9. Vaishnaw, A. K.; Gollob, J.; Gamba-Vitalo, C.; Hutabarat, R.; Sah, D.; Meyers, R.; de Fougerolles, T.; Maraganore, J., A status report on RNAi therapeutics. Silence 2010, 1, 14.
10. O'Neill, M. J.; Bourre, L.; Melgar, S.; O'Driscoll, C. M., Intestinal delivery of non-viral gene therapeutics: physiological barriers and preclinical models. Drug Discov Today 2011, 16, 203-218.
11. Petrick, J. S.; Brower-Toland, B.; Jackson, A. L.; Kier, L. D., Safety assessment of food and feed from biotechnology-derived crops employing RNA-mediated gene regulation to achieve desired traits: A scientific review. Regulatory toxicology and pharmacology 2013, 66, 167-76.
12. Sanders, R. A.; Hiatt, W., Tomato transgene structure and silencing. Nat Biotechnol 2005, 23, 287-9.
13. Krieger, E. K.; Allen, E.; Gilbertson, L. A.; Roberts, J. K.; Hiatt, W.; Sanders, R. A., The Flavr Savr Tomato, an early example of RNAi technology. HortScience 2008, 43, 962-964.
14. Frizzi, A.; Huang, S., Tapping RNA silencing pathways for plant biotechnology. Plant Biotechnol J 2010, 8, 655-77.
15. Parizotto, E. A.; Dunoyer, P.; Rahm, N.; Himber, C.; Voinnet, O., In vivo investigation of the transcription, processing, endonucleolytic activity, and functional relevance of the spatial distribution of a plant miRNA. Genes Dev 2004, 18, 2237-42.
16. Debernardi, J. M.; Rodriguez, R. E.; Mecchia, M. A.; Palatnik, J. F., Functional specialization of the plant miR396 regulatory network through distinct microRNA-target interactions. PLoS Genet 2012, 8, e1002419.
17. Jackson, A. L.; Burchard, J.; Schelter, J.; Chau, B. N.; Cleary, M.; Lim, L.; Linsley, P. S., Widespread siRNA "off-target" transcript silencing mediated by seed region sequence complementarity. RNA 2006, 12, 1179-87.
18. Liu, L.; Li, Q. Z.; Lin, H.; Zuo, Y. C., The effect of regions flanking target site on siRNA potency. Genomics 2013, 102, 215-222.
19. Naito, Y.; Yoshimura, J.; Morishita, S.; Ui-Tei, K., siDirect 2.0: updated software for designing functional siRNA with reduced seed-dependent off-target effect. BMC Bioinformatics 2009, 10, 392.
20. Hagen, C.; Frizzi, A.; Kao, J.; Jia, L.; Huang, M.; Zhang, Y.; Huang, S., Using small RNA sequences to diagnose, sequence, and investigate the infectivity characteristics of vegetable-infecting viruses. Archives of Virology 2011, 156, 1209-1216.
21. Rumble, S. M.; Lacroute, P.; Dalca, A. V.; Fiume, M.; Sidow, A.; Brudno, M., SHRiMP: accurate mapping of short color-space reads. PLoS Comput Biol 2009, 5, e1000386.
22. Nobuta, K.; Lu, C.; Shrivastava, R.; Pillay, M.; De Paoli, E.; Accerbi, M.; Arteaga-Vazquez, M.; Sidorenko, L.; Jeong, D.-H.; Yen, Y.; Green, P. J.; Chandler, V. L.; Meyers, B. C., Distinct size distribution of endogeneous siRNAs in maize: Evidence from deep sequencing in the mop1-1 mutant. Proc Natl Acad Sci U S A 2008, 105, 14958-63.
23. Mitter, N.; Koundal, V.; Williams, S.; Pappu, H., Differential expression of tomato spotted wilt virus-derived viral small RNAs in infected commercial and experimental host plants. PLoS One 2013, 8, e76276.
24. Zhang, L.; Hou, D.; Chen, X.; Li, D.; Zhu, L.; Zhang, Y.; Li, J.; Bian, Z.; Liang, X.; Cai, X.; Yin, Y.; Wang, C.; Zhang, T.; Zhu, D.; Zhang, D.; Xu, J.; Chen, Q.; Ba, Y.; Liu, J.; Wang, Q.; Chen, J.; Wang, J.; Wang, M.;

ACS Paragon Plus Environment

Page 16 of 31

Page 17 of 31

388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419

Journal of Agricultural and Food Chemistry

25. 26.

27. 28. 29. 30. 31. 32. 33.

34.

35. 36. 37.

Zhang, Q.; Zhang, J.; Zen, K.; Zhang, C.-Y., Exogenous plant MIR168a specifically targets mammalian LDLRAP1: evidence of cross-kingdom regulation by microRNA. Cell research 2011, 22, 107-26. Snow, J. W.; Hale, A.; Isaacs, S. K.; Baggish, A. L.; Chan, S. Y., Ineffective delivery of diet-derived microRNAs to recipient animal organisms. RNA Biology 2013, 10, 1-10. Witwer, K. W.; McAlexander, M. A.; Queen, S. E.; Adams, R. J., Real-time quantitative PCR and droplet digital PCR for plant miRNAs in mammalian blood provide little evidence for general uptake of dietary miRNAs. RNA Biology 2013, 10, 1-7. Dickinson, B.; Zhang, Y. J.; Petrick, J. S.; Heck, G.; Ivashuta, S.; Marshall, W. S., Lack of detectable oral bioavailability of plant microRNAs after feeding in mice. Nature Biotechnology 2013, 31, 965-967. Matzke, M. A.; Birchler, J. A., RNAi-mediated pathways in the nucleus. Nat Rev Genet 2005, 6, 24-35. Vaucheret, H., Post-transcriptional small RNA pathways in plants: mechanisms and regulations. Genes Dev 2006, 20, 759-71. Voinnet, O., Induction and suppression of RNA silencing: insights from viral infections. Nat Rev Genet 2005, 6, 206-20. Wang, M. B.; Metzlaff, M., RNA silencing and antiviral defense in plants. Curr Opin Plant Biol 2005, 8, 21622. Zhang, Y.; Wiggins, B. E.; Lawrence, C.; Petrick, J.; Ivashuta, S.; Heck, G., Analysis of plant-derived miRNAs in animal small RNA datasets. BMC Genomics 2012, 13, 381. Jensen, P. D.; Zhang, Y.; Wiggins, B. E.; Petrick, J. S.; Zhu, J.; Kerstetter, R. A.; Heck, G. R.; Ivashuta, S. I., Computational sequence analysis of predicted long dsRNA transcriptomes of major crops reveals sequence complementarity with human genes. GM Crops and Food: Biotechnology in Agriculture and the Food Chain 2013, 4, 1-8. Ivashuta, S. I.; Petrick, J. S.; Heisel, S. E.; Zhang, Y.; Guo, L.; Reynolds, T. L.; Rice, J. F.; Allen, E.; Roberts, J. K., Endogenous small RNAs in grain: semi-quantification and sequence homology to human and animal genes. 
Food Chem Toxicol 2009, 47, 353-60. Bazzano, L. A., The High Cost of Not Consuming Fruits and Vegetables. Journal of the American Dietetic Association 2006, 106, 1364-1368. Steffen, L. M., Eat your fruit and vegetables. The Lancet 2006, 367, 278-279. FSANZ Response to Heinemann et al on the regulation of GM crops and foods developed using gene silencing; May, 2013.

17

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 463 464 465

Figure/Table legends

Figure 1. Relative abundance of small RNA size classes. The graph shows the relative abundance of small RNAs 18-26 nt in length. All samples had the typical plant small RNA size distribution, with peaks at 21 and 24 nt. With the exception of the melon sample, the 21 nt class was more abundant than the 24 nt class.

Figure 2. Relative abundance of small RNA size classes among small RNAs mapped to the virus sequence database. For all samples, most of the virus-mapped reads belong to the 21 and 22 nt size classes. These small RNAs, usually associated with post-transcriptional gene silencing, are likely generated upon viral infection.

Figure 3. Validation of virus-mapped small RNA reads against their respective matching viral sequences. For each sample, the small RNAs mapped to each of the top two virus matches were plotted along the matched viral sequence. Except in panels E and G, the small RNAs map across the entire viral sequence, so there is a high probability that the corresponding virus is present in the sample. In contrast, only sparse matches were found in panels E and G, which suggests that these mapped small RNAs were derived from different sources.

Figure 4. Categories of reads and unique reads in small RNA size classes from the tomato sample. The total number of reads (A) and the number of unique reads (B) of tomato small RNAs that matched perfectly to the in-house assembled reference tomato genome (red line), the virus genome database (green line), or both (purple line). Together they account for 78% (Table 3) of all reads (blue line).

Table 1. Total numbers of small RNA reads obtained from each sample library and the numbers of reads that mapped to the virus sequences. Only parsed (18-26 nt) reads were used in the mapping.

Table 2. The top two virus matches and the numbers of small RNA reads matched in each sample library. The melon sample had only one virus match. For viruses that have more than one genome segment, only one segment is shown in the table, though all other segments had similarly significant matches.

Table 3. The number of total and unique small RNA reads in each category in the tomato sample.

Table 4. Virus-mapped small RNA reads sharing sequence homology to the human reference RNA set. The virus-mapped reads of the 21- and 22-nt size classes in each sample (Figure 2) were matched against the human reference RNA set downloaded from genome.ucsc.edu (02/07/2012), allowing up to two or three mismatches for each 21- and 22-nt small RNA read, respectively.

Table 5. Examples of human RNAs matching the sequence of virus-mapped small RNA reads. Except for the melon sample, where only a few matches were found, the top three human RNA matches in each sample are listed. The majority of the reads matched a single sub-region of the transcript, shown using the base numbering referenced in GenBank.

Table 6. Non-virus-mapped small RNA reads sharing sequence homology to the human reference RNA set. The virus-mapped small RNA reads were removed from each small RNA library. The remaining small RNA reads, presumably derived from endogenous sequences, were matched against the human reference RNA set downloaded from genome.ucsc.edu. The miRNAs, ribosomal RNAs, small nucleolar RNAs, and small nuclear RNAs were excluded from the human reference RNA set, and up to two mismatches were allowed for each plant small RNA. The number of human genes matched, as well as the numbers of unique and total small RNA reads whose sequences matched these human genes, are shown.

Supplemental tables (all in an Excel file)

Table S1. List of virus-mapped small RNA reads in the tomato sample that share sequence homology to the human reference RNA set.
Table S2. List of virus-mapped small RNA reads in the melon sample that share sequence homology to the human reference RNA set.
Table S3. List of virus-mapped small RNA reads in the zucchini sample that share sequence homology to the human reference RNA set.
Table S4. List of virus-mapped small RNA reads in the onion sample that share sequence homology to the human reference RNA set.
Table S5. List of non-virus-mapped small RNA reads in the tomato sample that share sequence homology to the human reference RNA set.
Table S6. List of non-virus-mapped small RNA reads in the melon sample that share sequence homology to the human reference RNA set.
Table S7. List of non-virus-mapped small RNA reads in the zucchini sample that share sequence homology to the human reference RNA set.
Table S8. List of non-virus-mapped small RNA reads in the onion sample that share sequence homology to the human reference RNA set.


Small RNA profiles from virus-infected fresh market vegetables, Frizzi et al. – Table of Contents/Abstract Graphics

[Graphic labels: Plant virus; Small RNAs; Endogenous; Virus derived; ? Human mRNAs]

[Figure 1: plot of relative abundance (y axis, 0% to 50%) against small RNA length (x axis, 18 to 26 nt) for the Tomato, Melon, Zucchini, and Onion samples.]

Figure 1. Relative abundance of small RNA size classes. The relative abundance of small RNA with length ranging from 18-26 nt is shown in the graph. All the samples had the typical plant small RNA size distribution with peaks at 21 and 24 nt. With the exception of the melon sample, the 21 nt class was more abundant than the 24 nt class.
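A size-class profile of the kind plotted in Figure 1 can be sketched as follows. This is a minimal illustration, assuming reads are available as plain sequence strings and that abundance is counted over total (not unique) reads; the function name `size_class_abundance` and the toy reads are hypothetical and are not taken from the study.

```python
from collections import Counter


def size_class_abundance(reads, min_len=18, max_len=26):
    """Return {length: fraction of reads} for reads within [min_len, max_len].

    Reads outside the length window are dropped first, mirroring the
    "parsed (18-26 nt)" filtering described for Table 1.
    """
    lengths = Counter(len(r) for r in reads if min_len <= len(r) <= max_len)
    total = sum(lengths.values())
    if total == 0:
        return {n: 0.0 for n in range(min_len, max_len + 1)}
    return {n: lengths.get(n, 0) / total for n in range(min_len, max_len + 1)}


# Toy input (hypothetical sequences): five 21-nt, three 24-nt, two 22-nt reads,
# plus one 30-nt read that falls outside the window and is ignored.
toy_reads = ["A" * 21] * 5 + ["C" * 24] * 3 + ["G" * 22] * 2 + ["T" * 30]
abundance = size_class_abundance(toy_reads)
for length, fraction in abundance.items():
    print(f"{length} nt: {fraction:.0%}")
```

Applying the same function only to reads that mapped to the virus database would give a profile analogous to Figure 2.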


[Figure 2: plot of relative abundance (y axis, 0% to 60%) against small RNA length (x axis, 18 to 26 nt) for the Tomato, Melon, Zucchini, and Onion samples.]

Figure 2. Relative abundance of small RNA size classes from small RNAs mapped to the virus sequence database. For all samples, most of the virus-mapped reads belong to the 21 and 22 nt size classes. These small RNAs, usually associated with post-transcriptional gene silencing, are likely generated upon viral infection.

[Figure 3: plots of read abundance (y axis; blue: positive strand; red: negative strand) against genome/gene base coverage (x axis), in seven panels:
A. Tomato/Tomato spotted wilt virus, segment S
B. Tomato/Potato virus Y
C. Melon/Watermelon mosaic virus
D. Zucchini/Watermelon mosaic virus
E. Zucchini/Soybean mosaic virus
F. Onion/Iris yellow spot virus, segment L
G. Onion/Tomato yellow fruit ring virus, partial L gene]

Figure 3. Validation of virus-mapped small RNA reads to their respective matching viral sequences. The top two virus matches for each sample were plotted against small RNAs mapped to them. Except for E and G, the small RNAs are mapped across the entire virus sequences. Therefore, there is a high probability that such virus is present in the samples. On the contrary, only sparse matches were found on E and G, which suggests that these mapped small RNAs were derived from different sources.
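The distinction drawn in Figure 3 between reads spread across a whole viral sequence and sparse, localized matches can be illustrated with a per-base coverage calculation. The sketch below is an assumption about how such a plot could be derived, not the authors' pipeline: the alignment tuple layout, the function names, and the toy alignments are all hypothetical.

```python
def strand_coverage(alignments, genome_length):
    """Compute per-base read abundance for each strand.

    alignments: iterable of (start, length, strand, count) tuples, where
    start is 1-based and strand is "+" or "-". Negative-strand abundance is
    stored as negative numbers so both strands can share one axis.
    """
    plus = [0] * genome_length
    minus = [0] * genome_length
    for start, length, strand, count in alignments:
        # Clip alignments that would run past the end of the reference.
        end = min(start - 1 + length, genome_length)
        for pos in range(start - 1, end):
            if strand == "+":
                plus[pos] += count
            else:
                minus[pos] -= count
    return plus, minus


def covered_fraction(plus, minus):
    """Fraction of reference positions covered by at least one read."""
    covered = sum(1 for p, m in zip(plus, minus) if p != 0 or m != 0)
    return covered / len(plus)


# Toy example on a hypothetical 10-nt reference.
toy_alignments = [(1, 4, "+", 3), (3, 4, "-", 2), (9, 5, "+", 1)]
plus, minus = strand_coverage(toy_alignments, genome_length=10)
print(plus, minus, covered_fraction(plus, minus))
```

A covered fraction close to 1 would correspond to the pattern described for panels A to D and F, while a low value would correspond to the sparse matches described for panels E and G.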


[Figure 4: two panels plotting counts on a logarithmic y axis against small RNA length (x axis, 18 to 26 nt). A: number of reads. B: number of unique reads. Series: All, Tomato genome-mapped, Virus-mapped, Overlap.]

Figure 4. Categories of reads and unique reads in small RNA size classes from the tomato sample. The total number of reads (A) and the number of unique reads (B) of tomato small RNAs that matched perfectly to the in-house assembled reference tomato genome (red line), the virus genome database (green line), or both (purple line). Together they account for 78% (Table 3) of all reads (blue line).


Sample ID  Species               Tissue                   Raw reads  Parsed reads, total  Parsed reads, unique  Virus-mapped reads, total  Virus-mapped reads, unique
Tomato     Solanum lycopersicum  Tomato RNA (fruit)       5315393    3609317              1209206               196069                     23929
Melon      Cucumis melo          Melon RNA (fruit)        4825754    1636524              536672                185                        175
Zucchini   Cucurbita pepo        Zucchini RNA (fruit)     5184745    3344051              838271                191085                     22949
Onion      Allium cepa           Onion RNA (bulb + stem)  6038349    3145549              915603                301913                     33144

Table 1. Total numbers of small RNA reads obtained from each sample library and the numbers of reads that mapped to the virus sequences. Only parsed (18-26 nt) reads were used in the mapping.


Sample    Virus species matched                                   GenBank accession  Mapped reads, total  Mapped reads, unique
Tomato    Tomato spotted wilt virus (TSWV), segment S             HQ402595           100079               6211
Tomato    Potato virus Y (PVY)                                    HQ912865           12076                4040
Melon     Watermelon mosaic virus (WMV)                           FJ623474           128                  122
Zucchini  Watermelon mosaic virus (WMV)                           EU660583           146333               17542
Zucchini  Soybean mosaic virus (SMV)                              D00507             3607                 419
Onion     Iris yellow spot virus (IYSV), segment L                FJ623474           108325               15979
Onion     Tomato yellow fruit ring virus (TYFRV), partial L gene  AJ493271           802                  91

Table 2. The top two virus matches and the numbers of small RNA reads matched in each sample library. The melon sample had only one virus match. For viruses that have more than one genome segment, only one segment is shown in the table, though all other segments had similarly significant matches.


Sample: Tomato

Category                    Total reads  Unique reads
All reads                   3609317      1209206
Tomato genome-mapped reads  2596723      824996
Virus-mapped reads          196069       23929
Overlap reads               2374         676

Table 3. The number of total and unique small RNA reads in each category in the tomato sample.
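The categories in Table 3 amount to set membership tests on each distinct read sequence. The following is a minimal sketch under stated assumptions: reads are collapsed to a dictionary of sequence to count, the sets of perfectly matching sequences are already known, and overlap reads are counted once when combining categories. The function names and toy data are hypothetical and do not reproduce the study's numbers.

```python
def categorize_reads(read_counts, genome_hits, virus_hits):
    """Tally total and unique reads per category.

    read_counts: {sequence: number of reads with that sequence}
    genome_hits / virus_hits: sets of sequences with a perfect match to the
    host genome or to the virus database, respectively.
    """
    def tally(selected):
        seqs = [s for s in read_counts if s in selected]
        return {"total": sum(read_counts[s] for s in seqs), "unique": len(seqs)}

    return {
        "all": tally(set(read_counts)),
        "genome": tally(genome_hits),
        "virus": tally(virus_hits),
        "overlap": tally(genome_hits & virus_hits),
    }


def mapped_fraction(cats):
    """Fraction of all reads mapped to either reference (overlap counted once)."""
    mapped = (cats["genome"]["total"] + cats["virus"]["total"]
              - cats["overlap"]["total"])
    return mapped / cats["all"]["total"]


# Toy data (hypothetical sequences and counts).
toy_counts = {"AAA": 10, "CCC": 5, "GGG": 3, "TTT": 2}
cats = categorize_reads(toy_counts, genome_hits={"AAA", "CCC"},
                        virus_hits={"CCC", "GGG"})
print(cats, mapped_fraction(cats))
```

Whether the reported share of mapped reads was computed with or without subtracting the overlap is not stated in the legend, so the subtraction here is only one reasonable choice.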


Sample    Number of human RNAs matched  Number of unique reads  Total reads  % of total virus-mapped reads sharing homology with human genes
Tomato    1580                          1786                    21816        11.1%
Melon     43                            22                      22           11.9%
Zucchini  884                           1558                    16634        8.7%
Onion     1399                          1936                    18005        6.0%

Table 4. Virus-mapped small RNA reads sharing sequence homology to human reference RNA set. The virus-mapped reads, 21- and 22-nt size classes, in each sample (Figure 2) were matched against the human reference RNA set downloaded from genome.ucsc.edu (02/07/2012), allowing up to two or three mismatches for each 21- and 22-nt small RNA read, respectively.
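The mismatch thresholds in the Table 4 legend (up to two mismatches for 21-nt reads, up to three for 22-nt reads) can be illustrated with an ungapped sliding-window comparison. This is a sketch only: the alignment tool actually used is not named in this legend, the sequences below are invented, and only same-strand matching is shown (whether reverse complements were also searched is not stated here).

```python
# Allowed mismatches per read length, taken from the Table 4 legend.
MAX_MISMATCHES = {21: 2, 22: 3}


def hamming(a, b):
    """Number of positions at which two equal-length sequences differ."""
    return sum(x != y for x, y in zip(a, b))


def find_matches(read, transcript):
    """Return 1-based start positions where the read matches the transcript
    without gaps and within the mismatch limit for its length.

    Reads of any other length return an empty list.
    """
    limit = MAX_MISMATCHES.get(len(read))
    if limit is None:
        return []
    n = len(read)
    return [i + 1 for i in range(len(transcript) - n + 1)
            if hamming(read, transcript[i:i + n]) <= limit]


# Hypothetical 21-nt read and a target site differing at two positions.
read21 = "ATGGCGTACCTTAGGCATCGA"
site_2mm = "CTGGCGTACCATAGGCATCGA"
transcript = "TTTTT" + site_2mm + "TTTTT"
hits = find_matches(read21, transcript)
print(hits)
```

A production analysis would normally rely on a dedicated short-read aligner rather than this quadratic scan; the sketch is only meant to make the matching criterion concrete.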


Sample    GenBank ID    Region matched        Number of unique reads  Total reads  Human RNA matched
Tomato    NM_001410     10253-10274           20                      3451         Multiple EGF-like-domains 8 (MEGF8), mRNA
Tomato    NM_001145065  5152-5170             1                       2419         Family with sequence similarity 190, member A (FAM190A), transcript variant 1, mRNA
Tomato    NM_004850     4525-4545, 997-1017   6                       2337         Rho-associated, coiled-coil containing protein kinase 2 (ROCK2), mRNA
Zucchini  NM_006784     3424-3444             6                       1019         WD repeat domain 3 (WDR3), mRNA
Zucchini  NM_002924     1273-1295             14                      904          Homo sapiens regulator of G-protein signaling 7 (RGS7), mRNA
Zucchini  NM_145239     1444-1464             4                       462          Proline-rich transmembrane protein 2 (PRRT2), transcript variant 1, mRNA
Onion     NR_027338     639-659               6                       1043         Triosephosphate isomerase 1 pseudogene 3 (TPI1P3), non-coding RNA
Onion     NM_001160325  635-654               10                      941          Olfactory receptor, family 6, subfamily P, member 1 (OR6P1), mRNA
Onion     NR_037923     2962-2983, 3037-3056  11                      689          DYX1C1-CCPG1 readthrough (non-protein coding) (DYX1C1-CCPG1), non-coding RNA

Table 5. Examples of human RNAs matching the sequence of virus-mapped small RNA reads. Except for the melon sample, where only a few matches were found, the top three human RNA matches in each sample are listed. The majority of the reads matched a single sub-region of the transcript, shown using the base numbering referenced in GenBank.


Sample    Number of human RNAs matched  Number of unique reads  Total reads  % of total non-virus-mapped reads sharing homology with human genes
Tomato    13683                         33189                   337610       9.4%
Melon     8664                          14961                   157019       9.6%
Zucchini  8286                          14633                   144654       4.3%
Onion     13851                         34193                   342541       10.9%

Table 6. Non-virus-mapped small RNA reads sharing sequence homology to the human reference RNA set. The virus-mapped small RNA reads were removed from each small RNA library. The remaining small RNA reads, presumably derived from endogenous sequences, were matched against the human reference RNA set downloaded from genome.ucsc.edu. The miRNAs, ribosomal RNAs, small nucleolar RNAs, and small nuclear RNAs were excluded from the human reference RNA set, and up to two mismatches were allowed for each plant small RNA. The number of human genes matched, as well as the numbers of unique and total small RNA reads whose sequences matched these human genes, are shown.
