Vitis vinifera L. Single-Nucleotide Polymorphism Detection with High

Oct 1, 2015 - A total of 58 SNPs and 1 INDEL were found among the UFGT gene, allowing the discrimination of 18 different genotypes within the 22 grape...
1 downloads 10 Views 935KB Size
Subscriber access provided by University of Illinois at Urbana-Champaign Library

Article

Vitis vinifera L. single nucleotide polymorphism detection with High Resolution Melting analysis based on UDPglucose: flavonoid 3- O-glucosyltransferase gene Leonor Pereira, and Paula Martins-Lopes J. Agric. Food Chem., Just Accepted Manuscript • DOI: 10.1021/acs.jafc.5b03463 • Publication Date (Web): 01 Oct 2015 Downloaded from http://pubs.acs.org on October 3, 2015

Just Accepted “Just Accepted” manuscripts have been peer-reviewed and accepted for publication. They are posted online prior to technical editing, formatting for publication and author proofing. The American Chemical Society provides “Just Accepted” as a free service to the research community to expedite the dissemination of scientific material as soon as possible after acceptance. “Just Accepted” manuscripts appear in full in PDF format accompanied by an HTML abstract. “Just Accepted” manuscripts have been fully peer reviewed, but should not be considered the official version of record. They are accessible to all readers and citable by the Digital Object Identifier (DOI®). “Just Accepted” is an optional service offered to authors. Therefore, the “Just Accepted” Web site may not include all articles that will be published in the journal. After a manuscript is technically edited and formatted, it will be removed from the “Just Accepted” Web site and published as an ASAP article. Note that technical editing may introduce minor changes to the manuscript text and/or graphics which could affect content, and all legal disclaimers and ethical guidelines that apply to the journal pertain. ACS cannot be held responsible for errors or consequences arising from the use of information contained in these “Just Accepted” manuscripts.

Journal of Agricultural and Food Chemistry is published by the American Chemical Society. 1155 Sixteenth Street N.W., Washington, DC 20036 Published by American Chemical Society. Copyright © American Chemical Society. However, no copyright claim is made to original U.S. Government works, or works produced by employees of any Commonwealth realm Crown government in the course of their duties.

Page 1 of 33

Journal of Agricultural and Food Chemistry

1 2 3 4

Vitis vinifera L. single nucleotide polymorphism detection with High Resolution Melting analysis based on UDP-glucose: flavonoid 3- Oglucosyltransferase gene

5 6

Leonor Pereira, Paula Martins-Lopes*

7

University of Trás-os-Montes and Alto Douro, P.O. Box 1013, 5000-911 Vila Real,

8

Portugal

9

University of Lisboa, Faculty of Sciences, BioISI – Biosystems & Integrative Sciences

10

Institute, Campo Grande, Lisboa, Portugal

11 12

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

13

ABSTRACT

14

Vitis vinifera L. is species with a large number of varieties, which differ in terms of

15

anthocyanin content. The genes involved on the anthocyanin biosynthesis pathway have

16

a direct effect in the anthocyanin profile of each variety, being potentially interesting for

17

varietal identification. The current study aimed at the design of an assay suitable for the

18

discrimination of the largest number of grapevine varieties. Two genes of the

19

anthocyanin pathway, chalcone isomerase (CHI) and UDP-glucose: flavonoid 3- O-

20

glucosyltransferase (UFGT), were sequenced in 22 grapevine varieties. The CHI gene

21

presented 5 SNPs within the sequence. A total of 58 SNPs and 1 INDEL were found

22

among the UFGT gene, allowing the discrimination of 18 different genotypes within the

23

22 grapevine varieties. A HRM assay designed for UFGT, containing 704 bp produced

24

differentiated melting-curves for each of the 18 haplotypes. The developed HRM assay

25

is efficient in the grapevine varietal discrimination.

26 27

Keywords

28

Vitis vinifera L., Anthocyanins, UFGT gene, SNP, HRM

29

ACS Paragon Plus Environment

Page 2 of 33

Page 3 of 33

Journal of Agricultural and Food Chemistry

30

INTRODUCTION

31

In the Vitaceae family, the Vitis genus is agronomically very important. Within this

32

genus, the only European species, Vitis vinifera L., represents one of the oldest

33

domesticated plants and is extremely relevant in the wine industry. The high

34

adaptability of the V. vinifera species to different environmental conditions1 makes it

35

difficult to unequivocally identify the grapevine varieties. The traditional methods used

36

in the identification and differentiation of grapevine varieties, based on ampelography

37

and ampelometry, are dependent on the plants’ phenology that is influenced by

38

environmental, phytosanitary and nutritional conditions. Nowadays there are several

39

molecular marker based methods developed to guarantee the grapevine variety

40

identification and they have been extended to must and wine samples, where

41

morphological characterization is not applicable.2

42

Among the main molecular marker systems, Simple Sequence Repeats (SSR) markers

43

represent one of the most suitable genetic tools currently adopted by the international

44

scientific community to define a grapevine variety.3 Recently, based on the whole

45

genome sequencing of 12X V. vinifera PN40024/reference genome4, sequence-based

46

molecular markers, as Single Nucleotide Polymorphism (SNP), have been generated.

47

According to Cabezas and collaborators5 SNP markers present several advantages

48

concerning varietal identification, namely: (1) mostly bi-allelic; (2) abundant

49

throughout the genome; (3) relatively stable during evolution and; (4) low mutation rate.

50

Furthermore, SNPs can be easily reproduced between laboratories and when using

51

different detection methodologies, since the different alleles are not distinguished based

52

on size but on the nucleotide presence at a given position. These features, in

53

combination with their high availability, makes SNPs the most popular marker system

54

for several genetic analyses.

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

55

High-resolution melting (HRM) represents a method that enables the genotyping of

56

SNPs in a large number of samples. The principle of HRM analysis is based on the

57

generation of different melting-curve profiles due to the sequence variation present in

58

the double-stranded DNA. Single-nucleotide changes represent the smallest genetic

59

variation and are divided into four classes, distinguished by the different melting

60

temperature (Tm) shifts they produce.6 SNP class 1 involves C/T and G/A, and SNP

61

class 2 involves C/A and G/T, base exchanges that are easily genotyped by HRM due to

62

Tm differences over 0.5 ºC.6 In contrast, in SNP class 3, only C/G base exchange occurs,

63

and SNP class 4 is described by A/T base exchange, producing very small Tm

64

differences (< 0.4 ºC for SNP class 3 and 3 bp). The

(Figure

1)

ACS Paragon Plus Environment

and

CHIfwd

(5’-

Page 7 of 33

Journal of Agricultural and Food Chemistry

129

PCR reactions were performed in a 20 µL volume, containing 20 ng of genomic DNA,

130

1X PCR buffer, 25 mM MgCl2, 0.2 mM dNTPs, 0.2 µM of each primer and 0.3 U of

131

Taq DNA polymerase (Roche). The reactions were incubated at 94 °C for 3 min,

132

followed by 30 cycles of 94 °C/1 min, 58 °C/1 min, 72 °C/2 min and a final step of 72

133

°C for 10 min.

134

PCR amplicons resulting from the amplification of UFGT and CHI genes were directly

135

sequenced (STAB VIDA; http://www.stabvida.com) and the genotypes of the twenty-

136

two grapevine varieties were obtained. Sequence alignment was performed using the

137

BioEdit program (http://www.mbio.ncsu.edu/BioEdit/BioEdit.html). The alignments of

138

reverse and forward sequences were applied to produce consensus sequences. The

139

sequences of each individual DNA fragments were aligned with original sequence to

140

identify the SNP presence.

141 142

High-resolution melting assay design.

143

HRM analysis was performed in a specific 704 bp fragment of the UFGT gene using

144

primers U_HRMfwd (5´- GCAATGTAATATCAAGTCC -3´) (Starting at 180 bp) and

145

U_HRMrev (5´-TTTCTTTCTTTGAGCCATT-3´) (Ending at 884 bp) (Figure 1).

146

PCR and HRM analysis was performed in a StepOne™ Real-Time PCR System

147

(Applied Biosystems®, California, USA) in final volume of 20 µL containing the

148

respective primer pair (5 pmol of each primer), 20 ng of gDNA and the MeltDoctor™

149

HRM Master Mix (Applied Biosystems®, California, USA). PCR/HRM included an

150

initial step of 10 min at 95 ºC and 40 cycles of 30 s at 95 ºC, 30 s at 58 ºC and 30 s at 72

151

ºC. The melting curve was obtained in continuous, performed as follow: 30 s at 95 ºC, 1

152

min at 65 ºC, 15 s at 95 ºC, rising 0.3 ºC/s and 15 s at 65 ºC. All reactions were

153

performed in triplicate. A High Resolution Melt Software v3.0.1 (Applied

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

Page 8 of 33

154

Biosystems®, California, USA) was used to analyze the data. After normalization and

155

determining the temperature shift, the different melting curves of the several plots were

156

generated.

157

In order to validate the reference HRM profile for each grapevine variety, DNA from

158

the clones were analyzed using the developed HRM assay, using the previously

159

described conditions.

160 161

Phylogenetic analysis.

162

Predicted amino acid sequences were used for phylogenetic analysis using COBALT

163

multiple sequence alignment tool which takes into consideration the conserved domain

164

database,

165

va.ncbi.nlm.nih.gov/tools/cobalt/re_cobalt.cgi).26

166

RESULTS AND DISCUSSION

167

SNP identification.

168

The CHI gene is located in a central position in the anthocyanin pathway. This gene was

169

sequenced in twenty-two grapevine varieties (eighteen Portuguese and four

170

international) revealing the existence of only five SNPs among all these varieties (data

171

not shown). The low number of SNPs detected within this gene is in accordance with

172

previous studies making it a non-interesting marker for varietal identification

173

purposes.11, 27

174

On the contrary, the UFGT gene revealed to be highly polymorphic presenting a high

175

number of SNPs among the grapevine varieties studied. The UFGT gene length covered

176

1500 bp and within this region a total of 58 SNPs and an insertion were detected (Table

177

2). Eighteen SNPs and the insertion were found within the Exon 1 region (493 bp) with

protein

motif

database

and

sequence

similarity

ACS Paragon Plus Environment

(http://www.st-

Page 9 of 33

Journal of Agricultural and Food Chemistry

178

an average frequency of 1 SNP/25.9 nucleotides. Four SNPs were found in the Intron

179

(74 bp) with an average frequency of 1 SNP/18.5 nucleotides. The remaining 36 SNPs

180

were positioned within the Exon 2 region (881 bp) with an average frequency of 1

181

SNP/24.5 nucleotides. These results are interesting since the number of SNPs detected

182

within the coding-region is high, contrary to previous studies (sunflower28, cotton29, and

183

grapevine30) where the majority of the SNPs were located in the non-coding region of

184

the genes. The high SNP frequency found within this gene is particularly interesting

185

since UFGT has been previously associated with several different types of

186

anthocyanins11, which are directly involved in the skin berries color and the grapes

187

organoleptic characteristics.

188 189

High resolution melting-curve PCR analysis.

190

CHI gene was not considered for the HRM approach since the number of SNPs found

191

did not allow the discrimination of the varieties under study.

192

The HRM assay was designed for UFGT gene considering a fragment size of 704 bp.

193

The UFGT HRM fragment included 32 of the 58 SNPs present and the detected

194

insertion, considering the twenty-two grapevine varieties under study, with an average

195

frequency of 1 SNP/22 nucleotides. Considering the same fragment length Nicolè et

196

al.31 reported a frequency of 1 SNP/31.69 nucleotides, revealing that this gene is highly

197

polymorphic, and therefore interesting for varietal identification.

198

Fragment size influences the sensitivity of subsequent HRM analysis, and it is usually

199

advisable that they do not exceed 300 bp of length.32 In addition, long amplicons may

200

contain several melting domains, resulting in complex melting profiles meaning that

201

longer amplicons represent small differences in the melting curve caused by small

202

sequence variation. In plants, the studies using HRM report a PCR amplicon range of 50

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

33-35

Page 10 of 33

203

to 260 bp.8,

204

fragment size (704 bp). Therefore this method is quite promising as it broadens the

205

genotyping potential using HRM analysis. The UFGT targeted sequence generated

206

eighteen different melting curve profiles (Figure 2). The Tm values found within all the

207

samples were similar (Tm1 - 82.7 - 83.0 ºC; Tm2 – 86.6 - 86.9 ºC), not distinguishing the

208

different haplotypes. However, the shape of the melting curves was very informative,

209

and even when the grapevine variety sequence only differed by a single SNP, this

210

variation was detectable for: (a) CS, Sou and Vio and; (b) FP, TR and MF grapevine

211

varieties (Table 2). Additionally, all these SNPs belong to Class 4, A/T, which is the

212

most challenging genotype variation mainly due to the difficulty associated with the

213

difference in Tm.7

214

These results demonstrated the power, sensitivity and specificity of this particular HRM

215

assay that allows the identification of several complex genotypes and subsequently the

216

detection of different melting curves (Figure 2). The assay was successful in the

217

definition of the several haplotypes once it was based on a combination of a high

218

number of SNPs within the amplified fragment.

219

Even though the assay is based on a long fragment, this HRM assay proved to be highly

220

sensitive and was able to distinguish 18 haplotypes based on a combination of 33

221

nucleotide differences in a unique assay. Previous studies reported the detection of

222

several SNPs and INDELs but always considering simple events33,

223

combination of nucleotide differences. Never, until now, has it been reported that by

224

using a unique assay it was possible to detect such a high number of events in one

225

reaction. The designed HRM assay proved to be powerful and specific.

226

The shape of HRM curve for each grapevine variety needed to be further validated in

227

relation to their varietal specification, once there are a high number of clones available

To our knowledge this is the first paper considering such a large

ACS Paragon Plus Environment

36-39

, and not a

Page 11 of 33

Journal of Agricultural and Food Chemistry

228

within each grapevine variety.40 Therefore, DNA samples from different clones of each

229

grapevine variety were used and tested for reproducibility. The plants were sampled in

230

established clonal field collections, maintained jointly in private and public institutions,

231

including samples of different regions of the country. The melting-curve profiles

232

obtained for each variety were coincident among clones of the same haplotype,

233

indicating that such an assay can be used in the genotype identification of these

234

particular grapevine varieties (Figure 3). The shape of the melting curve profiles of five

235

red grapevine varieties used in the Douro region are presented, as an example in Figure

236

3. Among them, Cabernet Sauvignon (CS) is used as an international grapevine

237

reference variety. The clones of each grapevine variety present the same genetic profile,

238

reinforcing the assay’s robustness.

239

Four of the twenty-two grapevine varieties studied (Tinta Francisca, Alicante Bouschet,

240

Côdega do Larinho and Tinta Amarela) could not be distinguished using this particular

241

HRM assay (Figure 4), once they presented the same sequence for this particular region

242

(Table 2). The sequence of these grapevine varieties only differ from Touriga Franca

243

(TF), in a unique SNP at the 425 bp position (Table 2) which is discriminated using this

244

particular assay.

245

The robustness of the HRM assay could also be confirmed by the profiles obtained in

246

the clones of the undistinguishable grapevine varieties herein tested (Figure 4), once all

247

the samples tested presented the same shape HRM melting curve, as expected.

248

Considering all the nucleotide sequences, in particularly the SNPs found in the 22

249

grapevine varieties, the amino acid sequences were deduced and aligned. A total of 31

250

amino acid residues variations were found within the grapevine varieties studied (Table

251

3).

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

252

UFGT gene expression is correlated with the grapevine phenotype and its transcription

253

is needed for berry pigmentation (no color/red color).41, 42 The anthocyanin profile and

254

content can deeply influence the final wine quality.43 Considering the variation of the

255

red grapevine varieties amino acid sequences studied, a Cobalt tree was constructed,

256

giving rise to three major groups (Figure 5). Group A included CS, Sou, TR and Ruf;

257

Group B incorporated M, TB, TBr and TN; Group C comprised TFi, AB, TA, TF, DT

258

and TC. The Pinot Noir clones analyzed were completely distinguished from all the

259

grapevine varieties, and therefore they generated an independent branch. The

260

differences found between the amino acid sequence of Pinot Noir and the sequence of

261

the NCBI database were expected, once the NCBI reference genome is a hybrid of Pinot

262

Noir.4 Touriga Brasileira presented the same amino acid sequence as the reported in the

263

NCBI database relatively to the UFGT locus.

264

Although the UFGT gene expression is related with the anthocyanin profile of the

265

grapes41 there is a lack of information of the specific profile of each grapevine variety.

266

Some of the most widely used grapevine varieties have been characterized, among them

267

Pinot Noir, Merlot and Cabernet Sauvignon.

268

Pinot Noir grapes have a differentiated anthocyanin profile, with only monoglucosides

269

anthocyanins, in contrast with other grapevine varieties (Merlot, Cabernet Sauvignon,

270

Touriga Franca, Tinta Roriz, Touriga Nacional, Sousão, Rufete, Tinta Amarela, Tinta

271

Barroca, Tinto Cão) that present all the three types of anthocyanin (monoglucosides,

272

acetates and coumarates).44-47 This may justify the outstanding position of Pinot Noir in

273

relation to the other grapevine varieties studied. However, the amino acid variation

274

present within the UFGT sequence has a direct implication on the protein composition,

275

which may or not influence the protein function.

ACS Paragon Plus Environment

Page 12 of 33

Page 13 of 33

Journal of Agricultural and Food Chemistry

276

The results obtained through this work represent a good landmark in this particular area,

277

and should be considered as a suitable platform for further studies. The deduced amino

278

acid sequences sets the bases for a better understanding of the correlation between

279

anthocyanin content and the amino acid profiles.

280

The present work confirmed the low level of polymorphism of the CHI gene and

281

revealed a high level of polymorphism present within the UFGT gene of grapevine. The

282

UFGT gene allowed the discrimination of 18 different grapevine genotypes among 22

283

grapevine varieties. From the 58 SNPs and 1 INDEL a total of 31 amino acid residual

284

changes were predicted, enhancing the potential effect on the anthocyanin profile of

285

each variety, which opens doors for further omic studies considering this particular

286

gene.

287

This study provides the first report on the use of a large fragment in HRM assays,

288

thereby allowing the detection of multiple events (SNP and INDEL) in a unique assay,

289

which can be adapted for large-scale genotyping and mapping in Vitis. This assay can

290

be extended for grapevine varietal certification procedures, which are imperative

291

throughout the entire wine-chain (plant nurseries to wine), once it allows varietal

292

identification.

293 294

Accession numbers

295

Sequence data from this article can be found in the Gen-Bank database under the

296

accession numbers 1868281767; 1868281812; 1868281815; 1868281817; 1868281828; 1868281830;

297

1868281834; 1868281836; 1868281838; 1868281843; 1868281845; 1868281852; 1868281855;

298

1868281862; 1868281865; 1868281867; 1868281870; 1868281872; 1868281874; 1868281876;

299

1868281878; 1868281881; 1868281883; 1868281885; 1868281887; 1868281892; 1868281894;

300

1868281897; 1868281899; 1868281901; 1868281903; 1868281905; 1868281907; 1868281909;

301

1868281911; 1868281913; 1868281915; 1868281917; 1868281919; 1868281921; 1868281924;

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

302

1868281926; 1868281928; 1868281930; 1868281933; 1868281935; 1868281937; 1868281939;

303

1868281941; 1868281943; 1868281945; 1868281947; 1868281949; 1868281951; 1868281953;

304

1868281955; 1868281957; 1868281960; 1868281963.

305 306

Author information

307

Corresponding Author

308

*(P. M.-L.) Phone: ++351259350936 . Fax: + +351259350572

309

E-mail: [email protected]

310

Funding

311

This research was supported by the Portuguese Foundation for Science and Technology

312

(FCT) in the project Biosensor Development for Wine Traceability in the Douro Region

313

– WineBioCode PTDC/AGR-ALI/117341/2010-FCOMP-01-0124-FEDER-019439 and

314

a PhD grant (SFRH/BD/44781/2008).

315 316

Acknowledgements

317

The authors are grateful to Prof. Ântero Martins, Doctor José Eiras-Dias (INIAV), Dr.

318

Paulo Costa (ADVID) for helping us in the clonal leaf collection, to the Sogrape Vinhos

319

S.A., Real Companhia Velha, Sociedade Borges S.A., Direção Regional de Agricultura e

320

Pescas do Norte for letting us collect the samples in their fields.

321 322

REFERENCES

323 324

(1) Dal Santo, S.; Tornielli, G. B.; Zenoni, S.; Fasoli, M.; Farina, L.; Anesi, A.;

325

Guzzo, F.; Delledonne, M.; Pezzotti, M. The plasticity of the grapevine berry

326

transcriptome. Genome Biol. 2013, 14, R54.

327

ACS Paragon Plus Environment

Page 14 of 33

Page 15 of 33

Journal of Agricultural and Food Chemistry

328 329

(2) Pereira, L.; Guedes-Pinto, H.; Martins-Lopes, P. An Enhanced Method for Vitis vinifera L., DNA Extraction from Wines. Am. J. Enol. Vitic. 2011, 62, 247-252.

330 331 332

(3) Organisation Internationale de la Vigne et du Vin (OIV). 2nde Édition de la Liste des Descripteurs OIV pour les Variétés et Espèces de Vitis. 2009, Ed. OIV, Paris.

333 334

(4) Jaillon, O.; Aury, J.-M.; Noel, B.; Policriti, A.; Clepet, C. The grapevine genome

335

sequence suggests ancestral hexaploidization in major angiosperm phyla. Nature 2007,

336

449, 463-467.

337 338 339

(5) Cabezas, J. A.; Ibáñez, J.; Lijavetzky, D.; Vélez, D.; Bravo, G. et al. A 48 SNP set for grapevine cultivar identification. BMC Plant Biol. 2011, 11, 153.

340 341

(6) Liew, M.; Pryor, R.; Palais, R.; Meadows, C.; Erali, M.; Lyon, E.; Wittwer, C.

342

Genotyping of single-nucleotide polymorphisms by high-resolution melting of small

343

amplicons. Clin. Chem. 2004, 50, 1156-64.

344 345

(7) Pietzka, A. T.; Indra, A.; Stöger, A.; Zeinzinger, J.; Konrad, M.; Hasenberger,

346

P.; Allerberger, F.;

Ruppitsch, W. Rapid identification of multidrug-resistant

347

Mycobacterium tuberculosis isolates by rpoB gene scanning using high-resolution

348

melting curve PCR analysis. J. Antimicrob. Chemother. 2009, 63, 1121–1127.

349 350

(8) Han, Y.; Khu, D.-M.; Monteros, M. J. High-resolution melting analysis for SNP

351

genotyping and mapping in tetraploid alfalfa (Medicago sativa L.). Mol. Breed. 2012,

352

29, 489-501.

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

353 354 355

(9) Erali, M.; Wittwer, C. T. High resolution melting analysis for gene scanning. Methods 2010, 50, 250-261.

356 357

(10) Mackay, J. F.; Wright, C. D.; Bonfiglioli, R. G. A new approach to varietal

358

identification in plants by microsatellite high resolution melting analysis: application to

359

the verification of grapevine and olive cultivars. Plant Methods 2008, 4, 8.

360 361

(11) Cardoso, S.; Lau, W.; Eiras-Dias, J.E.; Fevereiro, P.; Maniatis, N. A Candidate-

362

Gene Association Study for Berry Colour and Anthocyanin Content in Vitis vinifera L..

363

PLoS ONE 2012, 7, 9.

364 365

(12) Castellarin, S. D.; Di Gaspero, G. Transcriptional control of anthocyanin

366

biosynthetic genes in extreme phenotypes for berry pigmentation of naturally occurring

367

grapevines. BMC Plant Biol. 2007, 7, 46.

368 369

(13) Mattivi, F.; Guzzon, R.; Vrhovsek, U.; Stefanini, M.; Velasco, R. Metabolite

370

profiling of grape: flavonols and anthocyanins. J. Agric. Food Chem. 2006, 54, 7692-

371

7702.

372 373

(14) Boss, P.; Davies C.; Robinson S. Analysis of the expression of anthocyanin

374

pathway genes in developing Vitis vinifera L. cv Shiraz grape berries and the

375

implications for pathway regulation. Plant Physiol. 1996, 111, 1059-1066.

376

ACS Paragon Plus Environment

Page 16 of 33

Page 17 of 33

Journal of Agricultural and Food Chemistry

377 378

(15) Holton, T.A., Cornish, E.C. Genetics and biochemistry of anthocyanin biosynthesis. Plant Cell 1995, 7, 1071-1083.

379 380 381

(16) Grotewold, E. The genetics and biochemistry of floral pigments. Annu. Rev. Plant Biol. 2006, 57, 761-780.

382 383

(17) Velasco, R.; Zharkikh, A.; Troggio, M.; Cartwright, D.A.; Cestaro, A. et al. A

384

High Quality Draft Consensus Sequence of the Genome of a Heterozygous Grapevine

385

Variety. PLoS ONE 2007, 2, 12.

386 387

(18) Sparvoli, F.; Martin, C.; Scienza, A.; Gavazzi, G.; Tonelli, C. Cloning and

388

molecular analysis of structural genes involved in flavonoid and stilbene biosynthesis in

389

grape (Vitis vinifera L.). Plant Mol. Biol. 1994, 24, 743-755.

390 391

(19) Kuhn, N.; Guan, L.; Dai, Z. W.; Wu, B.-H.; Lauvergeat, V.; Gomès, E.; Li, S.-

392

H.; Godoy, F.; Arce-Johnson, P.; Delrot, S. Berry ripening: recently heard through the

393

grapevine. J. Exp. Bot. 2014, 65, 4489-4490.

394 395

(20) Zheng, Y.; Li, J. H.; Xin, H. P.; Wang, N.; Guan, L.; Wu, B. H.; Li, S. H.

396

Anthocyanin profile and gene expression in berry skin of two red Vitis vinifera grape

397

cultivars that are sunlight dependent versus sunlight independent. Aust. J. Grape Wine

398

Res. 2013, 19, 238-248.

399 400

(21) Kobayashi, S.; Ishimaru, M.; Ding, C.K.; Yakushiji, H.; Goto, N. Comparison of

401

UDP-glucose: flavonoid 3-O-glucosyltransferase (UFGT) gene sequences between

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

402

white grapes (Vitis vinifera) and their sports with red skin. Plant Sci. 2001, 160, 543-

403

550.

404 405

(22) Cutanda-Perez, M.-C.; Ageorges, A.; Gomez, C.; Vialet, S.; Terrier, N.;

406

Romieu, C.; Torregrosa, L. Ectopic expression of VlmybA1 in grapevine activates a

407

narrow set of genes involved in anthocyanin synthesis and transport. Plant Mol. Biol.

408

2009, 69, 633-648.

409 410 411

(23) Doyle, J. J.; Doyle, J. L. Isolation of plant DNA from fresh tissue. Focus 1990, 12, 13-15.

412 413

(24) Shimada, N.; Aoki, T.; Sato, S.; Nakamura, Y.; Tabata, S.; Ayabe, S. A cluster

414

of genes encodes the two types of chalcone isomerase involved in the biosynthesis of

415

general flavonoids and legume-specific 5-deoxy(iso)flavonoids in Lotus japonicas.

416

Plant Physiol. 2003, 131, 941-951.

417 418

(25) Przysiecka, Ł.; Książkiewicz, M.; Wolko, B.; Naganowska, B. Structure,

419

expression profile and phylogenetic inference of chalcone isomerase-like genes from the

420

narrow-leafed lupin (Lupinus angustifolius L.) genome. Front. Plant Sci. 2015, 6, 268.

421 422 423

(26) Papadopoulos, J. S.; Agarwala, R. COBALT: constraint-based alignment tool for multiple protein sequences. Bioinformatics 2007, 23, 1073-1079.

424 425

(27) Salmaso, M.; Faes, G.; Segala, C.; Stefanini, M.; Salakhutdinov, I.; Zyprian, E.;

426

Toepfer,; R. Grando, M. S.; Velasco, R. Genome diversity and gene haplotypes in the

ACS Paragon Plus Environment

Page 18 of 33

Page 19 of 33

Journal of Agricultural and Food Chemistry

427

grapevine (Vitis vinifera L.), as revealed by single nucleotide polymorphisms. Mol.

428

Breed. 2004, 14, 385-395.

429 430

(28) Kolkman, J. M.; Berry, S. T.; Leon, A. J.; Slabaugh, M. B.; Tang, S.; Gao, W.;

431

Shintani, D. K.; Burke, J. M.; Knapp, S. J. Single nucleotide polymorphisms and

432

linkage disequilibrium in sunflower. Genetics 2007, 177, 457-468.

433 434

(29) An, C.; Saha, S.; Jenkins, J. N.; Ma, D. P.; Scheffler, B. E.; Kohel, R. J.; Yu, J.

435

Z.; Stelly, D. M. Cotton (Gossypium spp.) R2R3-MYB transcription factors SNP

436

identification phylogenomic characterization, chromosome localization and linkage

437

mapping. Theor. Appl. Genet. 2008, 116, 1015-1026.

438 439

(30) Riahia, L.; Zoghlamia, N.; Dereeperb, A.; Laucouc, V.; Mlikia, A.; This. P.

440

Single nucleotide polymorphism and haplotype diversity of the gene NAC4 in

441

grapevine. Ind. Crops Prod. 2013, 43, 718-724.

442 443

(31) Nicolè, S.; Barcaccia, G.; Erickson, D. L.; Kress, J. W.; Lucchin, M. The coding

444

region of the UFGT gene is a source of diagnostic SNP markers that allow single-locus

445

DNA genotyping for the assessment of cultivar identity and ancestry in grapevine (Vitis

446

vinifera L.). BMC Res. Notes 2013, 6, 502.

447 448 449

(32) Druml, B.; Cichna-Markl, M. High resolution melting (HRM) analysis of DNA – Its role and potential in food analysis. Food Chem. 2014, 158, 245-254.

450

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

451

(33) De Koeyer, D.; Douglass, K.; Murphy, A.; Whitney, S.; Nolan, L.; Song, Y.; De

452

Jong, W. Application of high-resolution DNA melting for genotyping and variant

453

scanning of diploid and autotetraploid potato. Mol. Breed. 2010, 25, 67-90.

454 455

(34) Ó Lochlainn, S.; Amoah, S.; Graham, N. S.; Alamer, K.; Rios J. J. et al. High

456

Resolution Melt (HRM) analysis is an efficient tool to genotype EMS mutants in

457

complex crop genomes. Plant Methods 2011, 7, 43.

458 459

(35) Xanthopoulou, A.; Ganopoulos, I.; Tsaballa, A.; Nianiou-Obeidat, I.; Kalivas,

460

A.; Tsaftaris, A.; Madesis, P. Summer Squash Identification by High-Resolution-

461

Melting (HRM) Analysis Using Gene-Based EST–SSR Molecular Marker. Plant. Mol.

462

Biol. Rep. 2014, 32, 395-405.

463 464

(36) Li, Y.-Dan; Chu, Z.-Zhan; Liu, X.-Guo; Jing, H.-Chun; Liu, Y.-Guang; Hao, D-

465

Yun A Cost-effective High-resolution Melting Approach using the EvaGreen Dye for

466

DNA Polymorphism Detection and Genotyping in Plants. J. Integr. Plant Biol. 2010,

467

52, 1036-1042.

468 469

(37) Yan, G.-Xin; Wu, X.-Ming; Li, D.; Zeng, C.-Li; Lv, P.-Jun; Gao, G.-Zhen;

470

Chen, B.-Yun; Xua, K.; Lv, X.-Dan Assessing high-resolution melt curve analysis for

471

accurate detection of DNA polymorphisms in the chloroplast gene accD of Crucifer

472

species. Biochem. Sys. Ecol. 2012, 44, 352-360.

473

ACS Paragon Plus Environment

Page 20 of 33

Page 21 of 33

Journal of Agricultural and Food Chemistry

474

(38) Han, Y.; Khu, D.-Man; Monteros, M. J. High-resolution melting analysis for

475

SNP genotyping and mapping in tetraploid alfalfa (Medicago sativa L.). Mol. Breed.

476

2012, 29, 489-501.

477 478

(39) Emanuelli, F.; Sordo, M.; Lorenzi, S.; Battilana, J.; Grando, M. S. Development

479

of user-friendly functional molecular markers for VvDXS gene conferring muscat flavor

480

in grapevine. Mol. Breed. 2014, 33, 235-241.

481 482 483

(40) Ocana, J.; Walter, B.; Schellenbaum, P. Stable MSAP Markers for the Distinction of Vitis vinifera cv Pinot Noir Clones. Mol. Biotechnol. 2013, 55, 236-248.

484 485

(41) Lijavetzky, D.; Ruiz-Garcia, L.; Cabezas, J.A.; De Andres, M.T.; Bravo, G.;

486

Ibáñez, A.; Carreño, J.; Cabello, F.; Martínez-Zapater, J.M. Molecular genetics of berry

487

colour variation in table grape. Mol. Gen. Genet. 2006, 276, 427-435.

488 489

(42) Pelsy, F.; Dumas, V.; Bévilacqua, L.; Hocquigny, S.; Merdinoglu D.

490

Chromosome Replacement and Deletion Lead to Clonal Polymorphism of Berry Color

491

in Grapevine. PLoS Genet. 2015, 11, e1005081.

492 493

(43) Bindon, K. A.; McCarthy, M. G.; Smith, P. A. Development of wine colour and

494

non-bleachable pigments during the fermentation and ageing of (Vitis vinifera L. cv.)

495

Cabernet Sauvignon wines differing in anthocyanin and tannin concentration. LWT -

496

Food Sci. Technol. 2014, 59, 923-932.

497

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

498

(44) Dimitrovska, M.; Bocevska, M.; Dimitrovski, D.; Doneva-Sapceska, D.

499

Evolution of anthocyanins during vinification of Merlot and Pinot Noir grapes to wines.

500

Acta Aliment. 2015, 44, 259-267.

501 502

(45) Costa, E.; Cosme, F.; Jordão, A. M.; Mendes-Faia, A. Anthocyanin profile and

503

antioxidant activity from 24 grape varieties cultivated in two Portuguese wine regions.

504

J. Int. Sci. Vigne Vin 2014, 48, 51-62.

505 506

(46) Jordão, A. M.; Simão, S.; Correia, A. C.; Gonçalves, F. J. Antioxidant activity

507

evolution during Portuguese red wine vinification and their relation with the

508

proanthocyanidin and anthocyanin composition. J. Food Proc. Pres. 2012, 36, 298-309.

509 510

(47) Jordão, A. M.; Ricardo da Silva, J. M.; Laureano, O. Evolution of anthocyanins

511

during grape maturation of two varieties (Vitis vinifera L.), Castelão Francês and

512

Touriga Francesa. Vitis 1998, 37, 93-94.

513 514 515 516 517 518

Figure captions

519 520

Figure 1. Schematic representation of the primers set used in the UFGT gene. The

521

empty space corresponds to the intron.

522

ACS Paragon Plus Environment

Page 22 of 33

Page 23 of 33

Journal of Agricultural and Food Chemistry

523

Figure 2. HRM Difference plot of twenty-two grapevine varieties for UFGT fragment

524

showing eighteen different genotypes.

525 526

Figure 3 HRM different curves of the most representative grapevine varieties used in

527

wine production in Portugal and corresponding clones for UFGT fragment.

528 529

Figure 4. Difference plot of the grapevine varieties and corresponding clones presenting

530

the same genotype for UFGT fragment.

531 532

Figure 5. Phylogenetic tree based on deduced amino acid sequences variations of

533

UFGT from the red grapevine varieties using Cobalt multiple sequence alignment tool.

534 535 536 537 538 539 540 541 542

Figures

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

Figure 1.

Figure 2.

ACS Paragon Plus Environment

Page 24 of 33

Page 25 of 33

Journal of Agricultural and Food Chemistry

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

Figure 3.

ACS Paragon Plus Environment

Page 26 of 33

Page 27 of 33

Journal of Agricultural and Food Chemistry

Figure 4.

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

Figure 5.

ACS Paragon Plus Environment

Page 28 of 33

Page 29 of 33

Journal of Agricultural and Food Chemistry

List of Tables

Table 1. List of 22 grapevine varieties used for SNP identification, corresponding code and berry color. Grapevine variety name Code Berry color Alicante Bouschet AB Red Cabernet Sauvignon CS Red Chardonnay Ch White Côdega do Larinho CL White Donzelinho Tinto DT Red Fernão Pires FP White Gouveio Gou White Merlot M Red Malvasia Fina MF White Moscatel Galego MG White Pinot Noir PN Red Rufete Ruf Red Sousão Sou Red Tinta Amarela TA Red Tinta Barroca TB Red Touriga Brasileira TBr Red Tinto Cão TC Red Touriga Franca TF Red Tinta Francisca TFi Red Touriga Nacional TN Red Tinta Roriz TR Red Viosinho Vio White

543

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

Page 30 of 33

Table 2. Single Nucleotide Polymorphisms identified in UFGT gene with information on the genotypes composition found across the 22 grapevine varieties (Vitis vinifera L.). Nucleotide position Sample

90

98

139

205

207

220

238

240

257

264

265

272

309

366

375

424

425

442

459

483

525

555

560

562

598

600

617

619

636

663

685

762

789

816

841

843

UFGT TBr Ch MG GOU CS Sou Vio FP TR MF Ruf M TB TN TF TFi AB TA CL TC DT PN

C . S S . S S S G G G G . . . . . . . . S . .

C . . . . . . . . . . . . . . Y Y Y Y Y Y Y Y

C . . . . . . . . . . . . . . S S S S S S S S

G . R R . R R R A A A R . . . . . . . . R . .

G . S S . S S S C C C S . . . . . . . . S . .

G . S S S S S S C C C C . . . S S S S S C C S

G . . . . . . . . . . R . . . R R R R R R R S

C . . . M . . . . . . . . . . . . . . . . M .

A . . . . . . . . . . . . . . R R R R R R R R

G . . . . . . . . . . . . . . K K K K K K K K

C . . . . . . . M M A M . . . . . . . . . . .

T . . Y . Y Y Y C C C C . . . . . . . . Y . .

G . . R . A A A A A A A W A R R R R R R R . .

T . . . . . . . . . . . W . . . . . . . . . .

T . Y Y Y C C C C C C C C C Y Y Y Y Y Y Y Y .

T T -

G . . . . . . . . . . . K K K K . . . . . . .

C . . . . . . . . . . M . . . . . . . . . . .

C . . Y . Y T Y T T T Y . . . . . . . . Y . Y

G . . . . S S S . . . . C C S S S S S S . . .

A . . W . W W W T T T W . . . . . . . . W . W

A . M M M C C C C C C C C C M C C C C C C C M

A . . . R . . . . . . . . . . R R R R R R G .

G . . . . . . . . . . . . . . R R R R R R R .

A . . . . . . . T W W W . . . . . . . . . . .

T . . . Y . . . . . . . . . . . . . . . . Y .

C . . . M . . . . . . . . . . M M M M M M A .

A . . . R . . . . . . . . . . R R R R R R G .

C . . . Y . . . . . . . . . . Y Y Y Y Y Y T .

C . . . Y . . . . . . . . . . Y Y Y Y Y Y T .

A . . . . . . . . . . . . . . . . . . . . . W

C . . . . . . . . . . . . . . Y Y Y Y Y Y Y .

C . . . . . . . . . . . . . . M M M M M M M .

T . . . . Y Y Y Y Y Y . . . . Y Y Y Y Y C Y .

A . R R R G G G G G G G G G R G G G G G G G S

A . . . . W W W . . . . T T W W W W W W . . W

Numbering starts with A of the start codon. Nucleotide Code: A (Adenine); C (Cytosine); T (Thymine); G (Guanine); M (A or C); R (A or G); W (A or T); S (C or G); Y ( C or T) and K (G or T).

ACS Paragon Plus Environment

Page 31 of 33

Journal of Agricultural and Food Chemistry

Nucleotide position Sample

850

943

955

UFGT TBr Ch MG GOU CS Sou Vio FP TR MF Ruf M TB TN TF TFi AB TA CL TC DT PN

A . . . . . . . . . . . . . . W W W W W W W .

C . . . . S S S S S S . . . . . . . . . S . .

G . . . . K K K K K K . . . . . . . . . K . .

1014 1054 1083 1122 1131 1134 1144 1146 1160 1165 1193 1232 1245 1273 1345 1404 1405 1409 1459 1481 G . K . . . . . . . . . . . . . . . . . . . .

T . . . . Y Y Y Y Y Y Y C C Y Y Y Y Y Y . . Y

G . . . . . . . . . . . . . . . . . . . . . R

A . . . . M M M M M M . . . . . . . . . M . .

T . . . . Y Y Y Y Y Y . . . . . . . . . Y . .

A . . . . T T T T T T T T T W T T T T T T W W

T . . . K . . . . . . . . . . . . . . . . K .

G . . . . . . . . . . . . . . R R R R R R R .

C . . . . . . . . . . . . . . S S S S S S S .

G . . . . . . . . . . K . . . . . . . . . . .

A . W W W T T T T T T T T T W T T T T T T T W

C . Y . . Y Y Y T T T T . . . Y Y Y Y Y T Y Y

A . . . . M M M M M . M . . . . . . . . M . .

G . R . . A A A A A A A A A R A A A A A A R R

G . R R R A A A A A A A A A R A A A A A A A R

T . . . . . . . . . . . . . . . . . . . . . Y

T . . . . . . . . W . . . . . . . . . . . . .

A . . . . W W W W W W . . . . . . . . . W . .

T . . . . . . . Y Y . Y . . . . . . . . . . .

A . . . . . . . . R . R . . . . . . . . . . .

Numbering starts with A of the start codon. Nucleotide Code: A (Adenine); C (Cytosine); T (Thymine); G (Guanine); M (A or C); R (A or G); W (A or T); S (C or G); Y ( C or T) and K (G or T).

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

Page 32 of 33

Table 3. Predicted amino acid sequence variations in UFGT, using the reference genome, based on the SNPs found across the twenty-two grapevine varieties (Vitis vinifera L.). Amino acid position Sample UFGT CS Sou Vio FP MF TR Ruf Ch MG TBr GOU M TB TN PN TFi AB CL TA TF DT TC

33 47 69 74 80 86 88 89 91 142 153 161 174 181 182 204 256 259 290 293 312 327 357 362 364 373 386 400 424 444 445

A Q V A A D E . . I P . . . . . I P . . . . . I P . . . . . I P . . . . . I P . . . . . . P . . . . . . P T . . . . . P . . . . . . P . . . . . . . . . . . . . P . . . . . . . . . . . . . . . . . . . . . . . . V E . P P G D V E . P T G D V E . P T G D V E . P T G D V E . P T G D V E . P T G D V E . P T G D V E I P T G D

L . . . M M M M . . . . . . . . . . . . . . .

M T T T T T T T . T . . . . . . . . . . . . T

A . . . . . . . . . . . S S S . . . . . S . .

T I I I I I I I . I . . . . . I . . . . . . I.

G A A A . . . . . . . . A A A . A A A A A . .

N . . . Y Y Y Y . . . . . . . . . . . . . . .

S K M I . . . V . . . V . . . V . . . V . . . V . . . V . . . V . . . . . . . . . . . . Y E . . . . V . . . V . . . . . . L L Y E . V Y E . V Y E . V Y E . V Y E . V Y E . V Y E . V

T . . . . . . . . . . . . . . . S S S S S S S

L A V S V S V S V S V S . S . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . V S

R . . . . . . . S . . . . . . . . . . . . . .

Y L A G H . . . H . . . H . . . H . . . H . . . H . . . H . . W . . . . . . . . . . . . . V . . H . . . H . . . H . . . H . . . H . G . H . G . H . G . H . G . H . G . . V G . . . G .

Y F F F F F F F . . . . F F . . F F F F F F F

A V V V V V V V V . . . . . . V V V V V V V V

E K K K K K K K . . . . K K . . K K K K K

G R R R R R R R . . . . R R . . R R R R R R K R

F . . . . . I . . . . . . . . . . . . . . . .

K I I I I I I . . . . . . . . . . . . . . . I

Amino acid code: A (Alanine); D (Aspartic Acid); E (Glutamic Acid); F (Phenylalanine); G (Glycine); H (Histidine); I (Isoleucine); K (Lysine); L (Leucine); M (Methionine); N (Asparagine); P (Proline); Q (Glutamine); R (Arginine); S (Serine); T (Threonine); V (Valine); W (Tryptophan); Y (Tyrosine).

ACS Paragon Plus Environment

Page 33 of 33

Journal of Agricultural and Food Chemistry

Graphical Abstract 320x320mm (96 x 96 DPI)

ACS Paragon Plus Environment