Subscriber access provided by Nottingham Trent University
Functional Structure/Activity Relationships
Comparative Transcriptomic Analysis Reveals Regulatory Mechanisms of Theanine Synthesis in Tea (Camellia sinensis) and Oil Tea (Camellia. oleifera) Plants Yuling Tai, chengcheng Ling, huanhuan Wang, lin yang, guangbiao she, chengxiang wang, shuwei yu, Wei Chen, Chun Liu, and Xiaochun Wan J. Agric. Food Chem., Just Accepted Manuscript • DOI: 10.1021/acs.jafc.9b02295 • Publication Date (Web): 22 Aug 2019 Downloaded from pubs.acs.org on August 22, 2019
Just Accepted “Just Accepted” manuscripts have been peer-reviewed and accepted for publication. They are posted online prior to technical editing, formatting for publication and author proofing. The American Chemical Society provides “Just Accepted” as a service to the research community to expedite the dissemination of scientific material as soon as possible after acceptance. “Just Accepted” manuscripts appear in full in PDF format accompanied by an HTML abstract. “Just Accepted” manuscripts have been fully peer reviewed, but should not be considered the official version of record. They are citable by the Digital Object Identifier (DOI®). “Just Accepted” is an optional service offered to authors. Therefore, the “Just Accepted” Web site may not include all articles that will be published in the journal. After a manuscript is technically edited and formatted, it will be removed from the “Just Accepted” Web site and published as an ASAP article. Note that technical editing may introduce minor changes to the manuscript text and/or graphics which could affect content, and all legal disclaimers and ethical guidelines that apply to the journal pertain. ACS cannot be held responsible for errors or consequences arising from the use of information contained in these “Just Accepted” manuscripts.
is published by the American Chemical Society. 1155 Sixteenth Street N.W., Washington, DC 20036 Published by American Chemical Society. Copyright © American Chemical Society. However, no copyright claim is made to original U.S. Government works, or works produced by employees of any Commonwealth realm Crown government in the course of their duties.
Page 1 of 34
Journal of Agricultural and Food Chemistry
Comparative Transcriptomic Analysis Reveals Regulatory Mechanisms of Theanine Synthesis in Tea (Camellia sinensis) and Oil Tea (Camellia oleifera) Plants Yuling Tai,†,‡ Chengcheng Ling,† Huanhuan Wang,† Lin Yang,† Guangbiao She,‡ Chengxiang Wang,† Shuwei Yu,‡ Wei Chen,† Chun Liu, *,†,§ Xianchun Wan, *,‡ † School of Life Science, Anhui Agricultural University, Hefei 230036, China
‡ State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China
§ BGI Genomics, BGI-Shenzhen, Shenzhen, China
Corresponding author:
(Chun Liu) E-mail:
[email protected] (Xiaochun Wan) E-mail:
[email protected] 1
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
1
ABSTRACT: Tea provides a rich taste and has healthy properties due to its variety of
2
bioactive compounds, such as theanine, catechins and caffeine. Theanine is the most
3
abundant free amino acid (40%−70%) in tea leaves. Key genes related to theanine
4
biosynthesis have been studied, but relatively little is known about the regulatory
5
mechanisms of theanine accumulation in tea leaves. Herein, we analyzed theanine
6
content in tea (Camellia sinensis) and oil tea (Camellia oleifera), and found it to be
7
higher in roots than other tissues in both species. The theanine content was significantly
8
higher in tea than oil tea. To explore the regulatory mechanisms of theanine
9
accumulation, we identified genes involved in theanine biosynthesis by RNA-Seq
10
analysis, and compared theanine-related modules. Moreover, we cloned theanine
11
synthase (TS) promoters from tea and oil tea plants, and found that a difference in TS
12
expression and cis-acting elements may explain the difference in theanine accumulation
13
between the two species. These data provide an important resource for regulatory
14
mechanisms of theanine accumulation in tea plants.
15
KEYWORDS: Tea, oil tea, theanine biosynthesis, transcriptomics, regulatory
16
mechanism, RNA-seq, CsTS promoter, CoTS promoter, transcription factors
17
18
19
ACS Paragon Plus Environment
Page 2 of 34
Page 3 of 34
Journal of Agricultural and Food Chemistry
20
INTRODUCTION
21
Consumed since 3,000 BC, tea is one of the most important beverage crops worldwide,
22
and offers important health benefits due to its numerous bioactive compounds including
23
catechins, theanine, caffeine, vitamins and volatile oils1-3. These compounds also
24
contribute to the pleasant aroma and appealing taste4-6. Tea is made from the leaves of tea
25
plants (Camellia sinensis) belonging to genus Camellia in the Theaceae family of
26
angiosperms. The Camellia genus contains several other economically important species,
27
including the traditional oil tea plants (Camellia oleifera) that produce high-quality edible
28
seed oil 7. Oil tea plants are genetically closely related to tea plants, and are grown only
29
in China8-11.
30
Theanine (γ-glutamylethylamide) is one of the major amino acid components in tea
31
beverages, accounting for 1−2% dry weight 40−70% of total free amino acids in tea
32
leaves
33
plants, and has been detected in trace amounts in C. oleifera, C. japonica and C.
34
sasanqua, as well as the fungus Xerocomus badius
35
compounds in tea plants, theanine contributes to the umami and sweet tastes. In addition,
36
recent studies suggest that theanine has a variety of health functions including controlling
37
hypertension, lowering blood pressure, reducing anxiety and inhibiting tumor growth 15 16
38
17
39
blood-brain barrier 18, 19.
12
. Theanine is a unique non-protein amino acid found almost exclusively in tea
13, 14
. As one of the characteristic
, and it can affect the mammalian central nervous system by passing through the
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Page 4 of 34
40
Many researchers have investigated the biosynthesis of theanine and found that it is
41
synthesized in tea plant roots and transported to aerial parts, especially tender shoots 20, 21.
42
The key enzymes related to the theanine biosynthesis pathway are glutamine synthetase
43
(GS), theanine synthetase (TS), glutamate synthase (GOGAT) and glutamate
44
dehydrogenase (GDH)
45
performed in the 1960s 23, but more recent research
46
and regulation mechanisms related to the theanine biosynthesis pathway. However, little
47
is currently known about the properties of the theanine synthesis-related enzymes in tea
48
plants. It is interesting that only tea plants appear to synthesize and accumulate large
49
quantities of theanine.
22
. Studies on theanine biosynthesis in tea plants were first 24, 25
has identified the key enzymes
50
Weighted correlation network analysis (WGCNA) is a popular method for gene
51
co-expression network analysis, and this correlation-based approach facilitates
52
visualization of co-expression networks derived from transcriptomic data
53
technique has been successfully utilized to compare gene-expression patterns in
54
domesticated and wild tomato
55
networks for tea and oil tea based on transcriptomic data, and identified candidate genes
56
including TS (hereafter referred to as TSI) in theanine-related modules. TS is one of the
57
key enzymes and catalyzes the last step in theanine synthesis. Recently, Wei et al. (2018)
58
identified candidate gene CsTSI in the tea plant genome
59
protein product possessed theanine synthetase activity. Herein, we cloned and analyzed
27
26
. This
. In the present study, we constructed co-expression
ACS Paragon Plus Environment
28
, and demonstrated that the
Page 5 of 34
Journal of Agricultural and Food Chemistry
60
the C. sinensis and C. oleifera TS promoters (CsTSI and CoTSI) associated with the
61
theanine-related modules, and predicted the transcription factor (TF) binding sites
62
therein. Our findings provide useful data for theanine research, and illuminate the
63
mechanisms regulating the accumulation and biosynthesis of theanine. Our transcriptome
64
data will serve as an important resource for future functional genomics and breeding of
65
tea plants.
66
MATERIALS AND METHODS
67
Plant Materials
68
Cuttings of 6-year-old C. oleifera and C. sinensis plants were used, and plants were
69
grown in natural conditions at the De Chang Fabrication Base in Anhui Province, China
70
(Shucheng, latitude 31.3° N, longitude 117.2° E, above sea level). All plants were
71
fertilized and watered equally. Plants were grown in an experimental field with 150 cm
72
between rows and 40 cm between plants within a row, and the soil was acidic (pH 5.6).
73
Three independent biological replicates were performed for each sample, and each
74
sample was collected from 15 randomly selected tea and oil tea plants, respectively. All
75
samples were frozen in liquid nitrogen immediately after harvesting, and stored at −80°C
76
until RNA isolation. Bud (CO-B, CS-B), first young leaf (CO-L1, CS-L1), second young
77
leaf (CO-L2, CS-L2), mature leaf (CO-ML, CS-ML), stem (CO-S, CS-S), flower
78
(CO-FL, CS-FL), fruit (CO-FR, CS-FR) in June, and root (CO-R, CS-R) tissues
79
comprised the eight tissue samples studied. All necessary permits were obtained from
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Page 6 of 34
80
Wenyong Zhan, the director of the De Chang Fabrication Base.
81
Extraction and HPLC Determination of Theanine
82
Theanine was extracted from samples using hot water as described previously
83
some modifications. Briefly, 0.20 g of freeze-dried tea leaves were extracted with
84
deionized water for 20 min in a boiling water bath and centrifuged at 6,000 rpm for 10
85
min. Residues were re-extracted once as described above, supernatants were combined
86
and diluted with water to 10 mL, and filtered through a 0.22-μm membrane before HPLC
87
determination. Each sample was analyzed using three independent biological replicates.
88
Theanine content was measured using a Waters 600E series HPLC system (Waters, USA)
89
at a detection wavelength of 199 nm, and the injection approach and mobile phase ratio
90
were as described previously 13. The theanine standard used in this study was purchased
91
from Shanghai Winherb Medical Technology, Ltd., China. Theanine content is reported
92
as mean ± standard deviation (SD) from three independent biological replicates.
93
RNA Extraction, Library Construction, and Sequencing
94
Total RNA was extracted separately from each tissue using a modified CTAB (hexadecyl
95
trimethyl ammonium bromide) method as described previously 30, and each sample was
96
extracted in triplicate. The quality and quantity of total RNA were characterized using
97
1.2% agarose gel electrophoresis (AGE) and examined with a NanoDrop 2000 instrument
98
(USA). RNA integrity (RIN) was determined using an Agilent Bioanalyzer 2100
99
(Agilent, USA), and RNA samples with an A260/A280 ratio >1.8, A260/A230 ratio >1.8,
100
and RIN >8 were considered suitable for library construction. Equal amounts of RNA
ACS Paragon Plus Environment
29
with
Page 7 of 34
Journal of Agricultural and Food Chemistry
101
from three independent biological replicates were pooled prior to cDNA library
102
preparation. Library construction (including mRNA enrichment, cDNA synthesis,
103
fragment interruption, adapter addition, size selection, PCR amplification, and
104
transcriptomic sequencing) was performed by staff at the Beijing Genome Institute (BGI;
105
Shenzhen, China). Briefly, mRNA was enriched from total RNA using magnetic beads
106
with Oligo (dT), and cleaved into short fragments. These fragments were synthesized into
107
first-strand cDNA with random primers (TaKaRa, Japan), and double-stranded cDNAs
108
were then synthesized and repaired using T4 DNA ligase Klenow DNA polymerase and
109
T4 polynucleotide kinase (NEB, USA). Products were enriched by PCR to generate the
110
cDNA library after ligation of adapters. The cDNA library was examined after quality
111
control using an Agilent 2100 Bioanalyzer, and 90 bp paired-end reads were obtained for
112
qualified library construction using an Illumina HiSeq 2500 sequencer 31.
113
De Novo Transcriptome Assembly and Open Reading Frame Identification
114
Sequencing data (raw reads) were firstly filtered by removing low-quality reads (base
115
quality 5%). The remaining high-quality reads (clean reads) were de novo assembled using
117
Trinity 32 with parameters ‘min_kmer_cov = 7’ and ‘min_glue = 3’. The resulting contigs
118
were clustering and redundancy was removed using TGICL
119
Non-redundant
120
(https://transdecoder.github.io) was employed to identify candidate coding regions.
121
High-quality ORF datasets were obtained by removing sequence redundancy using
contigs
were
considered
ACS Paragon Plus Environment
33
with default parameters.
unigenes.
Transdecoder
Journal of Agricultural and Food Chemistry
Page 8 of 34
122
CD-HIT (version 4.6.8) 34. The completeness of unigenes was evaluated using the Busco
123
(version 3.0.2)
124
C. sinensis transcriptomic data (SRR1928149 and SRP056466) were used for comparison
125
with the C. oleifera data (PRJNA274203).
126
Functional Annotation and Classification
127
Unigenes were classified into functional categories using various databases and tools.
128
Unigenes and ORFs were aligned against NR (NCBI non-redundant protein sequences,
129
SwissProt36 (a manually annotated and reviewed protein sequence database), KOG
130
(Clusters of Orthologous Groups of proteins) and KEGG databases with a threshold
131
E-value of 10-5. GO (Gene ontology) annotation was carried out using the Blast 2 GO
132
program (version 2.3.4) based on BLAST search results against the NR database. Protein
133
domains and important sites were predicted by InterPro (http://www.ebi.ac.uk/interpro/).
134
TF analysis was carried out by iTAK
135
rules were mainly summarized from PlnTFDB
136
PlantTFcat 40 and AtTFDB 41 provided supporting evidence.
137
Phylogenetic Analysis Evolutionary Relationships between TSs and GSs
138
We retrieved the sequences identified from the Wei et al.
139
and GS from C. sinensis var. assamica, C. oleifera and Elaeis guineensis. The
140
phylogenetic tree was constructed and analyzed by MEGA X 42. Sequences of TS and GS
141
from C.sinensis cv. Shuchazao C. sinensis var. assamica, C. oleifera, A. thaliana, P.
142
vulgaris, G. max, Z. mays, T. aestivum, O. sativa, M. truncatula, E. guineensis were
35
and Embryophyta databases, and the data used for subsequent analysis.
37
using the standalone version (1.7a), consensus 38
and PlantTFDB
ACS Paragon Plus Environment
28
39
, and families from
and added sequences of TS
Page 9 of 34
Journal of Agricultural and Food Chemistry
143
aligned
144
maximum-likelihood method with five hundreds of bootstrap replicates
145
phylogenetic tree was visualized by ITOL (https://itol.embl.de/index.shtml). A
146
phylogenetic tree for AMT (ammonium transporter) was constructed using the same
147
method.
148
Identification of DEGs and Construction of Co-Expression Networks
149
Unigene expression levels were estimated and normalized using the fragments per
150
kilobase of exon model per million mapped reads (FPKM) method. We calculated DEGs
151
based on the method described in Love et al.
152
from RNA-seq reads, then estimated variance-mean dependence in count data and tested
153
for differential expression based on a model using the negative binomial distribution.
154
Genes with absolute log 2 ratio values ≥1 and a false discovery rate (FDR)