Comparative Transcriptomic Analysis Reveals Regulatory

1 hour ago - Herein, we analyzed theanine content in tea (Camellia sinensis) and oil tea (C. oleifera), and found it to be higher in roots than other ...
5 downloads 0 Views 2MB Size
Subscriber access provided by Nottingham Trent University

Functional Structure/Activity Relationships

Comparative Transcriptomic Analysis Reveals Regulatory Mechanisms of Theanine Synthesis in Tea (Camellia sinensis) and Oil Tea (Camellia. oleifera) Plants Yuling Tai, chengcheng Ling, huanhuan Wang, lin yang, guangbiao she, chengxiang wang, shuwei yu, Wei Chen, Chun Liu, and Xiaochun Wan J. Agric. Food Chem., Just Accepted Manuscript • DOI: 10.1021/acs.jafc.9b02295 • Publication Date (Web): 22 Aug 2019 Downloaded from pubs.acs.org on August 22, 2019

Just Accepted “Just Accepted” manuscripts have been peer-reviewed and accepted for publication. They are posted online prior to technical editing, formatting for publication and author proofing. The American Chemical Society provides “Just Accepted” as a service to the research community to expedite the dissemination of scientific material as soon as possible after acceptance. “Just Accepted” manuscripts appear in full in PDF format accompanied by an HTML abstract. “Just Accepted” manuscripts have been fully peer reviewed, but should not be considered the official version of record. They are citable by the Digital Object Identifier (DOI®). “Just Accepted” is an optional service offered to authors. Therefore, the “Just Accepted” Web site may not include all articles that will be published in the journal. After a manuscript is technically edited and formatted, it will be removed from the “Just Accepted” Web site and published as an ASAP article. Note that technical editing may introduce minor changes to the manuscript text and/or graphics which could affect content, and all legal disclaimers and ethical guidelines that apply to the journal pertain. ACS cannot be held responsible for errors or consequences arising from the use of information contained in these “Just Accepted” manuscripts.

is published by the American Chemical Society. 1155 Sixteenth Street N.W., Washington, DC 20036 Published by American Chemical Society. Copyright © American Chemical Society. However, no copyright claim is made to original U.S. Government works, or works produced by employees of any Commonwealth realm Crown government in the course of their duties.

Page 1 of 34

Journal of Agricultural and Food Chemistry

Comparative Transcriptomic Analysis Reveals Regulatory Mechanisms of Theanine Synthesis in Tea (Camellia sinensis) and Oil Tea (Camellia oleifera) Plants Yuling Tai,†,‡ Chengcheng Ling,† Huanhuan Wang,† Lin Yang,† Guangbiao She,‡ Chengxiang Wang,† Shuwei Yu,‡ Wei Chen,† Chun Liu, *,†,§ Xianchun Wan, *,‡ † School of Life Science, Anhui Agricultural University, Hefei 230036, China

‡ State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China

§ BGI Genomics, BGI-Shenzhen, Shenzhen, China

Corresponding author:

(Chun Liu) E-mail: [email protected]

(Xiaochun Wan) E-mail: [email protected]

1

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

1

ABSTRACT: Tea provides a rich taste and has healthy properties due to its variety of

2

bioactive compounds, such as theanine, catechins and caffeine. Theanine is the most

3

abundant free amino acid (40%−70%) in tea leaves. Key genes related to theanine

4

biosynthesis have been studied, but relatively little is known about the regulatory

5

mechanisms of theanine accumulation in tea leaves. Herein, we analyzed theanine

6

content in tea (Camellia sinensis) and oil tea (Camellia oleifera), and found it to be

7

higher in roots than other tissues in both species. The theanine content was significantly

8

higher in tea than oil tea. To explore the regulatory mechanisms of theanine

9

accumulation, we identified genes involved in theanine biosynthesis by RNA-Seq

10

analysis, and compared theanine-related modules. Moreover, we cloned theanine

11

synthase (TS) promoters from tea and oil tea plants, and found that a difference in TS

12

expression and cis-acting elements may explain the difference in theanine accumulation

13

between the two species. These data provide an important resource for regulatory

14

mechanisms of theanine accumulation in tea plants.

15

KEYWORDS: Tea, oil tea, theanine biosynthesis, transcriptomics, regulatory

16

mechanism, RNA-seq, CsTS promoter, CoTS promoter, transcription factors

17

18

19

ACS Paragon Plus Environment

Page 2 of 34

Page 3 of 34

Journal of Agricultural and Food Chemistry

20

INTRODUCTION

21

Consumed since 3,000 BC, tea is one of the most important beverage crops worldwide,

22

and offers important health benefits due to its numerous bioactive compounds including

23

catechins, theanine, caffeine, vitamins and volatile oils1-3. These compounds also

24

contribute to the pleasant aroma and appealing taste4-6. Tea is made from the leaves of tea

25

plants (Camellia sinensis) belonging to genus Camellia in the Theaceae family of

26

angiosperms. The Camellia genus contains several other economically important species,

27

including the traditional oil tea plants (Camellia oleifera) that produce high-quality edible

28

seed oil 7. Oil tea plants are genetically closely related to tea plants, and are grown only

29

in China8-11.

30

Theanine (γ-glutamylethylamide) is one of the major amino acid components in tea

31

beverages, accounting for 1−2% dry weight 40−70% of total free amino acids in tea

32

leaves

33

plants, and has been detected in trace amounts in C. oleifera, C. japonica and C.

34

sasanqua, as well as the fungus Xerocomus badius

35

compounds in tea plants, theanine contributes to the umami and sweet tastes. In addition,

36

recent studies suggest that theanine has a variety of health functions including controlling

37

hypertension, lowering blood pressure, reducing anxiety and inhibiting tumor growth 15 16

38

17

39

blood-brain barrier 18, 19.

12

. Theanine is a unique non-protein amino acid found almost exclusively in tea

13, 14

. As one of the characteristic

, and it can affect the mammalian central nervous system by passing through the

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

Page 4 of 34

40

Many researchers have investigated the biosynthesis of theanine and found that it is

41

synthesized in tea plant roots and transported to aerial parts, especially tender shoots 20, 21.

42

The key enzymes related to the theanine biosynthesis pathway are glutamine synthetase

43

(GS), theanine synthetase (TS), glutamate synthase (GOGAT) and glutamate

44

dehydrogenase (GDH)

45

performed in the 1960s 23, but more recent research

46

and regulation mechanisms related to the theanine biosynthesis pathway. However, little

47

is currently known about the properties of the theanine synthesis-related enzymes in tea

48

plants. It is interesting that only tea plants appear to synthesize and accumulate large

49

quantities of theanine.

22

. Studies on theanine biosynthesis in tea plants were first 24, 25

has identified the key enzymes

50

Weighted correlation network analysis (WGCNA) is a popular method for gene

51

co-expression network analysis, and this correlation-based approach facilitates

52

visualization of co-expression networks derived from transcriptomic data

53

technique has been successfully utilized to compare gene-expression patterns in

54

domesticated and wild tomato

55

networks for tea and oil tea based on transcriptomic data, and identified candidate genes

56

including TS (hereafter referred to as TSI) in theanine-related modules. TS is one of the

57

key enzymes and catalyzes the last step in theanine synthesis. Recently, Wei et al. (2018)

58

identified candidate gene CsTSI in the tea plant genome

59

protein product possessed theanine synthetase activity. Herein, we cloned and analyzed

27

26

. This

. In the present study, we constructed co-expression

ACS Paragon Plus Environment

28

, and demonstrated that the

Page 5 of 34

Journal of Agricultural and Food Chemistry

60

the C. sinensis and C. oleifera TS promoters (CsTSI and CoTSI) associated with the

61

theanine-related modules, and predicted the transcription factor (TF) binding sites

62

therein. Our findings provide useful data for theanine research, and illuminate the

63

mechanisms regulating the accumulation and biosynthesis of theanine. Our transcriptome

64

data will serve as an important resource for future functional genomics and breeding of

65

tea plants.

66

MATERIALS AND METHODS

67

Plant Materials

68

Cuttings of 6-year-old C. oleifera and C. sinensis plants were used, and plants were

69

grown in natural conditions at the De Chang Fabrication Base in Anhui Province, China

70

(Shucheng, latitude 31.3° N, longitude 117.2° E, above sea level). All plants were

71

fertilized and watered equally. Plants were grown in an experimental field with 150 cm

72

between rows and 40 cm between plants within a row, and the soil was acidic (pH 5.6).

73

Three independent biological replicates were performed for each sample, and each

74

sample was collected from 15 randomly selected tea and oil tea plants, respectively. All

75

samples were frozen in liquid nitrogen immediately after harvesting, and stored at −80°C

76

until RNA isolation. Bud (CO-B, CS-B), first young leaf (CO-L1, CS-L1), second young

77

leaf (CO-L2, CS-L2), mature leaf (CO-ML, CS-ML), stem (CO-S, CS-S), flower

78

(CO-FL, CS-FL), fruit (CO-FR, CS-FR) in June, and root (CO-R, CS-R) tissues

79

comprised the eight tissue samples studied. All necessary permits were obtained from

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

Page 6 of 34

80

Wenyong Zhan, the director of the De Chang Fabrication Base.

81

Extraction and HPLC Determination of Theanine

82

Theanine was extracted from samples using hot water as described previously

83

some modifications. Briefly, 0.20 g of freeze-dried tea leaves were extracted with

84

deionized water for 20 min in a boiling water bath and centrifuged at 6,000 rpm for 10

85

min. Residues were re-extracted once as described above, supernatants were combined

86

and diluted with water to 10 mL, and filtered through a 0.22-μm membrane before HPLC

87

determination. Each sample was analyzed using three independent biological replicates.

88

Theanine content was measured using a Waters 600E series HPLC system (Waters, USA)

89

at a detection wavelength of 199 nm, and the injection approach and mobile phase ratio

90

were as described previously 13. The theanine standard used in this study was purchased

91

from Shanghai Winherb Medical Technology, Ltd., China. Theanine content is reported

92

as mean ± standard deviation (SD) from three independent biological replicates.

93

RNA Extraction, Library Construction, and Sequencing

94

Total RNA was extracted separately from each tissue using a modified CTAB (hexadecyl

95

trimethyl ammonium bromide) method as described previously 30, and each sample was

96

extracted in triplicate. The quality and quantity of total RNA were characterized using

97

1.2% agarose gel electrophoresis (AGE) and examined with a NanoDrop 2000 instrument

98

(USA). RNA integrity (RIN) was determined using an Agilent Bioanalyzer 2100

99

(Agilent, USA), and RNA samples with an A260/A280 ratio >1.8, A260/A230 ratio >1.8,

100

and RIN >8 were considered suitable for library construction. Equal amounts of RNA

ACS Paragon Plus Environment

29

with

Page 7 of 34

Journal of Agricultural and Food Chemistry

101

from three independent biological replicates were pooled prior to cDNA library

102

preparation. Library construction (including mRNA enrichment, cDNA synthesis,

103

fragment interruption, adapter addition, size selection, PCR amplification, and

104

transcriptomic sequencing) was performed by staff at the Beijing Genome Institute (BGI;

105

Shenzhen, China). Briefly, mRNA was enriched from total RNA using magnetic beads

106

with Oligo (dT), and cleaved into short fragments. These fragments were synthesized into

107

first-strand cDNA with random primers (TaKaRa, Japan), and double-stranded cDNAs

108

were then synthesized and repaired using T4 DNA ligase Klenow DNA polymerase and

109

T4 polynucleotide kinase (NEB, USA). Products were enriched by PCR to generate the

110

cDNA library after ligation of adapters. The cDNA library was examined after quality

111

control using an Agilent 2100 Bioanalyzer, and 90 bp paired-end reads were obtained for

112

qualified library construction using an Illumina HiSeq 2500 sequencer 31.

113

De Novo Transcriptome Assembly and Open Reading Frame Identification

114

Sequencing data (raw reads) were firstly filtered by removing low-quality reads (base

115

quality 5%). The remaining high-quality reads (clean reads) were de novo assembled using

117

Trinity 32 with parameters ‘min_kmer_cov = 7’ and ‘min_glue = 3’. The resulting contigs

118

were clustering and redundancy was removed using TGICL

119

Non-redundant

120

(https://transdecoder.github.io) was employed to identify candidate coding regions.

121

High-quality ORF datasets were obtained by removing sequence redundancy using

contigs

were

considered

ACS Paragon Plus Environment

33

with default parameters.

unigenes.

Transdecoder

Journal of Agricultural and Food Chemistry

Page 8 of 34

122

CD-HIT (version 4.6.8) 34. The completeness of unigenes was evaluated using the Busco

123

(version 3.0.2)

124

C. sinensis transcriptomic data (SRR1928149 and SRP056466) were used for comparison

125

with the C. oleifera data (PRJNA274203).

126

Functional Annotation and Classification

127

Unigenes were classified into functional categories using various databases and tools.

128

Unigenes and ORFs were aligned against NR (NCBI non-redundant protein sequences,

129

SwissProt36 (a manually annotated and reviewed protein sequence database), KOG

130

(Clusters of Orthologous Groups of proteins) and KEGG databases with a threshold

131

E-value of 10-5. GO (Gene ontology) annotation was carried out using the Blast 2 GO

132

program (version 2.3.4) based on BLAST search results against the NR database. Protein

133

domains and important sites were predicted by InterPro (http://www.ebi.ac.uk/interpro/).

134

TF analysis was carried out by iTAK

135

rules were mainly summarized from PlnTFDB

136

PlantTFcat 40 and AtTFDB 41 provided supporting evidence.

137

Phylogenetic Analysis Evolutionary Relationships between TSs and GSs

138

We retrieved the sequences identified from the Wei et al.

139

and GS from C. sinensis var. assamica, C. oleifera and Elaeis guineensis. The

140

phylogenetic tree was constructed and analyzed by MEGA X 42. Sequences of TS and GS

141

from C.sinensis cv. Shuchazao C. sinensis var. assamica, C. oleifera, A. thaliana, P.

142

vulgaris, G. max, Z. mays, T. aestivum, O. sativa, M. truncatula, E. guineensis were

35

and Embryophyta databases, and the data used for subsequent analysis.

37

using the standalone version (1.7a), consensus 38

and PlantTFDB

ACS Paragon Plus Environment

28

39

, and families from

and added sequences of TS

Page 9 of 34

Journal of Agricultural and Food Chemistry

143

aligned

144

maximum-likelihood method with five hundreds of bootstrap replicates

145

phylogenetic tree was visualized by ITOL (https://itol.embl.de/index.shtml). A

146

phylogenetic tree for AMT (ammonium transporter) was constructed using the same

147

method.

148

Identification of DEGs and Construction of Co-Expression Networks

149

Unigene expression levels were estimated and normalized using the fragments per

150

kilobase of exon model per million mapped reads (FPKM) method. We calculated DEGs

151

based on the method described in Love et al.

152

from RNA-seq reads, then estimated variance-mean dependence in count data and tested

153

for differential expression based on a model using the negative binomial distribution.

154

Genes with absolute log 2 ratio values ≥1 and a false discovery rate (FDR)