N-Glycoproteomic Analysis of Chicken Egg Yolk - Journal of

Oct 11, 2018 - This website uses cookies to improve your user experience. By continuing to use the site, you are accepting our use of cookies. Read th...
0 downloads 0 Views 2MB Size
Subscriber access provided by NEW MEXICO STATE UNIV

Omics Technologies Applied to Agriculture and Food

N-glycoproteomic Analysis of Chicken Egg Yolk Fang Geng, Yunxiao Xie, Jinqiu Wang, Kaustav Majumder, Ning Qiu, and Meihu Ma J. Agric. Food Chem., Just Accepted Manuscript • DOI: 10.1021/acs.jafc.8b04492 • Publication Date (Web): 11 Oct 2018 Downloaded from http://pubs.acs.org on October 15, 2018

Just Accepted “Just Accepted” manuscripts have been peer-reviewed and accepted for publication. They are posted online prior to technical editing, formatting for publication and author proofing. The American Chemical Society provides “Just Accepted” as a service to the research community to expedite the dissemination of scientific material as soon as possible after acceptance. “Just Accepted” manuscripts appear in full in PDF format accompanied by an HTML abstract. “Just Accepted” manuscripts have been fully peer reviewed, but should not be considered the official version of record. They are citable by the Digital Object Identifier (DOI®). “Just Accepted” is an optional service offered to authors. Therefore, the “Just Accepted” Web site may not include all articles that will be published in the journal. After a manuscript is technically edited and formatted, it will be removed from the “Just Accepted” Web site and published as an ASAP article. Note that technical editing may introduce minor changes to the manuscript text and/or graphics which could affect content, and all legal disclaimers and ethical guidelines that apply to the journal pertain. ACS cannot be held responsible for errors or consequences arising from the use of information contained in these “Just Accepted” manuscripts.

is published by the American Chemical Society. 1155 Sixteenth Street N.W., Washington, DC 20036 Published by American Chemical Society. Copyright © American Chemical Society. However, no copyright claim is made to original U.S. Government works, or works produced by employees of any Commonwealth realm Crown government in the course of their duties.

Page 1 of 34

Journal of Agricultural and Food Chemistry

N-glycoproteomic Analysis of Chicken Egg Yolk

1 2 3

Fang Geng1,2, Yunxiao Xie1,2, Jinqiu Wang1,2*, Kaustav Majumder3*, Ning Qiu4, Meihu

4

Ma4

5 6

1

7

University, No. 2025 Chengluo Avenue, Chengdu, 610106, P. R. China

Key Laboratory of Coarse Cereal Processing (Ministry of Agriculture), Chengdu

8 9 10

2

College of Pharmacy and Biological Engineering, Chengdu University, No. 2025

Chengluo Avenue, Chengdu, 610106, P. R. China

11 12

3

13

Street, Lincoln, NE 68588, USA

Food Science and Technology Department, University of Nebraska-Lincoln, 1400 R

14 15

4

16

Huazhong Agricultural University, No. 1 Shizishan Street, Wuhan, 430070, P. R. China

National R&D Center for Egg Processing, College of Food Science and Technology,

17 18

*Corresponding authors:

19

Dr. Jinqiu Wang, E-mail: [email protected]

20

Dr. Kaustav Majumder, E-mail: [email protected]

21

1

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

22

ABSTRACT

23

Posttranslational N-glycosylation of food proteins plays a critical role in their structure

24

and function. However, the N-glycoproteome of chicken egg yolk (CEY) has not been

25

studied yet. Glycopeptides hydrolyzed from CEY proteins were enriched, and

26

deglycosylation using PNGase F, then were identified using a shotgun glycoproteomics

27

strategy. A total of 217 N-glycosylation sites and 86 glycoproteins were identified in CEY,

28

and these glycoproteins are mainly involved in “binding”, “biological regulation”,

29

“catalytic activity”, and “metabolic process”. Among the identified CEY glycoproteins, 22

30

were recognized as proteases and protease inhibitors, suggesting that a proteinase/inhibitor

31

regulation system exists in CEY; 15 were members of the complement and immune

32

systems, which provide protection against potential threats during hatching. The study

33

provides important structural information about CEY glycoproteins and aids in the

34

understanding of the underlying mechanism of embryo development as well as changes in

35

CEY functional characteristics during storage and processing.

36 37

KEYWORDS

38

Chicken egg yolk; Glycoproteome; N-glycosylation site; Mass spectrometry; Protease;

39

Immunology.

40

2

ACS Paragon Plus Environment

Page 2 of 34

Page 3 of 34

Journal of Agricultural and Food Chemistry

41

INTRODUCTION

42

Posttranslational modifications (PTMs) of proteins have important effects on the

43

correct folding, molecular structure, and physical and chemical properties of proteins.

44

PTMs have been found in food proteins, mainly in glycosylated and phosphorylated forms.

45

It has been proved that phosphorylation of muscle proteins is involved in glycolysis,

46

muscle contraction and the degradation of muscle proteins, thus influencing the quality of

47

the meat.1-3 Another study showed that phosphorylation reduced the allergenicity of cow

48

caseins in children.4 In addition to phosphorylation, the N-glycosylation of food proteins

49

has also been studied. Lactoferrins are one of the most well-studied food proteins, and their

50

N-glycosylation profile shows high heterogeneity, which is considered the potential reason

51

for the characteristic differences between lactoferrins.5-6 Recently, omics analyses of N-

52

glycoproteins in human colostrum and mature milk were performed, and a total of 133 N-

53

glycosylation sites on 73 milk whey proteins and 912 N-glycosylation sites on 506 milk fat

54

globule membrane proteins were identified; these results provided basic and important

55

information on the understanding of the role of milk protein glycosylation during infant

56

development.7-8

57

N-glycosylation also plays an important role in the structure and properties of chicken

58

egg proteins and is involved in the egg white thinning, egg allergenicity, antibacterial

59

properties and embryo protection.9-12 Therefore, the N-glycosylation profiles of egg

60

proteins need to be well characterized. In our previous work, the N-glycosylation proteome

61

of chicken egg white was studied, and a total of 71 N-glycosylation sites in 26 egg white

62

glycoproteins were identified.13 However, the N-glycosylation proteome of chicken egg

63

yolk (CEY) has not been investigated. Egg white proteins are synthesized by oviduct during 3

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

Page 4 of 34

64

egg formation but most of the CEY proteins are synthesized as a precursor protein by the

65

liver of the hen, and then are transferred into the oocyte through bloodstream.14 After

66

enzymatic cleavage, the major fragments of precursor proteins (apolipoprotein B, APOB;

67

vitellogenins, VTGs) and lipids are combined into lipoprotein complexes (high-density

68

lipoprotein, HDL; low-density lipoprotein, LDL), which ultimately form yolk plasma and

69

yolk granule.14-16 The main CEY proteins, including immunoglobulin Y (IgY), HDL, LDL,

70

and phosvitin have all been reported as glycoproteins, and some of their glycosylation sites

71

have been clarified.17-18 Recently, a study of the CEY plasma peptidome demonstrated that

72

a total of 13 CEY plasma proteins contain PTMs, including 9 N-glycosylation sites on

73

apolipoprotein B, complement C4, histidine-rich glycoprotein, prothrombin, cathepsin EA-

74

like protein, and vitellogenin-2.19 The effect of glycosylation on the function of some CEY

75

proteins has been studied. A previous study revealed that the removal of N-glycan changed

76

the conformation, storage stability, and resistance to guanidine hydrochloride and pepsin

77

digestion of the IgY molecule, suggesting that N-glycosylation plays an important role in

78

the molecular structure and function of IgY.20

79

However, most of the other CEY glycoproteins have not been investigated. Therefore,

80

the N-glycosylation proteome of chicken egg yolk was studied in the present work. The

81

CEY proteins were digested with trypsin, and the glycopeptides were enriched using a

82

hydrophilic microcolumn, followed by deglycosylation using PNGase F in H218O water.

83

The

84

chromatography/nanoelectrospray ionization/mass spectrometry (UPLC-NSI-MS/MS),

85

and the N-glycosylation sites were identified based on the MS/MS data via the MaxQuant

deglycopeptides

were

identified

using

4

ACS Paragon Plus Environment

ultra-performance

liquid

Page 5 of 34

Journal of Agricultural and Food Chemistry

86

software. In addition, the function of the identified CEY glycoproteins were annotated

87

through Gene Ontology analysis.

88 89 90

MATERIALS AND METHODS

91

Chicken Egg Yolk (CEY)

92

Fresh chicken eggs laid within 24 h from White Leghorns (about 50-week-old, cage,

93

standard diets) were collected in the morning from the poultry farm of Sichuan Sundaily

94

Village Ecological Food Co., Ltd. (Mianyang, Sichuan) and were used in the study. The

95

fresh egg was broken, and the CEY was separated and rolled on a filter paper to eliminate

96

the egg white. After the chicken egg white was absorbed and the vitelline membrane was

97

adhered by the filter paper, a pipet tip was used to penetrate the vitelline membrane, and

98

the CEY was pulled.21 The yolks of 15 eggs from 3 repeated sampling were homogenized,

99

the mixed CEY sample was frozen and stored with liquid nitrogen until further analysis.

100

Protein Extraction

101

A standard protein extraction process was employed. In detail, the frozen CEY

102

samples (1 mL) was ground with liquid nitrogen into powder and then transferred to a 10-

103

mL centrifuge tube, and four volumes of lysis buffer (8 mol/L urea, 1% Triton-100, 10

104

mmol/L dithiothreitol, 10 μmol/L trichostatin A and 50 mmol/L nicotinamide) were added

105

to the tube, followed by sonicating three times (150 W, 30 s each time) on ice using a high-

106

intensity ultrasonic processor (JY92-II, Ningbo Scientz Biotechnology Co., Ltd., Ningbo,

107

China). The remaining insolubles were removed by centrifugation at 12,000 ×g at 4°C for

108

10 min. Then, the supernatant was precipitated overnight with the addition of 5 volumes of 5

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

109

0.1 mol/L ammonium acetate/methanol. After centrifugation at 12,000 ×g and 4°C for 10

110

min, the supernatant was discarded. The remaining precipitate was washed with cold

111

acetone three times. The protein was redissolved in 8 mol/L urea, and the protein

112

concentration was determined with a BCA kit according to the manufacturer’s

113

instructions.22-23

114

Digestion, Enrichment, and Deglycosylation of CEY Glycopeptides

115

The CEY proteins were processed based on a previous protocol with modifications.13,

116

24

117

for 30 min at 56°C and alkylated with 11 mmol/L iodoacetamide for 30 min at room

118

temperature in darkness. After diluting by adding 100 mM NH4HCO3 to urea with

119

concentration less than 2 mol/L, the CEY proteins were digested using trypsin (Sigma-

120

Aldrich) at a 1:50 trypsin-to-protein mass ratio for the first digestion overnight and 1:100

121

trypsin-to-protein mass ratio for a second 4-h digestion.

Briefly, the CEY protein solution (2 mg/mL) was reduced with 5 mmol/L dithiothreitol

122

The hydrolyzed CEY solution (50 μL) was freeze-dried and then reconstituted in 50

123

μL enrichment buffer (80% acetonitrile and 1% trifluoroacetic acid). The solution was

124

transferred to a hydrophilic interaction liquid chromatography microcolumn (HILIC,

125

SeQuant, Southborough, MA, USA) and centrifuged at 4000 ×g for 15 min, and then, the

126

hydrophilic microcolumn was washed 3 times with enrichment buffer.25 The glycopeptides

127

were eluted with 10% acetonitrile, the eluate was collected and lyophilized. Afterwards,

128

the glycopeptides were reconstituted in 50 μL of the 50 mmol/L ammonium bicarbonate

129

buffer, which was prepared with H218O, and 2 μg of PNGase F (Roche, 11365185001,

130

Mannheim, Germany) was added and incubated at 37°C overnight.13 Finally, the salt was

6

ACS Paragon Plus Environment

Page 6 of 34

Page 7 of 34

Journal of Agricultural and Food Chemistry

131

removed according to the C18 ZipTips instructions (Millipore, Billerica, MA, USA), and

132

the enriched CEY glycopeptides were lyophilized.

133

Liquid Chromatography Tandem Mass Spectrometry (LC-MS/MS) Analysis

134

Liquid chromatography separation and mass spectrometric measurement were

135

performed via UPLC-NSI-MS/MS using a Thermo Scientific™ Orbitrap Fusion™

136

Tribrid™ equipped with an EASY-nLC 1000 UPLC system (Thermo Fisher Scientific,

137

Bremen, Germany). The enriched CEY glycopeptides were dissolved in 0.1% formic acid

138

(solvent A) and loaded onto a Reprosil-Pur C18 reverse-phase analytical column (1.9 μm

139

particles, inner diameter 75 µm, 15 cm length). The gradient comprised an increase from 5%

140

to 20% solvent B (0.1% formic acid in 98% acetonitrile) over 42 min, 20% to 35% in 12

141

min and climbing to 80% in 3 min, then holding at 80% for the last 3 min. The separation

142

was performed at a constant flow rate of 700 nL/min. After LC separation, the CEY

143

glycopeptides were subjected to a nanoelectrospray ionization (nESI) source followed by

144

mass spectrometry (MS/MS) in the Orbitrap FusionTM. The electrospray voltage of the

145

nESI source was 2.0 kV, the m/z scan range was 350 to 1550 for the full scan, and intact

146

peptides were detected in the Orbitrap at a resolution of 60,000. The mass spectrometer

147

was operated in data-dependent acquisition mode, and the top20 most abundant peptides

148

from the MS data were subjected to fragmentation by high-energy collision-induced

149

decomposition. For the MS/MS analysis, the automatic gain control was set at 5E4, the

150

signal threshold at 5000 ions/s, the maximum injection time at 200 ms, and the dynamic

151

exclusion time for the tandem mass scan at 15 s.

152

Tandem Mass Spectrometry Data Analysis

7

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

153

The data were analyzed using MaxQuant software (1.5.2.8) and compared against the

154

UniProt and NCBI databases (organism: Gallus gallus, date: 2018.03.09). Trypsin was

155

specified as the cleavage enzyme, allowing up to 4 missing cleavages. The mass tolerances

156

for the precursor ions were set as 20 ppm in the first search and 5 ppm in the main search,

157

and the mass tolerance for the fragment ions was set as 0.02 Da. Carbamidomethyl on Cys

158

was specified as fixed modification, and deamidation with 18O (N) and oxidation of Met

159

were specified as variable modifications. For identification, the false discovery rate was

160

specified as 1% at both the peptide and site levels.

161

Bioinformatics Analysis

162

The potential N-glycosylation sites of the identified CEY glycoproteins were

163

predicted by the NetNGlyc 1.0 Server (http://www.cbs.dtu.dk/services/NetNGlyc/) based

164

on the sequences obtained from the UniProt database (http://www.uniprot.org/). Gene

165

Ontology (GO) analysis of the identified CEY glycoproteins was derived from the UniProt-

166

Gene Ontology Annotation database (http://www.ebi.ac.uk/GOA). The locations of the N-

167

glycosylation sites of Vitellogenins (VTGs) were visualized and presented by Illustrator

168

for Biological Sequences (IBS, version 1.0).26

169 170

RESULTS AND DISCUSSION

171

Characterization of the identified N-glycosylation sites in CEY

172

Glycopeptides derived from the CEY proteins were enriched by the HILIC column

173

and deglycosylated by PNGase F in H218O, resulting in the conversion of the originally

174

glycosylated asparagine to aspartic acid (occupied by -18OH) with a 2.99 Da increase in

175

molecular weight, which can be exploited to identify the N-glycosylation sites.27-28 Of the 8

ACS Paragon Plus Environment

Page 8 of 34

Page 9 of 34

Journal of Agricultural and Food Chemistry

176

28883 spectra produced by the mass spectrometer, a total of 4341 spectra matched with the

177

alignment protein, and 1109 of these spectra were identified as glycopeptides. All the

178

glycopeptides were identified with high precision, and the mass tolerance of the peptide

179

ions was less than 5 ppm (Figure 1A). After clearing the repeated sequence, a total of 208

180

unique CEY glycopeptides were obtained, which contain 217 N-glycosylation sites and

181

belong to 86 glycoproteins (Table S1).

182

In the present study, the number of MS/MS spectra that matched a certain unique

183

glycopeptide was called the “MS/MS count”, which was used to approximate the

184

abundance of the glycopeptides. Among the identified CEY glycopeptides, about half of

185

the identified CEY glycopeptides had an MS/MS count of 1, indicating that these N-

186

glycosylation sites are low in abundance. However, 17% of the identified CEY

187

glycopeptides had a large MS/MS count (5 or more), indicating that these N-glycopeptides

188

are relatively high in abundance (Figure 1B).

189

The canonical N-glycosylation sequence is N-X-[S/T] (where X is not proline). Here,

190

Web-Logo was employed to visualize the sequence motifs around the N-glycosylation sites

191

(5 amino acids on each side).29 As shown in Figure 1C, threonine (T) and serine (S) were

192

significantly overrepresented at the +2 position, suggesting that most of the identified egg

193

white N-glycosylation sites were localized at the canonical sequence. More specifically,

194

based on the unique peptide, the N-glycosylation sites that matched with N-X-T (75 unique

195

peptides, accounting for 35%) occurred more frequently than those that matched with N-

196

X-S (62 unique peptides, 29%). However, based on the MS/MS count, the number of

197

MS/MS spectra with N-X-T (391 MS/MS count, accounting for 36%) was lower than that

9

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

198

with N-X-S (567 MS/MS count, 52%) (Figure 1C). This indicated that CEY glycoproteins

199

that contain the N-X-S sequence have a higher abundance.

200

Gene Ontology analysis of the identified CEY glycoproteins

201

The identified CEY glycoproteins were classified according to their GO annotation.

202

As shown in Figure 2, in terms of “molecular function,” 35 CEY glycoproteins are involved

203

in “binding” (GO:0005488) at GO level 2. For the development of the embryo, CEY is rich

204

in a variety of nutrients, such as lipids, proteins, minerals, and bioactive molecules.30

205

Therefore, many CEY glycoproteins have binding abilities, including “ion binding” (19

206

glycoproteins), “protein binding” (15), and “lipid binding” (5) at GO level 3. A total of 22

207

CEY glycoproteins are involved in “catalytic activity” (GO:0003824), which mainly refers

208

to “hydrolase activity” (GO:0016787, 17) and “catalytic activity, acting on a protein”

209

(GO:0140096, 16) at GO level 3; in particular, 14 CEY glycoproteins are classified as

210

having “peptidase activity” (GO:0070011). These results indicated that many CEY

211

glycoproteins are involved in protein hydrolysis/digestion, which can provide peptides and

212

amino acids for embryonic development. Another well-represented category is “molecular

213

function regulator” (19); notably, 17 of these 19 CEY glycoproteins were annotated as

214

having “endopeptidase inhibitor activity” (GO:0004866). The number of inhibitors was

215

equivalent to that of hydrolases, suggesting that there is a proteinase/inhibitor regulation

216

system in CEY.

217

In terms of “biological process” at GO level 2, the CEY glycoproteins related to

10

ACS Paragon Plus Environment

Page 10 of 34

Page 11 of 34

Journal of Agricultural and Food Chemistry

218

“biological regulation” (GO:0065007, 39), “metabolic process” (GO:0008152, 21),

219

“cellular process” (GO:0009987, 19), “response to stimulus” (GO:0050896, 18), and

220

“immune system process” (GO:0002376, 10) accounted for the largest portion. As an

221

independent organ, the chicken egg needs to perform a variety of biological processes alone,

222

and thus, many CEY glycoproteins are involved in regulatory and metabolic processes.

223

During hatching, the chicken egg is under different types of stress and environmental

224

factors;31 therefore, proteins that respond to stimuli account for a large proportion of the

225

CEY glycoproteins.

226

Regarding the cellular components (at GO level 2), the most well-represented

227

category was “extracellular region part” (GO:0044421, 40), “cell part” (GO:0044464, 25),

228

“protein-containing complex” (GO:0032991, 13). The egg yolk plasma and granules,

229

which account for most of the egg yolk, do not belong to the cell part, and thus most of the

230

CEY glycoproteins are classified as “extracellular region part.” Low-density lipoprotein,

231

high-density lipoprotein, and yolk granules are all complexes, and thus “protein-containing

232

complex” accounts for a high proportion of the CEY glycoproteins.

233

The primary glycoproteins in CEY

234

Of the 86 identified CEY N-glycoproteins, 39 (approximately 49%) carried single N-

235

glycosylation sites, and the other 47 proteins had multiple N-glycosylation sites.

236

Apolipoprotein B (APOB) was the most heavily N-glycosylated protein in CEY with 35

237

N-glycosylation sites, followed by Vitellogenin-1 (VTG1), Vitellogenin-2 (VTG2) and

238

serum albumin with 15, 13 and 7 N-glycosylation sites, respectively (Figure 3A). However, 11

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

239

according to the abundance (MS/MS count) of the CEY glycopeptides, APOB is the most

240

abundant glycoprotein in CEY (443 MS/MS count), followed by VTG-2 (163 MS/MS

241

count), VTG-1 (89 MS/MS count) and IgY (67 MS/MS count, including light chain P20763

242

and heavy chain AHX37590.1) (Figure 3B).

243

Apolipoprotein B

244

As the most heavily glycosylated protein in CEY, APOB is a 523 kDa (4,631 AA)

245

protein that is enzymatically cleaved and results in 7 kinds of fragments during its transfer

246

from chicken serum into the yolk.14 These APOB fragments, together with apovitellenin-1

247

(which was also identified as a glycoprotein with an N-glycosylation site at N75 in the

248

present study) and lipids, assemble and form CEY LDL.15-16 APOB contains 19 potential

249

N-glycosylation sites predicted by NetNGlyc 1.0, two of which have been identified

250

recently (N1369 and N3697).19 Here, a total of 35 N-glycosylation sites of APOB were

251

identified, and 12 of them were also predicted to be modified by NetNGlyc 1.0, while the

252

other 23 sites were unpredicted, and all of them localized at the noncanonical sequence.

253

Meanwhile, 7 of the predicted N-glycosylation sites of APOB were not identified in the

254

current experiment (Figure 4). These results indicated that the low-frequency noncanonical

255

N-glycosylation modifications cannot be predicted, while the predicted N-glycosylation

256

sites based on the canonical N-X-[S/T] sequence need to be confirmed through

257

experimental analysis. However, the 12 APOB N-glycopeptides, which were both

258

predicted and identified, had a large MS/MS count (Table S1), suggesting that high-

259

frequency N-glycosylation tends to occupy the canonical N-X-[S/T] sequence.

260

The large number of low-frequency noncanonical N-glycosylation modifications was

261

a noteworthy phenomenon. How does this low-frequency N-glycosylation modification 12

ACS Paragon Plus Environment

Page 12 of 34

Page 13 of 34

Journal of Agricultural and Food Chemistry

262

affect the structure and function of proteins? Is it a random and meaningless modification

263

or important for biological processes? These questions should be of concern and studied in

264

the future.

265

Vitellogenins

266

VTG1, VTG2 and VTG3 are the three major vitellogenin proteins in CEY, and they

267

have similar domains (lipovitellin-1, lipovitellin-2, phosvitin and YGPs) but low sequence

268

homology (34% identity by the Alignment analysis of UniPort). VTGs are synthesized in

269

the liver and transferred to the CEY through the blood circulation, and then they are cleaved

270

to lipovitellin-1, phosvitin, lipovitellin-2 and YGP42/40/30. Lipovitellin-1, lipovitellin-2

271

and lipids compose the CEY HDL, and HDL together with phosvitins assemble the CEY

272

granules.14, 30

273

According to sequence analysis, VTG1, VTG2 and VTG3 contain 9, 9 and 5 potential

274

N-glycosylation sites, respectively. However, similar to the case of APOB, only some of

275

these predicted sites (4 sites of VTG1, 5 sites of VTG2, 1 sites of VTG3) were confirmed

276

to have undergone N-glycosylation in the present experiment. In addition, 11 unpredicted

277

sites on VTG1, 8 unpredicted sites on VTG2 and 5 unpredicted sites on VTG3 (all of which

278

were located at the noncanonical sequence) were identified as N-glycosylation sites.

279

Among the 15 total identified N-glycosylation sites of VTG1, 7 are located on lipovitellin-1,

280

8 are located on YGP42, and no sites are located on lipovitellin-2. In contrast, among the

281

13 total identified N-glycosylation sites of VTG2, 7 are located on lipovitellin-1, 1 is

282

located on lipovitellin-1, and 5 are located on YGP40 (Figure 5). The difference in the

283

distribution of these N-glycosylation sites may be an important reason for the difference in

284

the properties of VTG1 and VTG2. Another noteworthy result was that the total number of 13

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

Page 14 of 34

285

MS/MS counts that matched with VTG1, VTG2 and VTG3 was 89, 163 and 14,

286

respectively, suggesting that VTG2 was the most abundant CEY VTG, which was

287

consistent with previous research.32-33

288

Although phosvitin has been reported to undergo glycosylation on the asparagine at

289

1280,34 none of the N-glycosylation sites was identified in the phosvitin region of the VTGs

290

in the present study. This may be because the high phosphorylation weakens the affinity

291

between the phosvitin peptides and HILIC column.

292

Protease and its inhibitor

293

A total of 8 and 14 identified CEY glycoproteins were recognized as proteases and

294

protease

295

carboxypeptidase B2, cathepsin EA-like protein, gamma-glutamyl hydrolase, plasminogen

296

(F1NWX6, and R4GMH5), protein C, and prothrombin; the protease inhibitors included:

297

antithrombin-III, kininogen 1, serpin family A member 10, serpin family G member 1,

298

ovoinhibitor, and several uncharacterized proteins (A0A1D5PBU0, A0A1D5PCD2,

299

E1C206, E1C7T1, F1NEQ4, F1NK40, F1NL38, F1NPN5, and F1P587) (Table S1). Of

300

these proteases and its inhibitors, carboxypeptidase was the most heavily N-glycosylated

301

proteases, with 2 N-glycosylation sites and an MS/MS count of 9; while ovoinhibitor was

302

the most heavily N-glycosylated inhibitors, with 4 N-glycosylation sites and an MS/MS

303

count of 15. More specifically, the ovoinhibitor, a member of the Kazal family of protease

304

inhibitors with inhibitory activity against serine proteinase, contains three predicted N-

305

glycosylation sites at N27, N141 and N461. In our previous study, two N-glycosylation sites

306

at N62 and N141 had been experimentally verified;13 here, another two sites (N193 and N461)

307

were identified to have undergone N-glycosylation.

inhibitors,

respectively.

The

proteases

14

ACS Paragon Plus Environment

included:

carboxypeptidase,

Page 15 of 34

Journal of Agricultural and Food Chemistry

308

Covalently bonded N-glycans can alter the folding and the surface properties of the

309

protease or inhibitor, thus affecting the recognition, specificity, and binding affinity.

310

Therefore, N-glycosylation modifications are important for the stability, activation, and

311

catalytic activity of proteases and their inhibitors.35 However, the effects of glycosylation

312

on CEY proteases and inhibitors are not yet clear, and they remain one of the concerned

313

topics in the field of egg science and need to be studied in the future.

314

Members of the Complement and Immune Systems

315

CEY is an important part of the embryo's defensive system and needs to provide

316

protection against potential internal or invasive threats. Therefore, many CEY

317

glycoproteins are involved in the immune and complement systems, which allow for the

318

direct killing of microbes, the disposal of ineffective complexes, and the self-regulation of

319

immune processes. Eight CEY glycoproteins were identified as components of the

320

complement system: complement C2, C3, C4, C6, C8, complement factors H and I, as well

321

as an uncharacterized protein (F1P587, gene name is LOC418892). Complement C3 was

322

most heavily N-glycosylated, with two N-glycosylation sites at N957 and N1428, which were

323

predicted by NetNGlyc 1.0. The matching MS/MS counts of N957 and N1428 were 7 and 10,

324

suggesting that complement C3 is a highly abundant complement glycoprotein in CEY.

325

The alignment analysis (provided by UniProt) revealed that the sequence identity between

326

CEY complement C3 and human complement C3 (P01024) is 53.8%; however, these two

327

proteins do not share an N-glycosylation site (data not shown). The high sequence identity

328

and low modification similarity of homologous glycoproteins have also been found in our

329

previous study of chicken egg white ovomacroglobulin.24 This phenomenon is interesting

330

and noteworthy, showing a balance of the conservation and diversity of protein molecules. 15

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

331

There were 7 identified immune-related glycoproteins, including the light chain (Ig

332

lambda chain C region, P20763) and heavy chain (AHX37590.1) of IgY, Fc fragment of

333

IgG binding protein (A0A1D5P6F4), Ig mu chain C region (P01875), immunoglobulin

334

heavy chain variable region (CAO79238.1), and immunoglobulin light chain variable

335

region (AJQ23642.1). Among these glycoproteins, IgY was the most abundant immune-

336

related glycoprotein, containing 3 N-glycosylation sites (N156, N178 and N255 of

337

AHX37590.1) on the heavy chain constant region and 2 N-glycosylation sites (N61 and N90

338

of P20763) at the light chain constant region. The total MS/MS count of IgY glycopeptides

339

was 67, suggesting that IgY is a highly abundant immune glycoprotein in CEY. In a

340

previous work, two sites (N308 and N407) at the heavy chain of IgY were identified as N-

341

glycosylated, which corresponded the N156 and N255 of AHX37590.1 in the present study.18

342

In addition to these 2 sites, other 3 newly identified N-glycosylation sites of IgY provided

343

additional information of the glycosylation profile of IgY.

344

Other CEY glycoproteins

345

Ovotransferrin, which is the second most abundant protein in chicken egg white, has

346

also been found in egg yolk.19, 36-37 The N-glycosylation sites at N492 and N637 had been

347

reported in a previous study,13 and another two N-glycosylation sites at N233 and N514 were

348

found in the present study.

349

Albumin, also known as α-livetin, which is transferred from the blood serum to CEY,

350

has been identified as one of the major egg allergens. In albumin, MS/MS analysis

351

ascertained seven N-glycosylation sites, namely, N126, N158, N291, N393, N400, N498, and N500,

352

while only the N500 residue was predicted by NetNGlyc 1.0. Among the six unpredicted N-

16

ACS Paragon Plus Environment

Page 16 of 34

Page 17 of 34

Journal of Agricultural and Food Chemistry

353

glycosylation sites, N291 and N498 have MS/MS counts of 4 and 2, respectively, implying

354

that these two sites are relatively high in abundance.

355

In summary, the N-glycoproteome of CEY was studied using a shotgun

356

glycoproteomics strategy, a total of 86 glycoproteins were identified in CEY and their

357

biological function were annotated and discussed. The N-glycosylated modification, as

358

well as other types of post-translational modifications, are extremely important for

359

understanding the structures, functions, and bioactivities of the food proteins. Because

360

these modifications could alter the molecular weight, surface charge and folding of proteins,

361

or as a marker for interaction and signaling. Therefore, in the research of food proteome,

362

modification of proteins is another basic dimension in addition to the protein species and

363

abundance, which need to be given more attention in the further.

364 365

Abbreviations Used

366

APOB, Apolipoprotein B; CEY, chicken egg yolk; GO, Gene Ontology; HDL, high-

367

density lipoprotein; HILIC, Hydrophilic interaction liquid chromatography; IgY,

368

immunoglobulin Y; LDL, low-density lipoprotein; nESI, nanoelectrospray ionization;

369

PTMs, Posttranslational modifications; VTG, Vitellogenin.

370 371

Funding

372

National Key Research and Development Program of China (2018YFD0400302)

373

National Natural Science Foundation of China (No. 31601490; No. 31871732)

374

Supporting Information Description

17

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

375

The details of identified CEY N-glycopeptides and N-glycoproteins resulting from a

376

database search of LC-MS/MS data are shown in Supporting file (Table S1). This

377

information is available free of charge via the Internet at http://pubs.acs.org.

378 379

Notes

380

The authors declare no competing financial interests.

381 382

REFERENCES

383

1.

384 385 386

phosphoproteomic analysis among muscles of different color stability using tandem mass tag labeling. Food Chem. 2018, 249, 8-15. 2.

387 388 389

111. 3.

4.

Allergy to Goat and Sheep Milk. J. Invest. Allerg. Clin. 2011, 21, 398-400. 5.

400 401

Karav, S.; German, J. B.; Rouquie, C.; Le Parc, A.; Barile, D., Studying Lactoferrin N-Glycosylation. Int. J. Mol. Sci. 2017, 18, 870.

6.

397 398 399

Cases, B.; Garcia-Ara, C.; Boyano, M. T.; Perez-Gordo, M.; Pedrosa, M.; Vivanco, F.; Quirce, S.; Pastor-Vargas, C., Phosphorylation Reduces the Allergenicity of Cow Casein in Children With Selective

395 396

Li, Z.; Li, M.; Du, M. T.; Shen, Q. W. E.; Zhang, D. Q., Dephosphorylation enhances postmortem degradation of myofibrillar proteins. Food Chem. 2018, 245, 233-239.

392 393 394

Liu, M. S.; Wei, Y. C.; Li, X.; Quek, S. Y.; Zhao, J.; Zhong, H. Z.; Zhang, D. Q.; Liu, Y. F., Quantitative phosphoproteomic analysis of caprine muscle with high and low meat quality. Meat Sci. 2018, 141, 103-

390 391

Li, Z.; Li, M.; Li, X.; Xin, J. Z.; Wang, Y.; Shen, Q. W. W.; Zhang, D. Q., Quantitative

Le Parc, A.; Karav, S.; Rouquie, C.; Maga, E. A.; Bunyatratchata, A.; Barile, D., Characterization of recombinant human lactoferrin N-glycans expressed in the milk of transgenic cows. Plos One 2017, 12, e0171477.

7.

Cao, X. Y.; Song, D. H.; Yang, M.; Yang, N.; Ye, Q.; Tao, D. B.; Liu, B.; Wu, R. N.; Yue, X. Q., Comparative Analysis of Whey N-Glycoproteins in Human Colostrum and Mature Milk Using Quantitative Glycoproteomics. J. Agric. Food Chem. 2017, 65, 10360-10367. 18

ACS Paragon Plus Environment

Page 18 of 34

Page 19 of 34

402

Journal of Agricultural and Food Chemistry

8.

403 404 405

Cao, X. Y.; Kang, S. M.; Yang, M.; Li, W. X.; Wu, S. Y.; Han, H. J.; Meng, L. H.; Wu, R.; Yue, X. Q., Quantitative N-glycoproteomics of milk fat globule membrane in human colostrum and mature milk reveals changes in protein glycosylation during lactation. Food Funct. 2018, 9, 1163-1172.

9.

Sun, X.; Gänzle, M. G.; Wu, J., Identification and Characterization of Glycopeptides from Egg Protein

406 407

Ovomucin with Anti-Agglutinating Activity against Porcine K88 Enterotoxigenic Escherichia coli

408

10. Offengenden, M.; Wu, J., Egg white ovomucin gels: structured fluids with weak polyelectrolyte

409 410 411 412 413 414 415

Strains. J. Agric. Food Chem. 2017, 65, 777-783.

properties. RSC Adv. 2013, 3, 910-917. 11. Mine, Y.; Yang, M., Recent advances in the understanding of egg allergens: basic, industrial, and clinical perspectives. J. Agric. Food Chem. 2008, 56, 4874-4900. 12. Benede, S.; Lopez-Exposito, I.; Molina, E.; Lopez-Fandino, R., Egg proteins as allergens and the effects of the food matrix and processing. Food Funct. 2015, 6, 694-713. 13. Geng, F.; Wang, J. Q.; Liu, D. Y.; Jin, Y. G.; Ma, M. H., Identification of N-Glycosites in Chicken Egg White Proteins Using an Omics Strategy. J. Agric. Food Chem. 2017, 65, 5357-5364.

416

14. Hatta H.; Kapoor M. P.; Juneja L.R. Bioactive Components in Egg Yolk. in Egg bioscience and

417

biotechnology. Mine, Y. Eds.; John Wiley & Sons Inc.: Hoboken, New Jersey, 2008; pp 185-192.

418

15. Jolivet, P.; Boulard, C.; Beaumal, V.; Chardot, T.; Anton, M., Protein components of low-density

419

lipoproteins purified from hen egg yolk. J. Agric. Food Chem. 2006, 54, 4424-4429.

420

16. Jolivet, P.; Boulard, C.; Chardot, T.; Anton, M., New insights into the structure of apolipoprotein B

421 422

from low-density lipoproteins and identification of a novel YGP-like protein in hen egg yolk. J. Agric.

423

17. Mine, Y.; D’Silva, I. Bioactive Components in Egg White. In Egg bioscience and biotechnology, Mine,

424 425 426 427 428

Food Chem. 2008, 56, 5871-5879.

Y. Eds.; John Wiley & Sons, Inc.: Hoboken, New Jersey, 2008; pp 141-184. 18. Sheng, L.; He, Z. J.; Liu, Y. P.; Ma, M. H.; Cai, Z. X., Mass spectrometry characterization for Nglycosylation of immunoglobulin Y from hen egg yolk. Int. J. Biol. Macromol. 2018, 108, 277-283. 19. Arena, S.; Scaloni, A., An Extensive Description of the Peptidomic Repertoire of the Hen Egg Yolk Plasma. J. Agric. Food Chem. 2018, 66, 3239-3255.

19

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

429

20. Sheng, L.; He, Z. J.; Chen, J. H.; Liu, Y. F.; Ma, M. H.; Cai, Z. X., The impact of N-glycosylation on

430 431

conformation and stability of immunoglobulin Y from egg yolk. Int. J. Biol. Macromol. 2017, 96, 129136.

432

21. Tong, C. Y.; Geng, F.; He, Z. J.; Cai, Z. X.; Ma, M. H., A simple method for isolating chicken egg yolk

433 434

immunoglobulin using effective delipidation solution and ammonium sulfate. Poultry Sci. 2015, 94,

435 436 437 438

104-110. 22. Washburn, M. P.; Wolters, D.; Yates III, J. R., Large-scale analysis of the yeast proteome by multidimensional protein identification technology. Nat. biotechnol. 2001, 19, 242-247. 23. Wiśniewski, J. R.; Zougman, A.; Nagaraj, N.; Mann, M., Universal sample preparation method for proteome analysis. Nat. methods 2009, 6, 359-362.

439

24. Geng, F.; Huang, X.; Majumder, K.; Zhu, Z. H.; Cai, Z. X.; Ma, M. H., Mass Spectrometry and Two-

440 441

Dimensional Electrophoresis To Characterize the Glycosylation of Hen Egg White Ovomacroglobulin.

442

25. Zacharias, L. G.; Hartmann, A. K.; Song, E.; Zhao, J.; Zhu, R.; Mirzaei, P.; Mechref, Y., HILIC and

443 444

ERLIC enrichment of glycopeptides derived from breast and brain cancer cells. J. Proteome Res. 2016,

445

26. Liu, W. Z.; Xie, Y. B.; Ma, J. Y.; Luo, X. T.; Nie, P.; Zuo, Z. X.; Lahrmann, U.; Zhao, Q.; Zheng, Y.

446 447

Y.; Zhao, Y.; Xue, Y.; Ren, J., IBS: an illustrator for the presentation and visualization of biological

448

27. Zielinska, D. F.; Gnad, F.; Wiśniewski, J. R.; Mann, M., Precision mapping of an in vivo N-

449

J. Agric. Food Chem. 2015, 63, 8209-8215.

15, 3624-3634.

sequences. Bioinformatics 2015, 31, 3359-3361.

glycoproteome reveals rigid topological and sequence constraints. Cell 2010, 141, 897-907.

450

28. Zielinska, D. F.; Gnad, F.; Schropp, K.; Wiśniewski, J. R.; Mann, M., Mapping N-glycosylation sites

451 452

across seven evolutionarily distant species reveals a divergent substrate proteome despite a common

453

29. Crooks, G. E.; Hon, G.; Chandonia, J.-M.; Brenner, S. E., WebLogo: a sequence logo generator. Genome

454 455 456

core machinery. Mol. Cell 2012, 46, 542-548.

Res. 2004, 14, 1188-1190. 30. Anton, M. Composition and Structure of Hen Egg Yolk, in Bioactive egg compounds; Huopalahti, R., Anton, M., López-Fandiño, R., Schade, R., Eds.; Springer Berlin Heidelberg: 2007; pp 1-32.

20

ACS Paragon Plus Environment

Page 20 of 34

Page 21 of 34

Journal of Agricultural and Food Chemistry

457

31. Guyot, N.; Réhault-Godbert, S., Understanding the natural antibacterial defences of egg white and their

458 459

regulation. In Achieving sustainable production of eggs, Cargill P., Eds.; Burleigh Dodds Science

460

32. Wang, S.-Y.; Williams, D. L., Identification, purification, and characterization of two distinct avian

461 462 463 464 465 466 467 468 469

Publishing: Sawston, UK, 2017; Volume 1, pp 430-464.

vitellogenins. Biochemistry 1980, 19, 1557-1563. 33. Wang, S.; Smith, D. E.; Williams, D. L., Purification of avian vitellogenin III: comparison with vitellogenins I and II. Biochemistry 1983, 22, 6206-6212. 34. Brockbank, R. L.; Vogel, H. J., Structure of the oligosaccharide of hen phosvitin as determined by twodimensional proton NMR of the intact glycoprotein. Biochemistry 1990, 29, 5574-5583. 35. Goettig, P., Effects of Glycosylation on the Enzymatic Activity and Mechanisms of Proteases. Int. J. Mol. Sci. 2016, 17, 1969-1992. 36. Liu, Y.; Qiu, N.; Gao, D.; Ma, M., Comparative proteomic analysis of chicken, duck, and quail egg yolks. Int. J. Food Prop. 2018, 21, 1311-1321.

470

37. Farinazzo A.; Restuccia U.; Bachi A.; Guerrier L.; Fortis F.; Boschetti E.; Fasoli E.; Citterio A.; Righetti

471 472 473

PG. Chicken egg yolk cytoplasmic proteome, mined via combinatorial peptide ligand libraries. J. Chromatog. A. 2009, 1216, 8, 1241-1252.

21

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

474

Figure Captions

475 476

Figure 1. Characteristics of the identified N-glycopeptides in CEY. A, mass error

477

distribution of identified N-glycopeptides; B, distribution of the MS/MS count number of

478

identified N-glycopeptides; C, N-glycosylation unique sequences as derived using Web-

479

Logo; D, distribution of recognized sequence motifs based on the unique sequence or

480

MS/MS count of the identified N-glycopeptides (X≠P).

481 482

Figure 2. GO analysis (level 2) of identified CEY glycoproteins.

483 484

Figure 3. Distribution of identified N-glycosylation sites (A) or matched MS/MS count

485

(B) in CEY proteins.

486 487

Figure 4 Overlapping Venn diagram of predicted and identified N-glycosylation sites of

488

APOB.

489 490

Figure 5. Comparison of the distributions of identified N-glycosylation sites on VTG1 and

491

VTG2.

492

22

ACS Paragon Plus Environment

Page 22 of 34

Page 23 of 34

Journal of Agricultural and Food Chemistry

493 494

Figure 1. Characteristics of the identified N-glycopeptides in CEY. A, mass error

495

distribution of identified N-glycopeptides; B, distribution of the MS/MS count number of

496

identified N-glycopeptides; C, N-glycosylation unique sequences as derived using Web-

497

Logo; D, distribution of recognized sequence motifs based on the unique sequence or

498

MS/MS count of the identified N-glycopeptides (X≠P).

499

23

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

500

Figure 2. GO analysis (level 2) of identified CEY glycoproteins.

501

24

ACS Paragon Plus Environment

Page 24 of 34

Page 25 of 34

Journal of Agricultural and Food Chemistry

502

Figure 3. Distribution of identified N-glycosylation sites (A) or matched MS/MS count

503

(B) in CEY proteins.

504

25

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

505

Figure 4 Overlapping Venn diagram of predicted and identified N-glycosylation sites of

506

APOB.

507

26

ACS Paragon Plus Environment

Page 26 of 34

Page 27 of 34

Journal of Agricultural and Food Chemistry

508

Figure 5. Comparison of the distribution of identified N-glycosylation sites on VTG1 and

509

VTG2.

510

27

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

511

Graphic for table of contents

512

28

ACS Paragon Plus Environment

Page 28 of 34

Page 29 of 34

Journal of Agricultural and Food Chemistry

Figure 1. Characteristics of the identified N-glycopeptides in CEY. A, mass error distribution of identified Nglycopeptides; B, distribution of the MS/MS count number of identified N-glycopeptides; C, N-glycosylation unique sequences as derived using Web-Logo; D, distribution of recognized sequence motifs based on the unique sequence or MS/MS count of the identified N-glycopeptides (X≠P). 190x139mm (300 x 300 DPI)

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

Figure 2. GO analysis (level 2) of identified CEY glycoproteins. 153x94mm (300 x 300 DPI)

ACS Paragon Plus Environment

Page 30 of 34

Page 31 of 34

Journal of Agricultural and Food Chemistry

Figure 3. Distribution of identified N-glycosylation sites (A) or matched MS/MS count (B) in CEY proteins. 123x137mm (300 x 300 DPI)

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

Figure 4 Overlapping Venn diagram of predicted and identified N-glycosylation sites of APOB. 74x54mm (300 x 300 DPI)

ACS Paragon Plus Environment

Page 32 of 34

Page 33 of 34

Journal of Agricultural and Food Chemistry

Figure 5. Comparison of the distributions of identified N-glycosylation sites on VTG1 and VTG2. 190x64mm (300 x 300 DPI)

ACS Paragon Plus Environment

Journal of Agricultural and Food Chemistry

Graphic for table of contents 168x107mm (300 x 300 DPI)

ACS Paragon Plus Environment

Page 34 of 34