Subscriber access provided by NEW MEXICO STATE UNIV
Omics Technologies Applied to Agriculture and Food
N-glycoproteomic Analysis of Chicken Egg Yolk Fang Geng, Yunxiao Xie, Jinqiu Wang, Kaustav Majumder, Ning Qiu, and Meihu Ma J. Agric. Food Chem., Just Accepted Manuscript • DOI: 10.1021/acs.jafc.8b04492 • Publication Date (Web): 11 Oct 2018 Downloaded from http://pubs.acs.org on October 15, 2018
Just Accepted “Just Accepted” manuscripts have been peer-reviewed and accepted for publication. They are posted online prior to technical editing, formatting for publication and author proofing. The American Chemical Society provides “Just Accepted” as a service to the research community to expedite the dissemination of scientific material as soon as possible after acceptance. “Just Accepted” manuscripts appear in full in PDF format accompanied by an HTML abstract. “Just Accepted” manuscripts have been fully peer reviewed, but should not be considered the official version of record. They are citable by the Digital Object Identifier (DOI®). “Just Accepted” is an optional service offered to authors. Therefore, the “Just Accepted” Web site may not include all articles that will be published in the journal. After a manuscript is technically edited and formatted, it will be removed from the “Just Accepted” Web site and published as an ASAP article. Note that technical editing may introduce minor changes to the manuscript text and/or graphics which could affect content, and all legal disclaimers and ethical guidelines that apply to the journal pertain. ACS cannot be held responsible for errors or consequences arising from the use of information contained in these “Just Accepted” manuscripts.
is published by the American Chemical Society. 1155 Sixteenth Street N.W., Washington, DC 20036 Published by American Chemical Society. Copyright © American Chemical Society. However, no copyright claim is made to original U.S. Government works, or works produced by employees of any Commonwealth realm Crown government in the course of their duties.
Page 1 of 34
Journal of Agricultural and Food Chemistry
N-glycoproteomic Analysis of Chicken Egg Yolk
1 2 3
Fang Geng1,2, Yunxiao Xie1,2, Jinqiu Wang1,2*, Kaustav Majumder3*, Ning Qiu4, Meihu
4
Ma4
5 6
1
7
University, No. 2025 Chengluo Avenue, Chengdu, 610106, P. R. China
Key Laboratory of Coarse Cereal Processing (Ministry of Agriculture), Chengdu
8 9 10
2
College of Pharmacy and Biological Engineering, Chengdu University, No. 2025
Chengluo Avenue, Chengdu, 610106, P. R. China
11 12
3
13
Street, Lincoln, NE 68588, USA
Food Science and Technology Department, University of Nebraska-Lincoln, 1400 R
14 15
4
16
Huazhong Agricultural University, No. 1 Shizishan Street, Wuhan, 430070, P. R. China
National R&D Center for Egg Processing, College of Food Science and Technology,
17 18
*Corresponding authors:
19
Dr. Jinqiu Wang, E-mail:
[email protected] 20
Dr. Kaustav Majumder, E-mail:
[email protected] 21
1
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
22
ABSTRACT
23
Posttranslational N-glycosylation of food proteins plays a critical role in their structure
24
and function. However, the N-glycoproteome of chicken egg yolk (CEY) has not been
25
studied yet. Glycopeptides hydrolyzed from CEY proteins were enriched, and
26
deglycosylation using PNGase F, then were identified using a shotgun glycoproteomics
27
strategy. A total of 217 N-glycosylation sites and 86 glycoproteins were identified in CEY,
28
and these glycoproteins are mainly involved in “binding”, “biological regulation”,
29
“catalytic activity”, and “metabolic process”. Among the identified CEY glycoproteins, 22
30
were recognized as proteases and protease inhibitors, suggesting that a proteinase/inhibitor
31
regulation system exists in CEY; 15 were members of the complement and immune
32
systems, which provide protection against potential threats during hatching. The study
33
provides important structural information about CEY glycoproteins and aids in the
34
understanding of the underlying mechanism of embryo development as well as changes in
35
CEY functional characteristics during storage and processing.
36 37
KEYWORDS
38
Chicken egg yolk; Glycoproteome; N-glycosylation site; Mass spectrometry; Protease;
39
Immunology.
40
2
ACS Paragon Plus Environment
Page 2 of 34
Page 3 of 34
Journal of Agricultural and Food Chemistry
41
INTRODUCTION
42
Posttranslational modifications (PTMs) of proteins have important effects on the
43
correct folding, molecular structure, and physical and chemical properties of proteins.
44
PTMs have been found in food proteins, mainly in glycosylated and phosphorylated forms.
45
It has been proved that phosphorylation of muscle proteins is involved in glycolysis,
46
muscle contraction and the degradation of muscle proteins, thus influencing the quality of
47
the meat.1-3 Another study showed that phosphorylation reduced the allergenicity of cow
48
caseins in children.4 In addition to phosphorylation, the N-glycosylation of food proteins
49
has also been studied. Lactoferrins are one of the most well-studied food proteins, and their
50
N-glycosylation profile shows high heterogeneity, which is considered the potential reason
51
for the characteristic differences between lactoferrins.5-6 Recently, omics analyses of N-
52
glycoproteins in human colostrum and mature milk were performed, and a total of 133 N-
53
glycosylation sites on 73 milk whey proteins and 912 N-glycosylation sites on 506 milk fat
54
globule membrane proteins were identified; these results provided basic and important
55
information on the understanding of the role of milk protein glycosylation during infant
56
development.7-8
57
N-glycosylation also plays an important role in the structure and properties of chicken
58
egg proteins and is involved in the egg white thinning, egg allergenicity, antibacterial
59
properties and embryo protection.9-12 Therefore, the N-glycosylation profiles of egg
60
proteins need to be well characterized. In our previous work, the N-glycosylation proteome
61
of chicken egg white was studied, and a total of 71 N-glycosylation sites in 26 egg white
62
glycoproteins were identified.13 However, the N-glycosylation proteome of chicken egg
63
yolk (CEY) has not been investigated. Egg white proteins are synthesized by oviduct during 3
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Page 4 of 34
64
egg formation but most of the CEY proteins are synthesized as a precursor protein by the
65
liver of the hen, and then are transferred into the oocyte through bloodstream.14 After
66
enzymatic cleavage, the major fragments of precursor proteins (apolipoprotein B, APOB;
67
vitellogenins, VTGs) and lipids are combined into lipoprotein complexes (high-density
68
lipoprotein, HDL; low-density lipoprotein, LDL), which ultimately form yolk plasma and
69
yolk granule.14-16 The main CEY proteins, including immunoglobulin Y (IgY), HDL, LDL,
70
and phosvitin have all been reported as glycoproteins, and some of their glycosylation sites
71
have been clarified.17-18 Recently, a study of the CEY plasma peptidome demonstrated that
72
a total of 13 CEY plasma proteins contain PTMs, including 9 N-glycosylation sites on
73
apolipoprotein B, complement C4, histidine-rich glycoprotein, prothrombin, cathepsin EA-
74
like protein, and vitellogenin-2.19 The effect of glycosylation on the function of some CEY
75
proteins has been studied. A previous study revealed that the removal of N-glycan changed
76
the conformation, storage stability, and resistance to guanidine hydrochloride and pepsin
77
digestion of the IgY molecule, suggesting that N-glycosylation plays an important role in
78
the molecular structure and function of IgY.20
79
However, most of the other CEY glycoproteins have not been investigated. Therefore,
80
the N-glycosylation proteome of chicken egg yolk was studied in the present work. The
81
CEY proteins were digested with trypsin, and the glycopeptides were enriched using a
82
hydrophilic microcolumn, followed by deglycosylation using PNGase F in H218O water.
83
The
84
chromatography/nanoelectrospray ionization/mass spectrometry (UPLC-NSI-MS/MS),
85
and the N-glycosylation sites were identified based on the MS/MS data via the MaxQuant
deglycopeptides
were
identified
using
4
ACS Paragon Plus Environment
ultra-performance
liquid
Page 5 of 34
Journal of Agricultural and Food Chemistry
86
software. In addition, the function of the identified CEY glycoproteins were annotated
87
through Gene Ontology analysis.
88 89 90
MATERIALS AND METHODS
91
Chicken Egg Yolk (CEY)
92
Fresh chicken eggs laid within 24 h from White Leghorns (about 50-week-old, cage,
93
standard diets) were collected in the morning from the poultry farm of Sichuan Sundaily
94
Village Ecological Food Co., Ltd. (Mianyang, Sichuan) and were used in the study. The
95
fresh egg was broken, and the CEY was separated and rolled on a filter paper to eliminate
96
the egg white. After the chicken egg white was absorbed and the vitelline membrane was
97
adhered by the filter paper, a pipet tip was used to penetrate the vitelline membrane, and
98
the CEY was pulled.21 The yolks of 15 eggs from 3 repeated sampling were homogenized,
99
the mixed CEY sample was frozen and stored with liquid nitrogen until further analysis.
100
Protein Extraction
101
A standard protein extraction process was employed. In detail, the frozen CEY
102
samples (1 mL) was ground with liquid nitrogen into powder and then transferred to a 10-
103
mL centrifuge tube, and four volumes of lysis buffer (8 mol/L urea, 1% Triton-100, 10
104
mmol/L dithiothreitol, 10 μmol/L trichostatin A and 50 mmol/L nicotinamide) were added
105
to the tube, followed by sonicating three times (150 W, 30 s each time) on ice using a high-
106
intensity ultrasonic processor (JY92-II, Ningbo Scientz Biotechnology Co., Ltd., Ningbo,
107
China). The remaining insolubles were removed by centrifugation at 12,000 ×g at 4°C for
108
10 min. Then, the supernatant was precipitated overnight with the addition of 5 volumes of 5
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
109
0.1 mol/L ammonium acetate/methanol. After centrifugation at 12,000 ×g and 4°C for 10
110
min, the supernatant was discarded. The remaining precipitate was washed with cold
111
acetone three times. The protein was redissolved in 8 mol/L urea, and the protein
112
concentration was determined with a BCA kit according to the manufacturer’s
113
instructions.22-23
114
Digestion, Enrichment, and Deglycosylation of CEY Glycopeptides
115
The CEY proteins were processed based on a previous protocol with modifications.13,
116
24
117
for 30 min at 56°C and alkylated with 11 mmol/L iodoacetamide for 30 min at room
118
temperature in darkness. After diluting by adding 100 mM NH4HCO3 to urea with
119
concentration less than 2 mol/L, the CEY proteins were digested using trypsin (Sigma-
120
Aldrich) at a 1:50 trypsin-to-protein mass ratio for the first digestion overnight and 1:100
121
trypsin-to-protein mass ratio for a second 4-h digestion.
Briefly, the CEY protein solution (2 mg/mL) was reduced with 5 mmol/L dithiothreitol
122
The hydrolyzed CEY solution (50 μL) was freeze-dried and then reconstituted in 50
123
μL enrichment buffer (80% acetonitrile and 1% trifluoroacetic acid). The solution was
124
transferred to a hydrophilic interaction liquid chromatography microcolumn (HILIC,
125
SeQuant, Southborough, MA, USA) and centrifuged at 4000 ×g for 15 min, and then, the
126
hydrophilic microcolumn was washed 3 times with enrichment buffer.25 The glycopeptides
127
were eluted with 10% acetonitrile, the eluate was collected and lyophilized. Afterwards,
128
the glycopeptides were reconstituted in 50 μL of the 50 mmol/L ammonium bicarbonate
129
buffer, which was prepared with H218O, and 2 μg of PNGase F (Roche, 11365185001,
130
Mannheim, Germany) was added and incubated at 37°C overnight.13 Finally, the salt was
6
ACS Paragon Plus Environment
Page 6 of 34
Page 7 of 34
Journal of Agricultural and Food Chemistry
131
removed according to the C18 ZipTips instructions (Millipore, Billerica, MA, USA), and
132
the enriched CEY glycopeptides were lyophilized.
133
Liquid Chromatography Tandem Mass Spectrometry (LC-MS/MS) Analysis
134
Liquid chromatography separation and mass spectrometric measurement were
135
performed via UPLC-NSI-MS/MS using a Thermo Scientific™ Orbitrap Fusion™
136
Tribrid™ equipped with an EASY-nLC 1000 UPLC system (Thermo Fisher Scientific,
137
Bremen, Germany). The enriched CEY glycopeptides were dissolved in 0.1% formic acid
138
(solvent A) and loaded onto a Reprosil-Pur C18 reverse-phase analytical column (1.9 μm
139
particles, inner diameter 75 µm, 15 cm length). The gradient comprised an increase from 5%
140
to 20% solvent B (0.1% formic acid in 98% acetonitrile) over 42 min, 20% to 35% in 12
141
min and climbing to 80% in 3 min, then holding at 80% for the last 3 min. The separation
142
was performed at a constant flow rate of 700 nL/min. After LC separation, the CEY
143
glycopeptides were subjected to a nanoelectrospray ionization (nESI) source followed by
144
mass spectrometry (MS/MS) in the Orbitrap FusionTM. The electrospray voltage of the
145
nESI source was 2.0 kV, the m/z scan range was 350 to 1550 for the full scan, and intact
146
peptides were detected in the Orbitrap at a resolution of 60,000. The mass spectrometer
147
was operated in data-dependent acquisition mode, and the top20 most abundant peptides
148
from the MS data were subjected to fragmentation by high-energy collision-induced
149
decomposition. For the MS/MS analysis, the automatic gain control was set at 5E4, the
150
signal threshold at 5000 ions/s, the maximum injection time at 200 ms, and the dynamic
151
exclusion time for the tandem mass scan at 15 s.
152
Tandem Mass Spectrometry Data Analysis
7
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
153
The data were analyzed using MaxQuant software (1.5.2.8) and compared against the
154
UniProt and NCBI databases (organism: Gallus gallus, date: 2018.03.09). Trypsin was
155
specified as the cleavage enzyme, allowing up to 4 missing cleavages. The mass tolerances
156
for the precursor ions were set as 20 ppm in the first search and 5 ppm in the main search,
157
and the mass tolerance for the fragment ions was set as 0.02 Da. Carbamidomethyl on Cys
158
was specified as fixed modification, and deamidation with 18O (N) and oxidation of Met
159
were specified as variable modifications. For identification, the false discovery rate was
160
specified as 1% at both the peptide and site levels.
161
Bioinformatics Analysis
162
The potential N-glycosylation sites of the identified CEY glycoproteins were
163
predicted by the NetNGlyc 1.0 Server (http://www.cbs.dtu.dk/services/NetNGlyc/) based
164
on the sequences obtained from the UniProt database (http://www.uniprot.org/). Gene
165
Ontology (GO) analysis of the identified CEY glycoproteins was derived from the UniProt-
166
Gene Ontology Annotation database (http://www.ebi.ac.uk/GOA). The locations of the N-
167
glycosylation sites of Vitellogenins (VTGs) were visualized and presented by Illustrator
168
for Biological Sequences (IBS, version 1.0).26
169 170
RESULTS AND DISCUSSION
171
Characterization of the identified N-glycosylation sites in CEY
172
Glycopeptides derived from the CEY proteins were enriched by the HILIC column
173
and deglycosylated by PNGase F in H218O, resulting in the conversion of the originally
174
glycosylated asparagine to aspartic acid (occupied by -18OH) with a 2.99 Da increase in
175
molecular weight, which can be exploited to identify the N-glycosylation sites.27-28 Of the 8
ACS Paragon Plus Environment
Page 8 of 34
Page 9 of 34
Journal of Agricultural and Food Chemistry
176
28883 spectra produced by the mass spectrometer, a total of 4341 spectra matched with the
177
alignment protein, and 1109 of these spectra were identified as glycopeptides. All the
178
glycopeptides were identified with high precision, and the mass tolerance of the peptide
179
ions was less than 5 ppm (Figure 1A). After clearing the repeated sequence, a total of 208
180
unique CEY glycopeptides were obtained, which contain 217 N-glycosylation sites and
181
belong to 86 glycoproteins (Table S1).
182
In the present study, the number of MS/MS spectra that matched a certain unique
183
glycopeptide was called the “MS/MS count”, which was used to approximate the
184
abundance of the glycopeptides. Among the identified CEY glycopeptides, about half of
185
the identified CEY glycopeptides had an MS/MS count of 1, indicating that these N-
186
glycosylation sites are low in abundance. However, 17% of the identified CEY
187
glycopeptides had a large MS/MS count (5 or more), indicating that these N-glycopeptides
188
are relatively high in abundance (Figure 1B).
189
The canonical N-glycosylation sequence is N-X-[S/T] (where X is not proline). Here,
190
Web-Logo was employed to visualize the sequence motifs around the N-glycosylation sites
191
(5 amino acids on each side).29 As shown in Figure 1C, threonine (T) and serine (S) were
192
significantly overrepresented at the +2 position, suggesting that most of the identified egg
193
white N-glycosylation sites were localized at the canonical sequence. More specifically,
194
based on the unique peptide, the N-glycosylation sites that matched with N-X-T (75 unique
195
peptides, accounting for 35%) occurred more frequently than those that matched with N-
196
X-S (62 unique peptides, 29%). However, based on the MS/MS count, the number of
197
MS/MS spectra with N-X-T (391 MS/MS count, accounting for 36%) was lower than that
9
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
198
with N-X-S (567 MS/MS count, 52%) (Figure 1C). This indicated that CEY glycoproteins
199
that contain the N-X-S sequence have a higher abundance.
200
Gene Ontology analysis of the identified CEY glycoproteins
201
The identified CEY glycoproteins were classified according to their GO annotation.
202
As shown in Figure 2, in terms of “molecular function,” 35 CEY glycoproteins are involved
203
in “binding” (GO:0005488) at GO level 2. For the development of the embryo, CEY is rich
204
in a variety of nutrients, such as lipids, proteins, minerals, and bioactive molecules.30
205
Therefore, many CEY glycoproteins have binding abilities, including “ion binding” (19
206
glycoproteins), “protein binding” (15), and “lipid binding” (5) at GO level 3. A total of 22
207
CEY glycoproteins are involved in “catalytic activity” (GO:0003824), which mainly refers
208
to “hydrolase activity” (GO:0016787, 17) and “catalytic activity, acting on a protein”
209
(GO:0140096, 16) at GO level 3; in particular, 14 CEY glycoproteins are classified as
210
having “peptidase activity” (GO:0070011). These results indicated that many CEY
211
glycoproteins are involved in protein hydrolysis/digestion, which can provide peptides and
212
amino acids for embryonic development. Another well-represented category is “molecular
213
function regulator” (19); notably, 17 of these 19 CEY glycoproteins were annotated as
214
having “endopeptidase inhibitor activity” (GO:0004866). The number of inhibitors was
215
equivalent to that of hydrolases, suggesting that there is a proteinase/inhibitor regulation
216
system in CEY.
217
In terms of “biological process” at GO level 2, the CEY glycoproteins related to
10
ACS Paragon Plus Environment
Page 10 of 34
Page 11 of 34
Journal of Agricultural and Food Chemistry
218
“biological regulation” (GO:0065007, 39), “metabolic process” (GO:0008152, 21),
219
“cellular process” (GO:0009987, 19), “response to stimulus” (GO:0050896, 18), and
220
“immune system process” (GO:0002376, 10) accounted for the largest portion. As an
221
independent organ, the chicken egg needs to perform a variety of biological processes alone,
222
and thus, many CEY glycoproteins are involved in regulatory and metabolic processes.
223
During hatching, the chicken egg is under different types of stress and environmental
224
factors;31 therefore, proteins that respond to stimuli account for a large proportion of the
225
CEY glycoproteins.
226
Regarding the cellular components (at GO level 2), the most well-represented
227
category was “extracellular region part” (GO:0044421, 40), “cell part” (GO:0044464, 25),
228
“protein-containing complex” (GO:0032991, 13). The egg yolk plasma and granules,
229
which account for most of the egg yolk, do not belong to the cell part, and thus most of the
230
CEY glycoproteins are classified as “extracellular region part.” Low-density lipoprotein,
231
high-density lipoprotein, and yolk granules are all complexes, and thus “protein-containing
232
complex” accounts for a high proportion of the CEY glycoproteins.
233
The primary glycoproteins in CEY
234
Of the 86 identified CEY N-glycoproteins, 39 (approximately 49%) carried single N-
235
glycosylation sites, and the other 47 proteins had multiple N-glycosylation sites.
236
Apolipoprotein B (APOB) was the most heavily N-glycosylated protein in CEY with 35
237
N-glycosylation sites, followed by Vitellogenin-1 (VTG1), Vitellogenin-2 (VTG2) and
238
serum albumin with 15, 13 and 7 N-glycosylation sites, respectively (Figure 3A). However, 11
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
239
according to the abundance (MS/MS count) of the CEY glycopeptides, APOB is the most
240
abundant glycoprotein in CEY (443 MS/MS count), followed by VTG-2 (163 MS/MS
241
count), VTG-1 (89 MS/MS count) and IgY (67 MS/MS count, including light chain P20763
242
and heavy chain AHX37590.1) (Figure 3B).
243
Apolipoprotein B
244
As the most heavily glycosylated protein in CEY, APOB is a 523 kDa (4,631 AA)
245
protein that is enzymatically cleaved and results in 7 kinds of fragments during its transfer
246
from chicken serum into the yolk.14 These APOB fragments, together with apovitellenin-1
247
(which was also identified as a glycoprotein with an N-glycosylation site at N75 in the
248
present study) and lipids, assemble and form CEY LDL.15-16 APOB contains 19 potential
249
N-glycosylation sites predicted by NetNGlyc 1.0, two of which have been identified
250
recently (N1369 and N3697).19 Here, a total of 35 N-glycosylation sites of APOB were
251
identified, and 12 of them were also predicted to be modified by NetNGlyc 1.0, while the
252
other 23 sites were unpredicted, and all of them localized at the noncanonical sequence.
253
Meanwhile, 7 of the predicted N-glycosylation sites of APOB were not identified in the
254
current experiment (Figure 4). These results indicated that the low-frequency noncanonical
255
N-glycosylation modifications cannot be predicted, while the predicted N-glycosylation
256
sites based on the canonical N-X-[S/T] sequence need to be confirmed through
257
experimental analysis. However, the 12 APOB N-glycopeptides, which were both
258
predicted and identified, had a large MS/MS count (Table S1), suggesting that high-
259
frequency N-glycosylation tends to occupy the canonical N-X-[S/T] sequence.
260
The large number of low-frequency noncanonical N-glycosylation modifications was
261
a noteworthy phenomenon. How does this low-frequency N-glycosylation modification 12
ACS Paragon Plus Environment
Page 12 of 34
Page 13 of 34
Journal of Agricultural and Food Chemistry
262
affect the structure and function of proteins? Is it a random and meaningless modification
263
or important for biological processes? These questions should be of concern and studied in
264
the future.
265
Vitellogenins
266
VTG1, VTG2 and VTG3 are the three major vitellogenin proteins in CEY, and they
267
have similar domains (lipovitellin-1, lipovitellin-2, phosvitin and YGPs) but low sequence
268
homology (34% identity by the Alignment analysis of UniPort). VTGs are synthesized in
269
the liver and transferred to the CEY through the blood circulation, and then they are cleaved
270
to lipovitellin-1, phosvitin, lipovitellin-2 and YGP42/40/30. Lipovitellin-1, lipovitellin-2
271
and lipids compose the CEY HDL, and HDL together with phosvitins assemble the CEY
272
granules.14, 30
273
According to sequence analysis, VTG1, VTG2 and VTG3 contain 9, 9 and 5 potential
274
N-glycosylation sites, respectively. However, similar to the case of APOB, only some of
275
these predicted sites (4 sites of VTG1, 5 sites of VTG2, 1 sites of VTG3) were confirmed
276
to have undergone N-glycosylation in the present experiment. In addition, 11 unpredicted
277
sites on VTG1, 8 unpredicted sites on VTG2 and 5 unpredicted sites on VTG3 (all of which
278
were located at the noncanonical sequence) were identified as N-glycosylation sites.
279
Among the 15 total identified N-glycosylation sites of VTG1, 7 are located on lipovitellin-1,
280
8 are located on YGP42, and no sites are located on lipovitellin-2. In contrast, among the
281
13 total identified N-glycosylation sites of VTG2, 7 are located on lipovitellin-1, 1 is
282
located on lipovitellin-1, and 5 are located on YGP40 (Figure 5). The difference in the
283
distribution of these N-glycosylation sites may be an important reason for the difference in
284
the properties of VTG1 and VTG2. Another noteworthy result was that the total number of 13
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Page 14 of 34
285
MS/MS counts that matched with VTG1, VTG2 and VTG3 was 89, 163 and 14,
286
respectively, suggesting that VTG2 was the most abundant CEY VTG, which was
287
consistent with previous research.32-33
288
Although phosvitin has been reported to undergo glycosylation on the asparagine at
289
1280,34 none of the N-glycosylation sites was identified in the phosvitin region of the VTGs
290
in the present study. This may be because the high phosphorylation weakens the affinity
291
between the phosvitin peptides and HILIC column.
292
Protease and its inhibitor
293
A total of 8 and 14 identified CEY glycoproteins were recognized as proteases and
294
protease
295
carboxypeptidase B2, cathepsin EA-like protein, gamma-glutamyl hydrolase, plasminogen
296
(F1NWX6, and R4GMH5), protein C, and prothrombin; the protease inhibitors included:
297
antithrombin-III, kininogen 1, serpin family A member 10, serpin family G member 1,
298
ovoinhibitor, and several uncharacterized proteins (A0A1D5PBU0, A0A1D5PCD2,
299
E1C206, E1C7T1, F1NEQ4, F1NK40, F1NL38, F1NPN5, and F1P587) (Table S1). Of
300
these proteases and its inhibitors, carboxypeptidase was the most heavily N-glycosylated
301
proteases, with 2 N-glycosylation sites and an MS/MS count of 9; while ovoinhibitor was
302
the most heavily N-glycosylated inhibitors, with 4 N-glycosylation sites and an MS/MS
303
count of 15. More specifically, the ovoinhibitor, a member of the Kazal family of protease
304
inhibitors with inhibitory activity against serine proteinase, contains three predicted N-
305
glycosylation sites at N27, N141 and N461. In our previous study, two N-glycosylation sites
306
at N62 and N141 had been experimentally verified;13 here, another two sites (N193 and N461)
307
were identified to have undergone N-glycosylation.
inhibitors,
respectively.
The
proteases
14
ACS Paragon Plus Environment
included:
carboxypeptidase,
Page 15 of 34
Journal of Agricultural and Food Chemistry
308
Covalently bonded N-glycans can alter the folding and the surface properties of the
309
protease or inhibitor, thus affecting the recognition, specificity, and binding affinity.
310
Therefore, N-glycosylation modifications are important for the stability, activation, and
311
catalytic activity of proteases and their inhibitors.35 However, the effects of glycosylation
312
on CEY proteases and inhibitors are not yet clear, and they remain one of the concerned
313
topics in the field of egg science and need to be studied in the future.
314
Members of the Complement and Immune Systems
315
CEY is an important part of the embryo's defensive system and needs to provide
316
protection against potential internal or invasive threats. Therefore, many CEY
317
glycoproteins are involved in the immune and complement systems, which allow for the
318
direct killing of microbes, the disposal of ineffective complexes, and the self-regulation of
319
immune processes. Eight CEY glycoproteins were identified as components of the
320
complement system: complement C2, C3, C4, C6, C8, complement factors H and I, as well
321
as an uncharacterized protein (F1P587, gene name is LOC418892). Complement C3 was
322
most heavily N-glycosylated, with two N-glycosylation sites at N957 and N1428, which were
323
predicted by NetNGlyc 1.0. The matching MS/MS counts of N957 and N1428 were 7 and 10,
324
suggesting that complement C3 is a highly abundant complement glycoprotein in CEY.
325
The alignment analysis (provided by UniProt) revealed that the sequence identity between
326
CEY complement C3 and human complement C3 (P01024) is 53.8%; however, these two
327
proteins do not share an N-glycosylation site (data not shown). The high sequence identity
328
and low modification similarity of homologous glycoproteins have also been found in our
329
previous study of chicken egg white ovomacroglobulin.24 This phenomenon is interesting
330
and noteworthy, showing a balance of the conservation and diversity of protein molecules. 15
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
331
There were 7 identified immune-related glycoproteins, including the light chain (Ig
332
lambda chain C region, P20763) and heavy chain (AHX37590.1) of IgY, Fc fragment of
333
IgG binding protein (A0A1D5P6F4), Ig mu chain C region (P01875), immunoglobulin
334
heavy chain variable region (CAO79238.1), and immunoglobulin light chain variable
335
region (AJQ23642.1). Among these glycoproteins, IgY was the most abundant immune-
336
related glycoprotein, containing 3 N-glycosylation sites (N156, N178 and N255 of
337
AHX37590.1) on the heavy chain constant region and 2 N-glycosylation sites (N61 and N90
338
of P20763) at the light chain constant region. The total MS/MS count of IgY glycopeptides
339
was 67, suggesting that IgY is a highly abundant immune glycoprotein in CEY. In a
340
previous work, two sites (N308 and N407) at the heavy chain of IgY were identified as N-
341
glycosylated, which corresponded the N156 and N255 of AHX37590.1 in the present study.18
342
In addition to these 2 sites, other 3 newly identified N-glycosylation sites of IgY provided
343
additional information of the glycosylation profile of IgY.
344
Other CEY glycoproteins
345
Ovotransferrin, which is the second most abundant protein in chicken egg white, has
346
also been found in egg yolk.19, 36-37 The N-glycosylation sites at N492 and N637 had been
347
reported in a previous study,13 and another two N-glycosylation sites at N233 and N514 were
348
found in the present study.
349
Albumin, also known as α-livetin, which is transferred from the blood serum to CEY,
350
has been identified as one of the major egg allergens. In albumin, MS/MS analysis
351
ascertained seven N-glycosylation sites, namely, N126, N158, N291, N393, N400, N498, and N500,
352
while only the N500 residue was predicted by NetNGlyc 1.0. Among the six unpredicted N-
16
ACS Paragon Plus Environment
Page 16 of 34
Page 17 of 34
Journal of Agricultural and Food Chemistry
353
glycosylation sites, N291 and N498 have MS/MS counts of 4 and 2, respectively, implying
354
that these two sites are relatively high in abundance.
355
In summary, the N-glycoproteome of CEY was studied using a shotgun
356
glycoproteomics strategy, a total of 86 glycoproteins were identified in CEY and their
357
biological function were annotated and discussed. The N-glycosylated modification, as
358
well as other types of post-translational modifications, are extremely important for
359
understanding the structures, functions, and bioactivities of the food proteins. Because
360
these modifications could alter the molecular weight, surface charge and folding of proteins,
361
or as a marker for interaction and signaling. Therefore, in the research of food proteome,
362
modification of proteins is another basic dimension in addition to the protein species and
363
abundance, which need to be given more attention in the further.
364 365
Abbreviations Used
366
APOB, Apolipoprotein B; CEY, chicken egg yolk; GO, Gene Ontology; HDL, high-
367
density lipoprotein; HILIC, Hydrophilic interaction liquid chromatography; IgY,
368
immunoglobulin Y; LDL, low-density lipoprotein; nESI, nanoelectrospray ionization;
369
PTMs, Posttranslational modifications; VTG, Vitellogenin.
370 371
Funding
372
National Key Research and Development Program of China (2018YFD0400302)
373
National Natural Science Foundation of China (No. 31601490; No. 31871732)
374
Supporting Information Description
17
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
375
The details of identified CEY N-glycopeptides and N-glycoproteins resulting from a
376
database search of LC-MS/MS data are shown in Supporting file (Table S1). This
377
information is available free of charge via the Internet at http://pubs.acs.org.
378 379
Notes
380
The authors declare no competing financial interests.
381 382
REFERENCES
383
1.
384 385 386
phosphoproteomic analysis among muscles of different color stability using tandem mass tag labeling. Food Chem. 2018, 249, 8-15. 2.
387 388 389
111. 3.
4.
Allergy to Goat and Sheep Milk. J. Invest. Allerg. Clin. 2011, 21, 398-400. 5.
400 401
Karav, S.; German, J. B.; Rouquie, C.; Le Parc, A.; Barile, D., Studying Lactoferrin N-Glycosylation. Int. J. Mol. Sci. 2017, 18, 870.
6.
397 398 399
Cases, B.; Garcia-Ara, C.; Boyano, M. T.; Perez-Gordo, M.; Pedrosa, M.; Vivanco, F.; Quirce, S.; Pastor-Vargas, C., Phosphorylation Reduces the Allergenicity of Cow Casein in Children With Selective
395 396
Li, Z.; Li, M.; Du, M. T.; Shen, Q. W. E.; Zhang, D. Q., Dephosphorylation enhances postmortem degradation of myofibrillar proteins. Food Chem. 2018, 245, 233-239.
392 393 394
Liu, M. S.; Wei, Y. C.; Li, X.; Quek, S. Y.; Zhao, J.; Zhong, H. Z.; Zhang, D. Q.; Liu, Y. F., Quantitative phosphoproteomic analysis of caprine muscle with high and low meat quality. Meat Sci. 2018, 141, 103-
390 391
Li, Z.; Li, M.; Li, X.; Xin, J. Z.; Wang, Y.; Shen, Q. W. W.; Zhang, D. Q., Quantitative
Le Parc, A.; Karav, S.; Rouquie, C.; Maga, E. A.; Bunyatratchata, A.; Barile, D., Characterization of recombinant human lactoferrin N-glycans expressed in the milk of transgenic cows. Plos One 2017, 12, e0171477.
7.
Cao, X. Y.; Song, D. H.; Yang, M.; Yang, N.; Ye, Q.; Tao, D. B.; Liu, B.; Wu, R. N.; Yue, X. Q., Comparative Analysis of Whey N-Glycoproteins in Human Colostrum and Mature Milk Using Quantitative Glycoproteomics. J. Agric. Food Chem. 2017, 65, 10360-10367. 18
ACS Paragon Plus Environment
Page 18 of 34
Page 19 of 34
402
Journal of Agricultural and Food Chemistry
8.
403 404 405
Cao, X. Y.; Kang, S. M.; Yang, M.; Li, W. X.; Wu, S. Y.; Han, H. J.; Meng, L. H.; Wu, R.; Yue, X. Q., Quantitative N-glycoproteomics of milk fat globule membrane in human colostrum and mature milk reveals changes in protein glycosylation during lactation. Food Funct. 2018, 9, 1163-1172.
9.
Sun, X.; Gänzle, M. G.; Wu, J., Identification and Characterization of Glycopeptides from Egg Protein
406 407
Ovomucin with Anti-Agglutinating Activity against Porcine K88 Enterotoxigenic Escherichia coli
408
10. Offengenden, M.; Wu, J., Egg white ovomucin gels: structured fluids with weak polyelectrolyte
409 410 411 412 413 414 415
Strains. J. Agric. Food Chem. 2017, 65, 777-783.
properties. RSC Adv. 2013, 3, 910-917. 11. Mine, Y.; Yang, M., Recent advances in the understanding of egg allergens: basic, industrial, and clinical perspectives. J. Agric. Food Chem. 2008, 56, 4874-4900. 12. Benede, S.; Lopez-Exposito, I.; Molina, E.; Lopez-Fandino, R., Egg proteins as allergens and the effects of the food matrix and processing. Food Funct. 2015, 6, 694-713. 13. Geng, F.; Wang, J. Q.; Liu, D. Y.; Jin, Y. G.; Ma, M. H., Identification of N-Glycosites in Chicken Egg White Proteins Using an Omics Strategy. J. Agric. Food Chem. 2017, 65, 5357-5364.
416
14. Hatta H.; Kapoor M. P.; Juneja L.R. Bioactive Components in Egg Yolk. in Egg bioscience and
417
biotechnology. Mine, Y. Eds.; John Wiley & Sons Inc.: Hoboken, New Jersey, 2008; pp 185-192.
418
15. Jolivet, P.; Boulard, C.; Beaumal, V.; Chardot, T.; Anton, M., Protein components of low-density
419
lipoproteins purified from hen egg yolk. J. Agric. Food Chem. 2006, 54, 4424-4429.
420
16. Jolivet, P.; Boulard, C.; Chardot, T.; Anton, M., New insights into the structure of apolipoprotein B
421 422
from low-density lipoproteins and identification of a novel YGP-like protein in hen egg yolk. J. Agric.
423
17. Mine, Y.; D’Silva, I. Bioactive Components in Egg White. In Egg bioscience and biotechnology, Mine,
424 425 426 427 428
Food Chem. 2008, 56, 5871-5879.
Y. Eds.; John Wiley & Sons, Inc.: Hoboken, New Jersey, 2008; pp 141-184. 18. Sheng, L.; He, Z. J.; Liu, Y. P.; Ma, M. H.; Cai, Z. X., Mass spectrometry characterization for Nglycosylation of immunoglobulin Y from hen egg yolk. Int. J. Biol. Macromol. 2018, 108, 277-283. 19. Arena, S.; Scaloni, A., An Extensive Description of the Peptidomic Repertoire of the Hen Egg Yolk Plasma. J. Agric. Food Chem. 2018, 66, 3239-3255.
19
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
429
20. Sheng, L.; He, Z. J.; Chen, J. H.; Liu, Y. F.; Ma, M. H.; Cai, Z. X., The impact of N-glycosylation on
430 431
conformation and stability of immunoglobulin Y from egg yolk. Int. J. Biol. Macromol. 2017, 96, 129136.
432
21. Tong, C. Y.; Geng, F.; He, Z. J.; Cai, Z. X.; Ma, M. H., A simple method for isolating chicken egg yolk
433 434
immunoglobulin using effective delipidation solution and ammonium sulfate. Poultry Sci. 2015, 94,
435 436 437 438
104-110. 22. Washburn, M. P.; Wolters, D.; Yates III, J. R., Large-scale analysis of the yeast proteome by multidimensional protein identification technology. Nat. biotechnol. 2001, 19, 242-247. 23. Wiśniewski, J. R.; Zougman, A.; Nagaraj, N.; Mann, M., Universal sample preparation method for proteome analysis. Nat. methods 2009, 6, 359-362.
439
24. Geng, F.; Huang, X.; Majumder, K.; Zhu, Z. H.; Cai, Z. X.; Ma, M. H., Mass Spectrometry and Two-
440 441
Dimensional Electrophoresis To Characterize the Glycosylation of Hen Egg White Ovomacroglobulin.
442
25. Zacharias, L. G.; Hartmann, A. K.; Song, E.; Zhao, J.; Zhu, R.; Mirzaei, P.; Mechref, Y., HILIC and
443 444
ERLIC enrichment of glycopeptides derived from breast and brain cancer cells. J. Proteome Res. 2016,
445
26. Liu, W. Z.; Xie, Y. B.; Ma, J. Y.; Luo, X. T.; Nie, P.; Zuo, Z. X.; Lahrmann, U.; Zhao, Q.; Zheng, Y.
446 447
Y.; Zhao, Y.; Xue, Y.; Ren, J., IBS: an illustrator for the presentation and visualization of biological
448
27. Zielinska, D. F.; Gnad, F.; Wiśniewski, J. R.; Mann, M., Precision mapping of an in vivo N-
449
J. Agric. Food Chem. 2015, 63, 8209-8215.
15, 3624-3634.
sequences. Bioinformatics 2015, 31, 3359-3361.
glycoproteome reveals rigid topological and sequence constraints. Cell 2010, 141, 897-907.
450
28. Zielinska, D. F.; Gnad, F.; Schropp, K.; Wiśniewski, J. R.; Mann, M., Mapping N-glycosylation sites
451 452
across seven evolutionarily distant species reveals a divergent substrate proteome despite a common
453
29. Crooks, G. E.; Hon, G.; Chandonia, J.-M.; Brenner, S. E., WebLogo: a sequence logo generator. Genome
454 455 456
core machinery. Mol. Cell 2012, 46, 542-548.
Res. 2004, 14, 1188-1190. 30. Anton, M. Composition and Structure of Hen Egg Yolk, in Bioactive egg compounds; Huopalahti, R., Anton, M., López-Fandiño, R., Schade, R., Eds.; Springer Berlin Heidelberg: 2007; pp 1-32.
20
ACS Paragon Plus Environment
Page 20 of 34
Page 21 of 34
Journal of Agricultural and Food Chemistry
457
31. Guyot, N.; Réhault-Godbert, S., Understanding the natural antibacterial defences of egg white and their
458 459
regulation. In Achieving sustainable production of eggs, Cargill P., Eds.; Burleigh Dodds Science
460
32. Wang, S.-Y.; Williams, D. L., Identification, purification, and characterization of two distinct avian
461 462 463 464 465 466 467 468 469
Publishing: Sawston, UK, 2017; Volume 1, pp 430-464.
vitellogenins. Biochemistry 1980, 19, 1557-1563. 33. Wang, S.; Smith, D. E.; Williams, D. L., Purification of avian vitellogenin III: comparison with vitellogenins I and II. Biochemistry 1983, 22, 6206-6212. 34. Brockbank, R. L.; Vogel, H. J., Structure of the oligosaccharide of hen phosvitin as determined by twodimensional proton NMR of the intact glycoprotein. Biochemistry 1990, 29, 5574-5583. 35. Goettig, P., Effects of Glycosylation on the Enzymatic Activity and Mechanisms of Proteases. Int. J. Mol. Sci. 2016, 17, 1969-1992. 36. Liu, Y.; Qiu, N.; Gao, D.; Ma, M., Comparative proteomic analysis of chicken, duck, and quail egg yolks. Int. J. Food Prop. 2018, 21, 1311-1321.
470
37. Farinazzo A.; Restuccia U.; Bachi A.; Guerrier L.; Fortis F.; Boschetti E.; Fasoli E.; Citterio A.; Righetti
471 472 473
PG. Chicken egg yolk cytoplasmic proteome, mined via combinatorial peptide ligand libraries. J. Chromatog. A. 2009, 1216, 8, 1241-1252.
21
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
474
Figure Captions
475 476
Figure 1. Characteristics of the identified N-glycopeptides in CEY. A, mass error
477
distribution of identified N-glycopeptides; B, distribution of the MS/MS count number of
478
identified N-glycopeptides; C, N-glycosylation unique sequences as derived using Web-
479
Logo; D, distribution of recognized sequence motifs based on the unique sequence or
480
MS/MS count of the identified N-glycopeptides (X≠P).
481 482
Figure 2. GO analysis (level 2) of identified CEY glycoproteins.
483 484
Figure 3. Distribution of identified N-glycosylation sites (A) or matched MS/MS count
485
(B) in CEY proteins.
486 487
Figure 4 Overlapping Venn diagram of predicted and identified N-glycosylation sites of
488
APOB.
489 490
Figure 5. Comparison of the distributions of identified N-glycosylation sites on VTG1 and
491
VTG2.
492
22
ACS Paragon Plus Environment
Page 22 of 34
Page 23 of 34
Journal of Agricultural and Food Chemistry
493 494
Figure 1. Characteristics of the identified N-glycopeptides in CEY. A, mass error
495
distribution of identified N-glycopeptides; B, distribution of the MS/MS count number of
496
identified N-glycopeptides; C, N-glycosylation unique sequences as derived using Web-
497
Logo; D, distribution of recognized sequence motifs based on the unique sequence or
498
MS/MS count of the identified N-glycopeptides (X≠P).
499
23
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
500
Figure 2. GO analysis (level 2) of identified CEY glycoproteins.
501
24
ACS Paragon Plus Environment
Page 24 of 34
Page 25 of 34
Journal of Agricultural and Food Chemistry
502
Figure 3. Distribution of identified N-glycosylation sites (A) or matched MS/MS count
503
(B) in CEY proteins.
504
25
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
505
Figure 4 Overlapping Venn diagram of predicted and identified N-glycosylation sites of
506
APOB.
507
26
ACS Paragon Plus Environment
Page 26 of 34
Page 27 of 34
Journal of Agricultural and Food Chemistry
508
Figure 5. Comparison of the distribution of identified N-glycosylation sites on VTG1 and
509
VTG2.
510
27
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
511
Graphic for table of contents
512
28
ACS Paragon Plus Environment
Page 28 of 34
Page 29 of 34
Journal of Agricultural and Food Chemistry
Figure 1. Characteristics of the identified N-glycopeptides in CEY. A, mass error distribution of identified Nglycopeptides; B, distribution of the MS/MS count number of identified N-glycopeptides; C, N-glycosylation unique sequences as derived using Web-Logo; D, distribution of recognized sequence motifs based on the unique sequence or MS/MS count of the identified N-glycopeptides (X≠P). 190x139mm (300 x 300 DPI)
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Figure 2. GO analysis (level 2) of identified CEY glycoproteins. 153x94mm (300 x 300 DPI)
ACS Paragon Plus Environment
Page 30 of 34
Page 31 of 34
Journal of Agricultural and Food Chemistry
Figure 3. Distribution of identified N-glycosylation sites (A) or matched MS/MS count (B) in CEY proteins. 123x137mm (300 x 300 DPI)
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Figure 4 Overlapping Venn diagram of predicted and identified N-glycosylation sites of APOB. 74x54mm (300 x 300 DPI)
ACS Paragon Plus Environment
Page 32 of 34
Page 33 of 34
Journal of Agricultural and Food Chemistry
Figure 5. Comparison of the distributions of identified N-glycosylation sites on VTG1 and VTG2. 190x64mm (300 x 300 DPI)
ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Graphic for table of contents 168x107mm (300 x 300 DPI)
ACS Paragon Plus Environment
Page 34 of 34