Quantitative Effects of O-Linked Glycans on Protein Folding

Jul 26, 2017 - Discover the Most-Read Physical Chemistry Articles of November 2017. What were chemists reading in November of 2017? To find out, we've...
0 downloads 8 Views 1MB Size
Subscriber access provided by UNIV OF NEWCASTLE

Article

Quantitative Effects of O-linked Glycans on Protein Folding Patrick K Chaffey, Xiaoyang Guan, Xinfeng Wang, Yuan Ruan, Yaohao Li, Suzannah G Miller, Amy H Tran, Theo N Koelsch, Lomax F Pass, and Zhongping Tan Biochemistry, Just Accepted Manuscript • DOI: 10.1021/acs.biochem.7b00483 • Publication Date (Web): 26 Jul 2017 Downloaded from http://pubs.acs.org on July 27, 2017

Just Accepted “Just Accepted” manuscripts have been peer-reviewed and accepted for publication. They are posted online prior to technical editing, formatting for publication and author proofing. The American Chemical Society provides “Just Accepted” as a free service to the research community to expedite the dissemination of scientific material as soon as possible after acceptance. “Just Accepted” manuscripts appear in full in PDF format accompanied by an HTML abstract. “Just Accepted” manuscripts have been fully peer reviewed, but should not be considered the official version of record. They are accessible to all readers and citable by the Digital Object Identifier (DOI®). “Just Accepted” is an optional service offered to authors. Therefore, the “Just Accepted” Web site may not include all articles that will be published in the journal. After a manuscript is technically edited and formatted, it will be removed from the “Just Accepted” Web site and published as an ASAP article. Note that technical editing may introduce minor changes to the manuscript text and/or graphics which could affect content, and all legal disclaimers and ethical guidelines that apply to the journal pertain. ACS cannot be held responsible for errors or consequences arising from the use of information contained in these “Just Accepted” manuscripts.

Biochemistry is published by the American Chemical Society. 1155 Sixteenth Street N.W., Washington, DC 20036 Published by American Chemical Society. Copyright © American Chemical Society. However, no copyright claim is made to original U.S. Government works, or works produced by employees of any Commonwealth realm Crown government in the course of their duties.

Page 1 of 11

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

Quantitative Effects of O-linked Glycans on Protein Folding Patrick K. Chaffey,‡ Xiaoyang Guan,‡ Xinfeng Wang, Yuan Ruan, Yaohao Li, Suzannah G. Miller, Amy H. Tran, Theo N. Koelsch, Lomax F. Pass, Zhongping Tan* Department of Chemistry and Biochemistry and BioFrontiers Institute, University of Colorado, Boulder, CO 80303, United States ABSTRACT: Protein O-glycosylation is a diverse, common, and important post-translational modification of both proteins inside the cell and those that are secreted or membrane bound. Much work has shown that O-glycosylation can alter the structure, function, and physical properties of the proteins to which it is attached. One gap remaining in our understanding of O-glycoproteins is how O-glycans might affect the folding of proteins. Here, we took advantage of synthetic, homogeneous O-glycopeptides to show that certain glycosylation patterns have an intrinsic effect, independent of any cellular folding machinery, on the folding pathway of a model O-glycoprotein, a carbohydrate binding module (CBM) derived from the Trichoderma reesei cellulase TrCel7A. The strongest effect, a six-fold increase in overall folding rate, was observed when a single O-mannose was the glycan, and the glycosylation site was near the N-terminus of the peptide sequence. We were also able to show that glycosylation patterns affected the kinetics of each step in unique ways, which may help to explain the observations made here. This work is a first step towards quantitative understanding of how O-glycosylation might control, through intrinsic means, the folding of O-glycoproteins. Such an understanding is expected to facilitate future investigations into the effects of glycosylation on more biological processes related to protein folding.

INTRODUCTION Proteins are important across many fields as basic and critical components of living systems, tools that answer scientific questions, industrial enzymes that produce and modify compounds on a commodity scale, drug targets and, increasingly, therapeutics in their own right.1-5 Since a proper threedimensional shape, or fold, is essential for a protein’s function, a better understanding of how to improve the folding process is important for any application of proteins, their production, or their biochemical and biological characterization.6-9 It has been shown that glycosylation contributes favorably to protein folding.10-12 Protein glycosylation most commonly comes in two varieties: N-linked glycosylation where the glycan units are attached to the protein backbone via the nitrogen atom of an Asn side chain and O-linked glycosylation where the attachment point is the oxygen of a Ser or Thr side chain.13-15 Through many years of studies, the role of Nglycosylation as a check-point in protein folding and quality control has been well established.16 In addition, N-linked glycans, mainly the core structure, have been shown to accelerate folding, and stabilize the folded, native state of the glycoprotein conjugate independent of any outside chaperone interactions.17 N-glycans can also promote the formation of correct disulfide bonds and secondary structural elements.18 These effects may be accomplished by the combined action of specific contacts between the glycan and protein, like hydrogen bonds and CH-π interactions, as well as non-specific effects due to their steric bulk and large hydration spheres.11 This wealth of knowledge of protein N-glycosylation was facilitated by developments in the controlled preparation of relatively homogeneous N-glycosylated proteins for study17-19 and, in turn, has provided valuable insight into strategies to glycoengineer proteins.20, 21

O-Glycosylation has also long been associated with protein folding.13, 22, 23 However, despite of many decades of research, the correlation between folding and protein O-glycosylation has not been sufficiently investigated. Chief among the difficulties for obtaining such knowledge is the lack of homogeneous samples for investigation. Unlike N-glycosylation, the term “O-glycosylation” is actually a blanket term that describes many unique types of glycosylation, including the addition of N-acetylgalactosamine (GalNAc), Nacetylglucosamine (GlcNAc), mannose (Man), galactose (Gal), fucose (Fuc), glucose (Glc), or xylose (Xyl) to Ser or Thr.13, 24, 25 More complicatedly, in contrast to N-glycosylation, which is initiated in the endoplasmic reticulum (ER), the initiation sites of different types of O-glycosylation vary. Depending on the first sugar unit that is attached to protein side chain, O-glycosylation can take place either in the ER or the Golgi apparatus, and either before or after protein folding.13 Many different enzymes are responsible for the initial Oglycosylation reaction and the complex regulatory networks that control the outer saccharides of the glycans. Most of them have not been well characterized. Further complicating matters is the fact that the most common types of O-glycosylation (OGalNAcylation and O-mannosylation) have no defined consensus sequence for their introduction.25 Together, these factors render it challenging, if not impossible, to using molecular biology and biochemical approaches similar to those developed for N-glycoproteins to prepare structurally-defined Oglycopeptides or glycoproteins for folding study.26, 27 To better understand and apply protein glycosylation, it is necessary to examine the detailed effects of O-linked glycans on protein properties. To address the challenges associated with such studies, we have taken advantage of recent advances in chemical synthesis19, 28-33 and developed a chemical glycobiology approach to analyze changes caused by the O-

ACS Paragon Plus Environment

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

glycosylation of proteins.34, 35 This approach can potentially be applied to molecules with up to ~200 amino acid residues,36 an upper limit which conveniently covers many soluble proteins.37, 38 Compared to recombinant expression methods, chemical synthesis is a more precise and flexible way to generate molecules with small and predetermined variations in structures, thus allowing the quantitative evaluation of the effects of different sugar moieties at different sites. Here, we applied a chemical glycobiology approach to carry out a detailed study on the effects of O-glycosylation on protein folding in vitro, centering on the changes caused by varied O-mannosylation patterns and differences between mannose and other O-glycan types. Of the many types of Oglycosylation, O-mannosylation is the most abundant in fungi and the second most common in the mammalian nervous system.13, 25 There are also several pieces of evidence that tie Omannosylation to protein folding pathways. For example, blocking O-mannosylation leads to aggregated and mis-folded proteins accumulating in the ER39 and blocking Nglycosylation can lead to an upregulation of O-mannosylation which might be a compensatory adjustment by cells to limit the mis-folding of proteins normally aided by Nglycosylation.40 Additionally, the biosynthesis process for Omannoyslation is strikingly similar to that of N-glycosylation, as both are initiated in the ER by transferring initial glycans via dolichol derivatives. After initiation, the first O-linked mannose residue is extended in the Golgi apparatus to give a variety of complex structures.23 This information hints that under certain conditions, O-mannosylation may play a similar role to N-glycosylation and have a strong connection to protein folding.39, 41 Therefore, it is of great interest to first focus the study on this type of O-glycosylation.

Page 2 of 11

teins: contains three glycosylation sites, two at the N-terminus and one in the middle. It forms two disulfide bonds and folds into a well-defined three-dimensional structure. These features make the CBM ideal molecule for the early study of the role played by O-glycans in folding.

MATERIALS AND METHODS Materials. All commercial reagents and solvents were used as received. Unless otherwise noted, all reactions were performed under argon and purifications were performed under air atmosphere at room temperature. Automated peptide synthesis was performed on an Applied Biosystems Pioneer continuous flow peptide synthesizer. All LC-MS analyses were performed using a Waters AcquityTM Ultra Performance LC system equipped with Acquity UPLC® BEH 300 C4, 1.7µm, 2.1 x 100 mm column at flow rates of 0.3 and 0.5 mL/min. The mobile phase for LC-MS analysis was a mixture of H2O (0.1% formic acid, v/v) and acetonitrile (0.1% formic acid, v/v). A Waters Synapt G2 HDMS (q-TOF) system was used for the mass spectrometric analysis. Design and chemical synthesis CBM variants. Previous research has demonstrated that combining the data from a wide variety of glyco-variants can provide a clear view of not only the influence of protein glycosylation but also the structural factors behind that influence 18, 35, 43. Therefore, we selected 25 CBM glycoforms for the present study, which can be divided into three series for a systematic study of three different features of CBM O-glycosylation. The first series (Fig. 2, 2-10) was meant to investigate both the site-specific and glycan-size dependent consequences of O-mannosylation. The second series (Fig. 2, 11-16) looked at the synergistic effects of O-mannoses at multiple sites. The final series (Fig. 2, 1725) probed how the stereochemistry of the glycan units and linkage along with the amino acid side chains near a glycan site alter the effects of O-glycosylation on folding. The unglycosylated CBM peptide 1 was also included as a control.

Figure 1. The NMR structure of the Family 1 CBM with three Olinked mono-mannose residues at Thr1, Ser3, and Ser14 sites.42 The O-linked glycans, the glycosylated amino acids and the disulfide bonds between Cys8 and 25 and Cys19 and 35 are shown in sticks. The oxygen atoms in the stick representations are in red color, carbon in green, and sulfur in gold.

Like most of our previous studies, we used a model molecule, a carbohydrate binding module (CBM) derived from the Trichoderma reesei cellulase TrCel7A,34, 35, 42 to investigate the effects of O-mannosylation. The CBM is a small, structurally independent domain of only 36 amino acids. Its small size is an advantage since it allows us to prepare large collections of glycoforms in a reasonable amount of time and makes it much easier to gather quantitative observations (Fig. 1). Despite its small size, it has the key features of typical glycopro-

Figure 2. Amino acid sequences and glycosylation patterns of the CBM variants. The native disulfide bonds are indicated with solid

ACS Paragon Plus Environment

Page 3 of 11

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

black lines connecting the Cys residues. Glycans are indicated by R1, R2 and R3.

The synthesis of the unglycosylated and glycosylated CBM variants 1-25 was accomplished in a one-pot process following our previously developed procedures.34, 35 Fmoc protected amino acid (4.0 eq.), HATU (4.0 eq.) and DIEA (8.0 eq.) were used for the coupling steps. The amino acid derivative FmocSer(Ac4Glcα1)-OH, was synthesized by procedures based on previous work (Supporting Information).44-46 The deblock solution was a mixture of piperidine/DBU/DMF (2/2/100, v/v). Upon completion of automated synthesis on a 0.05 mmol scale, the peptide resin was cleaved with TFA/TIS/H2O (95/2.5/2.5, v/v, 10 mL) for 45 min at room temperature. The resin was filtered and washed with TFA/TIS/H2O (95/2.5/2.5, 10 mL). The filtrate was combined and concentrated by compressed air. The oily residue was precipitated with diethyl ether and centrifuged to give a white pellet. After ether was decanted, the solid was dissolved in 10 mL of MeCN/H2O (50/50, v/v,) and was lyophilized to dryness. Correct folding, disulfide bond formation, purity and identity of each variant were experimentally verified by circular dichroism (CD) spectroscopy, thermolysin digestion, and analytical ultraperformance liquid chromatography-mass spectroscopy (UPLC-MS) 34, 35. Measuring the overall folding rates of CBM variants. Overall rates of folding for each CBM variant were measured using a procedure analogous to that developed previously in our lab to produce fully folded CBM variants for later characterization and study.34, 35 Solid, lyophilized CBM peptide (3 mg) was dissolved in 180 µL hydrazine/H2O (5/95, v/v) and stirred for 30 minutes at room temperature. Hydrazine was quenched by addition of 360 µL hydrazine quench solution (AcOH/H2O, 5/95, v/v). Solution was then added to 12 mL of folding buffer (0.2 Trizma, 3 mM reduced glutathione, pH = 8.2) and allowed to stir at room temperature. During this time, at each time point (t = 0 min, 5 min, 15 min, 30 min, 60 min, 120 min, 180 min, 240 min, 300 min, 360 min, 480 min), a sample (40 µL) was removed from the folding solution, quenched with 136 µL of TFA/H2O/MeCN (0.5/85/15, v/v), and stored at -70 oC until LC-MS analysis. This procedure was carried out in triplicate and results were averaged. LC-MS was carried out with a gradient of 20→40% acetonitrile in water on a C4 column. For each sample collected, all the mass spectrum scans were combined and the weighted mass average was calculated. The [M+3H]3+ peak was used for this purpose because it was the highest intensity peak observed in all samples. Weighted average mass is calculated by multiplying the intensity of each peak by the m/z value for that peak, summing up all these numbers and dividing by the sum total of the intensities of each peak. When no disulfide bond formation has taken place, this weighted average mass matches the expected molecular weight calculated by taking into account the natural isotopic distributions of each element in the CBM peptide absent any disulfide bonds. Similarly, once folding is completed, the weighted average mass calculated this way matches the expected molecular weight of the CBM peptide with both disulfide bonds. An example of this analysis is illustrated in the Supporting Information (Fig. S2) for the unglycosylated CBM variant. Plotting the change in mass (relative to the CBM peptide without disulfide bonds) against time gives a curve that reflects the oxidative folding process of each CBM variant, and

fitting this curve allows for the calculation of a rate constant of the overall oxidative folding for each variant. Since these curves were linear through the vast majority of the folding process for each peptide, linear fits of the data were used and the slope of the resulting lines were taken as the rate constants. Tracking each molecular species during folding. CBM folding involves the formation of at least three molecular species: the fully reduced (0S) and fully oxidized (2S) as well as intermediate species containing only one disulfide bond (1S). For the CBM variants that showed good chromatographic separation of these species during the folding reaction, we further examined the changes of each species during folding. Our analysis began with the assignment of the peak for each molecular species according to mass. An example of this analysis for glycosylated CBM variant 5 is shown in the Supporting Information (Fig. S4). Once identified by mass, the area under each peak was tracked for each molecular species for each time point collected. Plotting the percent of the total area under the curve each species represented gave the plots seen in Figures 4 and S5. This analysis was done using data from each of the three trials collected above for overall folding rate. Thus folding reactions were only performed once. Initial rate constants for each species were calculated based on the initial linear portion of each curve, and at least four points were used in each case.

Figure 3. Two representative folding trajectories monitored by mass spectrometry. The upper panel shows the folding of CBM

ACS Paragon Plus Environment

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

variants with several sizes of O-mannosyl glycans at Thr1, the smallest of which led to significant changes in CBM folding rates. The bottom panel shows the folding of CBM variants with several sizes of O-mannosyl glycans at Ser14, none of which had a large effect on CBM folding.

RESULTS Impact of O-glycosylation on the overall folding rates of CBM variants. The process of protein folding is a complicated process.47 For proteins containing disulfide bonds, like CBM, it is a combination of two distinct, but interrelated processes: conformational folding of the protein sequence into secondary and tertiary structures and formation of disulfide bonds. Collectively, these processes are known as oxidative folding.48 Conformational shuffling is typically a fast process, while the disulfide bond formation steps are much slower.49-51 By taking advantage of the relatively slow kinetics in disulfide bond formation, methods that monitor oxidative folding have facilitated detailed characterizations of several disulfide containing proteins’ folding pathways and thus significantly advanced our understanding of the molecular basis of protein folding.47, 48 We opted here to also make use of disulfide-bond chemistry to compare the folding rates of different CBM variants in vitro.50, 52, 53 To elucidate the potential site- and size specific effects of O-mannosylation on the folding of CBM, nine monoglycosyl-

Page 4 of 11

ated variants, 2-10, were compared with the nonglycosylated CBM 1 on their folding rate constants (Fig. 3, 4). As previously reported, the folding rate of each variant was quantified based on the measurable mass difference between the partially and fully folded CBM and the unfolded peptide.54 Briefly, the folding reactions were initiated by dissolving each CBM peptide in a folding buffer of pH 8.2 containing 0.2 M Trizma and 3 mM reduced glutathione (GSH).55, 56 We tested several different combinations of reduced and oxidized glutathione (GSSG) for the buffer and found that using only the reduced glutathione gave the cleanest chromatographic separation of folding intermediates and the lowest amount of misfolded product (Supporting Information, Fig. S1). At different time intervals after initiation, small aliquots were removed from the folding reaction and quenched by adjusting the pH to 3.57 Because each new disulfide bond formed results in a loss of two hydrogen atoms, the average mass of the peptides in each sample could be used as a measure of the overall progress of oxidative folding at the time of aliquot collection.54, 58 Such aggregate measures of a protein’s folding pathway, which cannot specifically interrogate the individual partially folded intermediates, have a long history in the study of protein folding. Indeed, most of the early papers in the area used enzymatic activity as a measure of how folded the enzyme is under particular conditions.59, 60 Since the CBM domain has no enzymatic activity, and high-resolution mass spectroscopy has advanced significantly in the past fifty years, we chose to use

Figure 4. The relative overall folding rate constants of the synthetic CBM variants. These values were calculated by normalizing each folding rate constant to that of the unglycosylated CBM 1. All error bars reported are standard deviations (SDs) of data achieved from three separate trials. Numerical data can be found in the Supporting Information (Table S1). ** indicates p≤0.001, * indicates p≤0.05, two-tailed t-test. The structural feature of each isoform is implied by its name, i.e. CBMT1M1 representing the isoform containing a single mannose α-linked to Thr1, CBMT3M3+S3M3+S14M3 representing the isoform containing a tri-mannose α-linked to Thr1, Ser3, and Ser14, CBMS3ManNAcα representing the isoform containing a single N-acetylmannosamine α-linked to Ser3, and CBMY5A+S3(Manα) representing the isoform containing a Tyr-to-Ala mutation at position 5 and a single mannose α-linked to Ser3.

ACS Paragon Plus Environment

Page 5 of 11

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

the change in mass that accompanies a disulfide bond formation step to monitor the overall folding progress.47, 57, 61 The curves of the average mass as a function of time displayed nearly linear behavior during the majority of the folding reaction, and so the slopes of linear fits were used to calculate an observed rate constant for each variant (Fig. 3 and Supporting Information Fig. S3).62, 63 As shown in Figure 4, left, our data shows that addition of a single mannose at either Thr1 or Ser3 significantly increased the overall folding rate, while Ser14 had very little effect on the rate as compared to unglycosylated CBM (compare the rate constants of 2, 5 and 8 with 1). For each site, increasing the size of the linear mannose chains resulted in unchanged rate constant as compared to that of unglycosylated CBM (compare 3, 4, 6, 7, 9, and 10 with 1). To better understand the impact of O-glycosylation on the folding of CBM, we next conducted a study of the folding rates of CBM glycoforms with O-linked mannoses at multiple sites. As shown in Figure 4, middle, multiple mannose glycans on the CBM can result in slightly slower or unchanged folding rates, even when the glycans are at the same sites that individually increase the rate (compare 11-13, 15 and 16 with 1). Of the multiply glycosylated variants, the glycoform with a single mannose at each of the three glycosylation sites displayed an increase in the rate (compare 14 with 1). We next decided to investigate how structural details of the carbohydrate moiety contribute to its effect on the folding rate of CBM. For this, we focused on glycosylation at Ser3, which was shown to significantly alter the folding rate when the glycan was mono-mannose and had been revealed in our previous work to have the largest effect on several other physical properties of the peptide.34,35,66 Like our previous experiments, we compared the rate constants of several CBM variants with changes this time to the orientation of functional groups on the sugar ring.35 As shown in Figure 4, right, our results showed that these small changes significantly decreased the differences in the rate constants between the glycosylated variants and the unglycosylated control (compare 17-23 with 1). Of all the amide-containing carbohydrates, only Nacetylmannosamine (ManNAc), which is stereochemically most similar to Man, had a noticeable positive impact on the folding rate (compare 17-19 with 1). Comparing the α- and βlinked Glc and Gal containing glycoforms showed that neither the carbohydrate identity nor the anomeric linkage stereochemistry in these two groups of mono-saccharides had any large influence on the rate (compare 20-23 with 1). Our previous work indicating that both Gln2 and Tyr5 played important roles in stabilizing the CBM structure, hence we were also interested in how these residues might alter the folding rate. As shown in Figure 4, right, mutating either Gln2 or Tyr5 to Ala abolished the accelerating effect of mono-mannose at Ser3, resulting in similar folding rates as the unglycosylated CBM (compare 24, 25 with 5 and 1). Impact of O-glycosylation on the changes of individual molecular species involved in the folding process. Comparing the overall folding rates provided evidence that most Oglycosylation patterns only minimally affect this process, but specific patterns can drastically accelerate oxidative folding. However, the information obtained from this kind of analysis is necessarily limited and a clearer picture of the folding process for each CBM variant is desirable. It is certainly possible, given the evidence uncovered from previous studies, that the glycans might bias the folding pathway to favor formation of

one disulfide bond or direct formation of two disulfide bonds in a glycan-dependent manner.47, 48 We therefore set out to characterize the effects of O-glycosylation on different molecular species, mainly focusing on the initial rates of formation of 1S and 2S, peak time of 1S, and initial rates of the unfolded starting material, 0S (Fig. 5).

Figure 5. Changes in the relative abundances of the 0S ( ), 1S ( ), and 2S ( ) molecular species that contain 0, 1, and 2 disulfide bonds, respectively, during the folding of select CBM variants: CBMS3M1 5, CBMS14M3 10, and CBMS3Glcα 20, each representing a different folding pattern. All error bars reported are SDs of data achieved from three separate trials.

ACS Paragon Plus Environment

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

For a more detailed analysis of folding, we needed a way to track each molecular species individually. Fortunately, we found that, for analogs 5-7, 10-15, 17-23, and 25, the UPLCMS gave modestly resolved separation of the different species observed during folding (Supporting Information Fig. S4). By mass, peaks could be identified for 0S, 1S, and 2S forms. Once the peaks for these species were identified, the area under each peak could be used as a measure of the amount of each species in the folding mixture. Comparing the area of each peak to the total area under the curve for the trace gives a measure of the relative concentration of those species.64 Since we could resolve these peaks by retention time, use their mass as an indication of their identity, and quantify their concentrations based on peak areas, we were able to track the formation and disappearance of each species, and calculate their initial rates of formation or disappearance, and peak times. The resulting plots produced from the detailed analysis of three representative examples, CBMS3M1 5, CBMS14M3 10, and CBMS3Glcα 20, are shown in Figure 5 (Supporting Information contains all the data obtained this way, Fig. S5). As seen in the results, each of the CBM variants showed some similarities in the folding process. At the beginning, the samples mainly contain pure 0S for each analog. As time passes, the fraction of 0S falls while the fraction of 2S increases. The fraction of 1S rises initially before reaching a peak and falling. By comparing the 2S products with purified, folded peptides for which proper disulfide bond connectivity has been verified in our previous studies,35, 65 we can conclude that the 2S products observed in these experiments are correctly folded. Interestingly, the presence of cross-disulfide species consisting of a molecule of glutathione linked to the CBM, which is expected to occur given the fact that glutathione was used in the folding buffer,66 was not seen on any of the UPLC-MS traces. This suggests that these intermediates are very short-lived and react

Page 6 of 11

quickly to form either a new disulfide bond in the forward direction or reduced starting material in the backward direction. However, despite these similarities, there are significant differences in the reactions of different molecular species, probably reflecting differences in the rates of formation and reaction of this intermediate between differently glycosylated CBM analogs. This can also be illustrated by the data of analogs CBMS3M1 5, CBMS14M3 10, and CBMS3Glcα 20. As shown in Figure 5, the disappearance of the 0S species in CBMS3M1 5 follows an exponential decay, rapidly halving the initial concentration within the first hour. This sharp decrease is coupled to a sharp increase in the fraction of 1S species observed, which quickly peaks at around 40% of the total at the end of the first hour of folding. The 2S species increases at a notably slower pace and follows a mostly linear trend upwards until about three hours into the reaction when about three quarters of the CBMS3M1 5 is in the 2S form. After eight hours, the reaction is essentially complete with negligible amounts of 0S and 1S species, and almost complete conversion to the 2S species. In contrast, CBMS14M3 10 shows a consistently linear rate of disappearance of the 0S species. The fraction of the 1S species increases steadily as well, but notably at a much slower pace than that of CBMS3M1 5. As with CBMS3M1 5, the 1S species of CBMS14M3 10 peaks at around 40% of the total, but takes around four hours to reach this level. Another sharp contrast between CBMS14M3 10 and CBMS3M1 5, is the build-up of 2S in CBMS14M3 10. It was unobservable during the first hour of folding, appeared as a very low fraction of the total after two hours, and then steadily increased in a linear fashion until reaction completion. The folding of CBMS14M3 10, like CBMS3M1 5, was essentially complete after eight hours, although there was a small amount of 1S species remaining in the final sample. CBM

Figure 6. Initial rate constant for each species (0S, 1S and 2S) observed in the oxidative folding reactions. Rate constants derived from fitting the initial linear portion of each curve over at least 4 data points. Only CBM variants with good chromatographic separation on UPLC for each of the three species were used in this analysis. Rate of disappearance of 0S is in white and is negative. Rates of formation of 1S and 2S are in grey and black, respectively, and are both positive. Bars represent the average rates from three trials and error bars come from the standard deviation of those three trials. Numerical data can be found in the Supporting Information (Table S2).

ACS Paragon Plus Environment

Page 7 of 11

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

CBMS3Glcα 20 presents yet another variation. The 0S species disappears with an initially high rate before leveling out to be largely linear. The 2S species is formed following an exponential growth pattern, increasing in rate throughout the eight hours measured during the experiment. Unlike both CBMS3M1 5 and CBMS14M3 10, the 1S species peaks at only around 20% of the sample and reaches this peak after 2 hours of folding. After eight hours, the folding of CBMS3Glcα 20 is far less complete than either CBMS3M1 5 or CBMS14M3 10, with only around three-quarters of the total CBM being in the 2S form and a significant amount remaining in the totally reduced 0S form. To quantify the differences between each of CBM variants, we determined the initial reaction rates for each of the molecules. Since the concentrations of intermediates and products are negligible in the initial rates region, the initial rates may more accurately reflect the effects of glycosylation. In total, we were able to achieve a detailed analysis of the folding of 17 CBM glycoforms. As shown in Figure 6, our results revealed that glycosylation of Ser3 with a mannose monomer (CBMS3M1 5) resulted in rates of formation for both 1S and 2S species, and also a rate of disappearance for 0S, that are much higher than most of the other analogs for which data was available. While CBMS3M2 6 had similarly large initial rates for formation of 1S species, its rate of formation of the final 2S product was very low; and overall, the rate of 0S disappearance was lower than that of CBMS3M1 5. It seems that the mannose dimer of CBMS3M2 6 slows mainly the rate of formation of 2S species, while the mannose trimers of CBMS3M3 7 and CBMS14M3 10 seem to slow both the formation of the 1S intermediate and the 2S product. Multiply glycosylated CBMS1M1+S3M1+S14M1 14, which was observed to have an increase in the observed overall folding rate constant, also showed relatively large initial rates for all three species monitored, although not nearly as large as those of 5. Additionally, our results suggest that mono-mannose has a uniquely significant ability to increase the initial rates of formation for 1S and 2S species. All the other monomeric glycans tested showed much lower initial rates for both the formation of the 1S and 2S, as well as the rate of disappearance of 0S.

DISCUSSION Protein glycosylation is an extremely prevalent posttranslational modification, especially among secreted and membrane proteins.14 It can have important effects, both extrinsic and intrinsic.11, 67 Because a very specific consensus sequence makes obtaining homogeneous N-glycosylated proteins fairly convenient, most previous studies have focused on deciphering the effects of N-glycans.10, 68, 69 From these studies, it has been shown that the first sugar unit, GlcNAc, which is directly attached to the protein, confers almost the entire accelerating effect of N-glycans on protein folding.17 Although the exact physical and chemical explanations for these phenomena are not known with absolute certainty, there is considerable evidence that contacts, including hydrogen bonds, hydrophobic interactions, and CH-π interactions, between the core carbohydrates of the N-glycan structure and the proximal amino acids in the protein are the root cause of these observations.17, 21 Although it has thus far received less attention, Oglycosylation is also very common on proteins.70 Whether and how protein O-glycosylation participates in protein folding

and quality control is poorly understood. Recent results show that O-linked mannoses are involved in ER protein quality control, but it is not clear if they also affect other closely related folding properties or what structural features determine these effects.23, 41, 71 As detailed here, through a systematic study of O-glycosylated CBM variants, we were able to quantify our understanding of protein O-glycosylation’s contributions in the realm of oxidative folding. From those results, it was observed that only the first mannose residue is responsible for O-mannose glycans’ acceleration of folding rate in the CBM. This finding, together with the fact that Omannosylation biosynthesis involves the transfer of monomannose to selected amino acid residues in ER with further diversification only in the Golgi apparatus,23, 72 suggests that mono-mannoses added in the ER may to some extent be analogous to N-glycans by acting to intrinsically accelerate correct folding of proteins in vivo. These results also match well with recent work showing that O-fucosylation of thrombospondin type 1 repeats (TSRs) likely represents a novel ER-based quality control mechanism for ensuring TSR-containing proteins achieve their proper disulfide pairing patterns.73 Many naturally-occurring glycoproteins contain more than one glycosylation site. Therefore, it is also essential to establish the differences of different sites in regulating protein folding and the potential of interplay between these sites. Such analysis has rarely been performed so far, in large part due to the difficulty of selectively targeting only particular glycosylation sites using glycosyltransferases without changing protein amino acid sequences.20 The synthetic methods used here to prepare glycosylated CBM variants nicely side-step this issue and allowed us to show that effects of O-glycosylation are site-specific and there is no synergistic enhancement of the folding rate conferred by different glycosylation sites. In particular, our results highlight the importance of the N-terminal sites, both Thr1 and Ser3, in accelerating the folding of CBM over the more central Ser14 site. Protein folding is a complex process involving many different types of interactions.74 In order to develop a better understanding of the effects of O-glycosylation on folding, it is necessary to get some insights into the molecular determinants of such effects. Our results clearly showed that of a suite of monomeric glycans, only O-linked mannose is optimal for accelerating the folding of CBM. Additionally, mutating specific amino acids that are adjacent to a glycosylation site decreases or abolishes the effects of mannoses. Such dependence suggests that the peptide sequence of CBM may be evolutionarily adjusted to satisfy O-mannosylation and the interplay between the mono-mannose and amino acids at the N-terminus that is necessary for the effects induced by the mono-mannosylation at Thr1 and Ser3. An important question is whether glycosylation patterns with overall rate-accelerating effects simultaneously increase the rates of all the steps in the folding process or just some steps. By providing more information about the detailed effects of glycosylation on protein folding, our study revealed an interesting, but not unexpected correlation. From the data presented in Figures 4 and 6, it seems that glycosylation that increases the overall folding rate also increases the formation of both 1S intermediates and 2S products. In summary, by systematically comparing the properties of differently glycosylated CBM variants in vitro, we found that O-mannosylation has a strong site- and size-specific influence

ACS Paragon Plus Environment

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

on the folding of the CBM. Our results clearly show that only the attachment of a mono-mannose to the two N-terminal glycosylation sites can significantly increase the overall folding rate of CBM. The greatest increase occurring at Thr 1, with an observed folding rate more than 5 times faster than that of unglycosylated CBM. Also noteworthy is the fact that monomeric O-mannose glycans are added in the ER during early protein synthesis, and these were seen here to accelerate the rate of protein folding. Meanwhile larger mannosyl glycans are generated after oxidative folding in the Golgi and were seen here to have a negligible effect on folding. Thus, the effects of O-mannosylation on protein folding seem to vary in a manner that mirrors the biosynthetic pathways of Omannosylated glycoproteins. Our results also support that many factors contribute to the effects of glycosylation and small changes in either glycan structures or amino acid sequences can lead to large differences in the ability of glycosylation to have an effect on protein folding. Furthermore, our data suggest that glycosylation has different effects on the rates of each step in the folding process, although only an acceleration of the rates of all steps will lead to an observed increase in the overall rate. Taken together, this study illustrates that O-glycosylation can significantly affect the folding process of proteins and suggests a role for O-linked glycans in expediting glycoprotein biosynthesis. The generality and implications of these findings need to be explored through detailed in vitro studies of other glycoproteins paired with rigorous in vivo work. Such efforts are currently being pursued and we hope they will answer the many remaining questions.

ASSOCIATED CONTENT Supporting Information. Experimental procedures, spectroscopic and analytical data. This material is available free of charge via the Internet at http://pubs.acs.org.

AUTHOR INFORMATION Corresponding Author * [email protected]

ORCID: 0000-0002-9302-150X Notes The authors declare no competing financial interest.

Author Contributions All authors have given approval to the final version of the manuscript. ‡These authors contributed equally.

ACKNOWLEDGMENT We would like to thank the University of Colorado Boulder (Start-up fund), the NSF CAREER Award (Grant number: CHE1454925) for their support during the course of this study.

ABBREVIATIONS 0S, fully reduced CBM; 1S, CBM species containing one disulfide bond; 2S, fully oxidized CBM; CBM, carbohydrate-binding module; Fuc, fucose; Gal, galactose; GalNAc, N-

acetylgalactosamine; Glc, glucose; GlcNAc, Nacetylglucosamine; Man, mannose; TrCel7A, Trichoderma reesei cellulose; Xyl, xylose.

REFERENCES

Page 8 of 11

(1) Zheng, C. J., Han, L. Y., Yap, C. W., Ji, Z. L., Cao, Z. W., and Chen, Y. Z. (2006) Therapeutic targets: progress of their exploration and investigation of their characteristics, Pharmacol. Rev. 58, 259-279. (2) Olempska-Beer, Z. S., Merker, R. I., Ditto, M. D., and DiNovi, M. J. (2006) Food-processing enzymes from recombinant microorganisms-a review, Regul. Toxicol. Pharmacol. 45, 144158. (3) Fersht, A. R. (2008) From the first protein structures to our current knowledge of protein folding: delights and scepticisms, Nat. Rev. Mol. Cell. Bio. 9, 650-654. (4) Dimitrov, D. S. (2012) Therapeutic proteins, Methods Mol. Biol. 899, 1-26. (5) Nigam, P. S. (2013) Microbial enzymes with special characteristics for biotechnological applications, Biomolecules 3, 597-611. (6) Cleland, J. L. (1993) Protein Folding: In Vivo and In Vitro, Wiley, Hoboken. (7) Stevens, R. C. (2000) Design of high-throughput methods of protein production for structural biology, Structure 8, R177-185. (8) Palomares, L. A., Estrada-Mondaca, S., and Ramirez, O. T. (2004) Production of recombinant proteins: challenges and solutions, Methods Mol. Biol. 267, 15-52. (9) Trevino, S. R., Scholtz, J. M., and Pace, C. N. (2008) Measuring and increasing protein solubility, J. Pharm. Sci. 97, 4155-4166. (10) Sola, R. J., Rodriguez-Martinez, J. A., and Griebenow, K. (2007) Modulation of protein biophysical properties by chemical glycosylation: biochemical insights and biomedical implications, Cell. Mol. Life Sci. 64, 2133-2152. (11) Hebert, D. N., Lamriben, L., Powers, E. T., and Kelly, J. W. (2014) The intrinsic and extrinsic effects of N-linked glycans on glycoproteostasis, Nat. Chem. Biol. 10, 902-910. (12) Varki, A. (2017) Biological roles of glycans, Glycobiology 27, 3-49. (13) Spiro, R. G. (2002) Protein glycosylation: nature, distribution, enzymatic formation, and disease implications of glycopeptide bonds, Glycobiology 12, 43R-56R. (14) Varki, A. (2009) Essentials of glycobiology, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (15) Tan, Z., and Wang, L. X. (2017) Chemical Biology of Glycoproteins, The Royal Society of Chemistry, Cambridge. (16) Roth, J., Zuber, C., Park, S., Jang, I., Lee, Y., Kysela, K. G., Le Fourn, V., Santimaria, R., Guhl, B., and Cho, J. W. (2010) Protein N-glycosylation, protein folding, and protein quality control, Mol. Cells 30, 497-506. (17) Hanson, S. R., Culyba, E. K., Hsu, T. L., Wong, C. H., Kelly, J. W., and Powers, E. T. (2009) The core trisaccharide of an N-linked glycoprotein intrinsically accelerates folding and enhances stability, Proc. Natl. Acad. Sci. U. S. A. 106, 3131-3136. (18) O'Connor, S. E., Pohlmann, J., Imperiali, B., Saskiawan, I., and Yamamoto, K. (2001) Probing the effect of the outer saccharide residues of N-linked glycans on peptide conformation, J. Am. Chem. Soc. 123, 6187-6188. (19) Wang, P., Dong, S., Shieh, J. H., Peguero, E., Hendrickson, R., Moore, M. A., and Danishefsky, S. J. (2013) Erythropoietin derived by chemical synthesis, Science 342, 1357-1360. (20) Sinclair, A. M., and Elliott, S. (2005) Glycoengineering: the effect of glycosylation on the properties of therapeutic proteins, J. Pharm. Sci. 94, 1626-1635. (21) Price, J. L., Culyba, E. K., Chen, W., Murray, A. N., Hanson, S. R., Wong, C. H., Powers, E. T., and Kelly, J. W. (2012) N-glycosylation of enhanced aromatic sequons to increase glycoprotein stability, Biopolymers 98, 195-211. (22) Sola, R. J., and Griebenow, K. (2009) Effects of glycosylation on the stability of protein pharmaceuticals, J. Pharm. Sci. 98, 1223-1245.

ACS Paragon Plus Environment

Page 9 of 11

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

(23) Xu, C., and Ng, D. T. (2015) Glycosylation-directed quality control of protein folding, Nat. Rev. Mol. Cell. Bio. 16, 742-752. (24) Van den Steen, P., Rudd, P. M., Dwek, R. A., and Opdenakker, G. (1998) Concepts and principles of O-linked glycosylation, Crit. Rev. Biochem. Mol. Biol. 33, 151-208. (25) Chaffey, P. K., Chi, L., and Tan, Z. (2017) Chemical Biology of Protein O-Glycosylation, In Chemical Biology of Glycoproteins, pp 48-93, The Royal Society of Chemistry, Cambridge. (26) Thanka Christlet, T. H., and Veluraja, K. (2001) Database analysis of O-glycosylation sites in proteins, Biophys. J. 80, 952960. (27) Krasnova, L., and Wong, C. H. (2016) Understanding the chemistry and biology of glycosylation with glycan synthesis, Annu. Rev. Biochem. 85, 599-630. (28) Koeller, K. M., Smith, M. E. B., Huang, R.-F., and Wong, C.-H. (2000) Chemoenzymatic synthesis of a PSGL-1 N-terminal glycopeptide containing tyrosine sulfate and α-O-linked sialyl Lewis X, J. Am. Chem. Soc. 122, 4241-4242. (29) Takano, Y., Hojo, H., Kojima, N., and Nakahara, Y. (2004) Synthesis of a mimic for the heterogeneous surface of core 2 sialoglycan-linked glycoprotein, Org. Lett. 6, 3135-3138. (30) Mo, K. F., Fang, T., Stalnaker, S. H., Kirby, P. S., Liu, M., Wells, L., Pierce, M., Live, D. H., and Boons, G. J. (2011) Synthetic, structural, and biosynthetic studies of an unusual phospho-glycopeptide derived from alpha-dystroglycan, J. Am. Chem. Soc. 133, 14418-14430. (31) Yoshida, K., Yang, B., Yang, W., Zhang, Z., Zhang, J., and Huang, X. (2014) Chemical synthesis of syndecan-3 glycopeptides bearing two heparan sulfate glycan chains, Angew. Chem. Int. Ed. Engl. 53, 9051-9058. (32) Horiya, S., Bailey, J. K., Temme, J. S., Guillen Schlippe, Y. V., and Krauss, I. J. (2014) Directed evolution of multivalent glycopeptides tightly recognized by HIV antibody 2G12, J. Am. Chem. Soc. 136, 5407-5415. (33) Santra, A., Yu, H., Tasnima, N., Muthana, M. M., Li, Y., Zeng, J., Kenyon, N. J., Louie, A. Y., and Chen, X. (2016) Systematic chemoenzymatic synthesis of O-sulfated sialyl Lewis X antigens, Chem. Sci. 7, 2827-2831. (34) Chen, L., Drake, M. R., Resch, M. G., Greene, E. R., Himmel, M. E., Chaffey, P. K., Beckham, G. T., and Tan, Z. (2014) Specificity of O-glycosylation in enhancing the stability and cellulose binding affinity of Family 1 carbohydrate-binding modules, Proc. Natl. Acad. Sci. U. S. A. 111, 7612-7617. (35) Guan, X., Chaffey, P. K., Zeng, C., Greene, E. R., Chen, L., Drake, M. R., Chen, C., Groobman, A., Resch, M. G., Himmel, M. E., Beckham, G. T., and Tan, Z. (2015) Molecular-scale features that govern the effects of O-glycosylation on a carbohydrate-binding module, Chem. Sci. 6, 7185-7189. (36) Brocchieri, L., and Karlin, S. (2005) Protein length in eukaryotic and prokaryotic proteomes, Nucleic Acids Res. 33, 3390-3400. (37) Nilsson, B. L., Soellner, M. B., and Raines, R. T. (2005) Chemical synthesis of proteins, Annu. Rev. Biophys. Biomol. Struct. 34, 91-118. (38) Fernandez-Tejada, A., Brailsford, J., Zhang, Q., Shieh, J. H., Moore, M. A., and Danishefsky, S. J. (2015) Total synthesis of glycosylated proteins, Top. Curr. Chem. 362, 1-26. (39) Nakatsukasa, K., Okada, S., Umebayashi, K., Fukuda, R., Nishikawa, S., and Endo, T. (2004) Roles of O-mannosylation of aberrant proteins in reduction of the load for endoplasmic reticulum chaperones in yeast, J. Biol. Chem. 279, 49762-49772. (40) Harty, C., Strahl, S., and Romisch, K. (2001) Omannosylation protects mutant alpha-factor precursor from endoplasmic reticulum-associated degradation, Mol. Biol. Cell. 12, 1093-1101.

(41) Xu, C., Wang, S., Thibault, G., and Ng, D. T. (2013) Futile protein folding cycles in the ER are terminated by the unfolded protein O-mannosylation pathway, Science 340, 978-981. (42) Happs, R. M., Guan, X., Resch, M. G., Davis, M. F., Beckham, G. T., Tan, Z., and Crowley, M. F. (2015) Oglycosylation effects on family 1 carbohydrate-binding module solution structures, FEBS J. 282, 4341-4356. (43) Chen, W., Enck, S., Price, J. L., Powers, D. L., Powers, E. T., Wong, C. H., Dyson, H. J., and Kelly, J. W. (2013) Structural and energetic basis of carbohydrate-aromatic packing interactions in proteins, J. Am. Chem. Soc. 135, 9877-9884. (44) Jörgen, O., and Göran, M. (2000) Galabiosyl donors; efficient synthesis from 1,2,3,4,6-penta-O-acetyl-β-dgalactopyranose, Carbohydr. Res. 329, 49-55. (45) Waschke, D., Thimm, J., and Thiem, J. (2011) Highly efficient synthesis of ketoheptoses, Org. Lett. 13, 3628-3631. (46) Gould, N. D., Liana Allen, C., Nam, B. C., Schepartz, A., and Miller, S. J. (2013) Combined Lewis acid and Bronsted acidmediated reactivity of glycosyl trichloroacetimidate donors, Carbohydr. Res. 382, 36-42. (47) Pain, R. H. (2000) Mechanisms of Protein Folding, Oxford University Press, Oxford. (48) Chang, R. J. Y., and Ventura, S. (2011) Folding of Disulfide Proteins, Springer, New York. (49) Creighton, T. E., Zapun, A., and Darby, N. J. (1995) Mechanisms and catalysts of disulfide bond formation in proteins, Trends Biotechnol. 13, 18-23. (50) Wedemeyer, W. J., Welker, E., Narayan, M., and Scheraga, H. A. (2000) Disulfide bonds and protein folding, Biochemistry 39, 4207-4216. (51) Mamathambika, B. S., and Bardwell, J. C. (2008) Disulfidelinked protein folding pathways, Annu. Rev. Cell. Dev. Biol. 24, 211-235. (52) Darby, N., and Creighton, T. E. (1997) Probing protein folding and stability using disulfide bonds, Mol. Biotechnol. 7, 5777. (53) Ventura, S. (2008) Oxidative protein folding: from the test tube to in vivo insights, Antioxid. Redox Signal. 10, 51-53. (54) Rehder, D. S., and Borges, C. R. (2010) Cysteine sulfenic acid as an intermediate in disulfide bond formation and nonenzymatic protein folding, Biochemistry 49, 7748-7755. (55) Chandrasekhar, S., and Topp, E. M. (2015) Thiol-disulfide exchange in peptides derived from human growth hormone during lyophilization and storage in the solid state, J. Pharm. Sci. 104, 1291-1302. (56) Chandrasekhar, S., Moorthy, B. S., Xie, R., and Topp, E. M. (2016) Thiol-disulfide exchange in human growth hormone, Pharm. Res. 33, 1370-1382. (57) Arolas, J. L., Aviles, F. X., Chang, J. Y., and Ventura, S. (2006) Folding of small disulfide-rich proteins: clarifying the puzzle, Trends Biochem. Sci. 31, 292-301. (58) Yergey, J., Heller, D., Hansen, G., Cotter, R. J., and Fenselau, C. (1983) Isotopic distributions in mass spectra of large molecules, Anal. Chem. 55, 353-356. (59) Anfinsen, C. B., Haber, E., Sela, M., and White, F. H., Jr. (1961) The kinetics of formation of native ribonuclease during oxidation of the reduced polypeptide chain, Proc. Natl. Acad. Sci. U. S. A. 47, 1309-1314. (60) Chiti, F., Taddei, N., Giannoni, E., van Nuland, N. A., Ramponi, G., and Dobson, C. M. (1999) Development of enzymatic activity during protein folding. Detection of a spectroscopically silent native-like intermediate of muscle acylphosphatase, J. Biol. Chem. 274, 20151-20158. (61) Borges, C. R., and Sherma, N. D. (2014) Techniques for the analysis of cysteine sulfhydryls and oxidative protein folding, Antioxid. Redox Signal. 21, 511-531.

ACS Paragon Plus Environment

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

(62) Ha, J. H., and Loh, S. N. (1998) Changes in side chain packing during apomyoglobin folding characterized by pulsed thiol-disulfide exchange, Nat. Struct. Biol. 5, 730-737. (63) Knubovets, T., Osterhout, J. J., Connolly, P. J., and Klibanov, A. M. (1999) Structure, thermostability, and conformational flexibility of hen egg-white lysozyme dissolved in glycerol, Proc. Natl. Acad. Sci. U. S. A. 96, 1262-1267. (64) Zhang, G., Ueberheide, B. M., Waldemarson, S., Myung, S., Molloy, K., Eriksson, J., Chait, B. T., Neubert, T. A., and Fenyo, D. (2010) Protein quantitation using mass spectrometry, Methods Mol. Biol. 673, 211-222. (65) Chaffey, P. K., Guan, X., Chen, C., Ruan, Y., Wang, X., Tran, A. H., Koelsch, T. N., Cui, Q., Feng, Y., and Tan, Z. (2017) Structural insight into the stabilizing effect of O-glycosylation, Biochemistry 56, 2897-2906. (66) Broglia, R. A., Romero, L. S., Tiana, G., and fisica, S. i. d. (2007) Protein Folding and Drug Design, Ios Press, Amsterdam. (67) Varki, A. (1993) Biological roles of oligosaccharides: all of the theories are correct, Glycobiology 3, 97-130. (68) Shental-Bechor, D., and Levy, Y. (2009) Folding of glycoproteins: toward understanding the biophysics of the glycosylation code, Curr. Opin. Struct. Biol. 19, 524-533. (69) Ferris, S. P., Kodali, V. K., and Kaufman, R. J. (2014) Glycoprotein folding and quality-control mechanisms in proteinfolding diseases, Dis. Model. Mech. 7, 331-341.

Page 10 of 11

(70) Wang, A. C., Jensen, E. H., Rexach, J. E., Vinters, H. V., and Hsieh-Wilson, L. C. (2016) Loss of O-GlcNAc glycosylation in forebrain excitatory neurons induces neurodegeneration, Proc. Natl. Acad. Sci. U. S. A. 113, 15120-15125. (71) Vasudevan, D., and Haltiwanger, R. S. (2014) Novel roles for O-linked glycans in protein folding, Glycoconj. J. 31, 417426. (72) Lommel, M., and Strahl, S. (2009) Protein Omannosylation: conserved from bacteria to humans, Glycobiology 19, 816-828. (73) Vasudevan, D., Takeuchi, H., Johar, S. S., Majerus, E., and Haltiwanger, R. S. (2015) Peters plus syndrome mutations disrupt a noncanonical ER quality-control mechanism, Curr. Biol. 25, 286-295. (74) Narayan, M., Welker, E., Wedemeyer, W. J., and Scheraga, H. A. (2000) Oxidative folding of proteins, Acc. Chem. Res. 33, 805-812.

ACS Paragon Plus Environment

Page 11 of 11

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 ACS Paragon Plus Environment

11