2136
Biochemistry 1994,33, 2136-2141
A Protein Dissection Study of a Molten Globule7 Zheng-yu Peng and Peter S. Kim’ Howard Hughes Medical Institute, Whitehead Institute for Biomedical Research, Department of Biology, Massachusetts Institute of Technology, Nine Cambridge Center, Cambridge, Massachusetts 021 42 Received September 7, 1993; Revised Manuscript Received December 16, 1993”
ABSTRACT: Proteins have many distinct tertiary folds (Richardson, J. S. (1981) Adu. Prot. Chem. 34, 167-339). The term tertiary fold refers to the spatial organization of secondary structure elements (a-
helices and &strands). It is not known when, in the process of protein folding, a native tertiary fold emerges. Here, we show that the helical domain of human a-lactalbumin, in isolation, forms a molten globule with the same overall tertiary fold as that found in intact a-lactalbumin. Formation of this nativelike fold does not require extensive, specific side-chain packing. Our results suggest that much of the information transfer from one-dimension to three-dimensions has occurred a t the molten globule stage of protein folding.
Molten globules have received substantial attention because they have been postulated to be general, early intermediates of protein folding (for reviews, see Ptitsyn, 1987, 1992; Kuwajima, 1989;Christensen & Pain, 1991; Haynie & Freire, 1993). The properties of molten globules include the following: (i) substantial secondary structure, comparable to that of the native protein; (ii) absence of well-defined tertiary packing; (iii) lack of a cooperativethermal unfolding transition; and (iv) compactness. Very little is known about the tertiary structures of molten globules; such information would clarify the role of these intermediates in the process of protein folding. The best-studied molten globule is the low-pH form (Astate)’ of a-lactalbumin (a-LA) (Kuwajima et al., 1976,1985; Nozaka et al., 1978; Dolgikh et al., 1981, 1985; Ikeguchi et al., 1986; Gast et al., 1986; Baum et al., 1989;Xieet al., 1991; Alexandrescu et al., 1993). The following observationssuggest that the a-helical domain forms the core of the a-LA molten globule. First, the molten globule of a-LA has essentially the same helix content as native a-LA (Kuwajima et al., 1976; Nozaka et al., 1978; Dolgikh et al., 1985). Second, NMR amide proton exchange and NOE studies suggest that at least a subset of the helices in native a-LA remains helical in the A-state (Baum et al., 1989;Alexandrescuet al., 1993). Finally, the a-helical domain forms early in the folding of hen eggwhite lysozyme (Miranker et al., 1991; Radford et al., 1992), a protein that is structurally homologous to a-LA. One approach for studying protein-folding intermediates is to remove regions that are thought to be unnecessary for the structure and properties of interest. We refer to this approach as “protein dissection”. Protein dissection has been successfully applied to a number of proteins (Bierzynski et al., 1982; Vita et al., 1984; Dyson et al., 1985; Oas & Kim, 1988; Staley & Kim, 1990; Tasayco & Carey, 1992). We have designed and constructed a single-chain, recombinant model of the a-helical domain of a-LA, called a-Domain (Figure 1). a-Domain consists of residues 1-39 Supported by the Howard Hughes Medical Institute. Abstract published in Aduance ACS Abstracts, February 1, 1994. 1 Abbreviations: a-LA, a-lactalbumin;A-state,low-pH form of a-LA; a-Domain, a recombinantmodel of the isolated helical domain of human a-LA containing residues 1-39 and 81-123 connected by a linker of threeglycineresiduesand containingthe substitutionCys 91 -Ala; CD, circular dichroism; NMR, nuclear magnetic resonance; NOE,nuclear Overhauser effect; HPLC, high-performance liquid chromatography; GuHCl, guanidine hydrochloride; DTT, dithiothreitol; TFA, trifluoroacetic acid. f
0006-2960/94/0433-2 136$04.50/0
~ i 2 0j C
FIGURE1: Schematic representation (Priestle, 1988) of a-LA. The a-helical domain contains all four a-helices in a-LA, whereas the &sheet domain contains a small antiparallel &sheet and several looplike structures (Acharya et al., 1989, 1991). The recombinant a-Domain (shaded) consists of residues 1-39 and 81-123 of a-LA plus a linker of three glycines and the substitution Cys 91 Ala. a-LA has four disulfide bonds; two of them (6-120 and 28-1 11) are in the a-helical domain, one (61-77) is in the &sheet domain, and one (73-91) connects the two domains. The two disulfide bonds in a-Domainox(6-120 and 28-1 11) are shown in black.
-
and 81-123 of human a-LA, connected by a short linker of three glycines (the distance between the carbonyl of Gln 39 and the amide of Leu 81 is 7 A in the crystal structure of a-LA). Cys 91, which forms an interdomain disulfide bond in a-LA, is changed to Ala to avoid unwanted thiol-disulfide reactions. a-Domain has 86 amino acids, including an N-terminal methionine as a result of expression in Escherichia coli. For simplicity, the a-LA numbering system is used for residues in a-Domain.
MATERIALS AND METHODS Recombinant a-Domain was expressed in E . coli from a synthetic gene driven by the T7 promoter, using the cloning vector pAED4 (Doering, 1992). Cells were lysed by sonication in buffer containing 50 mM Tris, 25% sucrose, and 1 mM EDTA, pH 8. The insoluble fraction was washed with the same buffer and with buffer containing 20 mM Tris, 1%Triton X- 100, and 1 mM EDTA, pH 8. The resulting pellet (inclusion bodies) was solubilized in 50 mM Tris, 50 mM NaCl, 8 M urea, 5 mM EDTA, and 0.1 M DTT, pH 8.5, and loaded onto 0 1994 American Chemical Society
Protein Dissection of a Molten Globule a fast-flow DEAE-Sepharose (Sigma) column equilibrated with 50 mM Tris, 50 mM NaCl, 4 M urea, 1 mM EDTA, and 5 mM DTT. The column was eluted with a linear gradient of NaCl (final concentration, 0.5 M) in 4 M urea. The partially purified a-Domain was diluted to -0.5 mg/mL with 4 M urea and dialyzed extensively at 4 "C against a buffer containing 10 mM Tris, 50 mM NaCl, 1 mM CaCL2.5 mM reduced glutathione, and 0.5 mM oxidized glutathione, pH 8.5, during which oxidative refolding occurs. The oxidized a-Domain, denoted a-DomainoX,was purified by reverse-phase HPLC. The identity of a-DomainoXwas confirmed by laser desorption mass spectrometry (Finnigan LASERMAT; calculated molecular weight, 9663 Da; observed, 9662-9665 Da). The disulfide-bond pattern in a-DomainoXwas assigned by digestion with pepsin (Carr et al., 199 1) followed by analytical reverse-phase HPLC. Two peptide fragments with elution times that are shifted in the presence of DTT were isolated. N-terminal sequencing, amino acid analysis, and mass spectrometry indicated that these fragments correspond to (i) Phe 3-Leu 11/Trp 118-Leu 123 (disulfide 6-120) and (ii) Ile 27-Phe 31/Trp 104-Thr 112 (disulfide 28-111). Sedimentation equilibrium studies were performed at 4 OC on a Beckman XL-A analytical ultracentrifuge, using an An60 Ti rotor at speeds of 28 and 33 krpm for a-DomainoXor 25 and 30 krpm for human a-LA. The protein solutions (- 1 mg/mL) were dialyzed against buffers containing 10 mM Tris and 0.5 mM EDTA, pH 8.5, for &-DomainoX and 10 mM Tris and 1 mM CaC12, pH 8.5, or 10 mM HCl, pH 2, for human a-LA. Three dilutions with final protein concentrations of 60-80, 20-27, and 6-9 pM were run in Beckman 6-sector cells. The apparent molecular weights were calculated for each sector using the program NONLIN (courtesy of M. L. Johnson and J. Lary) with a partial specific volume of 0.736 for a-DomainoXand 0.729 for human a-LA (Laue et al., 1992). CD spectra were recorded at 0 OC on an Aviv 62DS circular dichroism spectrometer equipped with a thermoelectric temperature controller; 10,20,40, and 80 pM protein in a l-mm path length cuvette were used for far-UV studies (1.5-nm bandwidth), and 40 pM protein in a 10-mm cuvette was used for near-UV studies (5-nm bandwidth). Samples at pH 8.5 contained 10 mM Tris and 0.5 mM EDTA for a-DomainoX or 10 mM Tris and 1 mM CaC12 for human a-LA. In addition, the sample of reduced a-Domain contained 2 mM DTT. Samples at pH 2 contained no buffer; the pH was adjusted with 0.1 M HCl. Unfolded a-DomainoXwas studied in 6 M GuHC1, 0.1 M Tris, and 0.5 mM EDTA, pH 8.5. At pH 2, both a-DomainoX and human a-LA exhibited a higher ellipticity in the far-UV region than at pH 8.5, for reasons that are not well understood. The mean residue ellipticities of a-Domainox, [6-28; 111-1201, [6-111; 28-1201, and reduced a-Domain did not depend on protein concentration from 10 to 80 pM. For determination of the temperature dependence at 222 nm, 10 pM protein in a 10-mm cuvette was used. All temperature traces were over 80% reversible, and the qualitative features in the forward and reverse directions were the same. The protein concentrations were determined by absorbance at 280 nm in 6 M GuHCl (Edelhoch, 1967). Fluorescence spectra were recorded at 4 "C on an ISS PC spectrofluorimeter operating in ratio mode. The excitation wavelength was 278 nm. Both the excitation and emission slits were set to 1 mm. The samples contained 10 pM protein in the same buffers used for CD studies. NMR spectra were collected at 4 OC on a Bruker AMX 500-MHz N M R spectrometer. Free induction decays were
Biochemistry, Vol. 33, No. 8, 1994 2137 averaged over 1024 scans with 4 K complex data points. The samples contained -80 pM protein in D2O; additionally, pH 8.5 samples contained 0.5 mM EDTA (deuterated) for a-DomainoXand 1 mM CaC12 for a-LA. All proteins are monomeric at this concentration, as determined by sedimentation equilibrium. Chemical shifts were determined relative to an internal (trimethylsily1)propionate (TMSP) standard, corrected for the pH-dependence of the TMSP chemical shift (DeMarco, 1977). Dynamic light-scattering studies were performed at 4 OC on a Brookhaven Instrument BI-2030 laser light-scattering apparatus equipped with an Ar+ laser operating at 488 nm. Autocorrelation functions were sampled at 0.5- or 1-ps time intervals, for a total accumulation time of 15-60 min. Experiments were carried out in the same buffers as used for CD studies. Protein solutions were passed through a 100kDa Centricon (Amicon) to remove trace amounts of large aggregates and dust. The protein concentrations were -80 pM (- 120 pM for unfolded). Hydrodynamic diameters were calculated by the cumulant method (Koppel, 1972), with polydispersity in the range of 0.2-0.3. A refractive index of 1.333 and a viscosity of 1.567 centipoise were used for dilute aqueous solutions at 4 OC. A refractive index of 1.465 and a viscosity of 2.23 centipoise were used for 6 M GuHCl solutions. These values were determined with a refractometer (Bausch & Lomb) and a capillary viscometer (Cole-Parmer). Non-native disulfide-bonded species of a-Domain were purified from a mixture obtained by complete air oxidation of reduced a-Domain in 2-6 M GuHCl. The purity of each species is >90%, as judged by analytical HPLC. The disulfidebond patterns for non-native species were assigned by proteolysis experiments. The following peptides have been identified. For [6-28; 11 1-1201: (i) Phe 3-Leu 1l/Ile 27Phe 31 (disulfide 6-28) and (ii) Trp 104-Leu 123 (disulfide 111-120). For [6-111;28-1201: (i)Phe3-Leu 11/Trp 104Thr 112 and Phe 3-Leu 11/Trp 104-Gln 117 (disulfide 6-1 11) and (ii) Ile 27-Phe 31/Trp 11&Leu 123 (disulfide 28-120). Disulfide-exchange experiments were carried out at room temperature in an anaerobic glovechamber (Coy Laboratory Products). The native buffer contained 10 mM Tris and 0.5 mM EDTA, pH 8.5; the denaturing buffer contained 0.1 M Tris, 6 M GuHCl, and 0.5 mM EDTA, pH 8.5. Disulfide exchange was initiated by adding a-Domain to buffers containing 100 pM N-(2,4-dinitrophenyl(DNP))-~-cysteine and 10 pM N,N-bis(2,4-DNP)-~-cystine (Sigma). (The DNPlabeled cysteine was obtained by reducing the DNP-labeled cystine with DTT followed by HPLC purification.) The final concentration of a-Domain was 5 pM. The 2,4-DNP group served as a spectroscopic label (340 nm) for identifying mixed disulfide species. After 10 h of equilibration, samples were quenched by adding formic acid to a final concentration of 10% (v/v) followed by analytical HPLC using a HzOacetonitrile gradient (0.09%/min increase of acetonitrile) with 0.1% TFA. Identical results were obtained using a redox buffer with 100 pM reduced glutathione and 10 pM oxidized glutathione. The errors in peak quantitation were estimated to be less than 1%. The redox conditions were chosen such that the mixed disulfide population at equilibrium was small; none of the HPLC peaks discussed in this paper showed significant absorbance at 340 nm. In addition, these peaks do not contain free thiols, as the elution times of these peaks did not change if 5,5'-dithiobis(2-nitrobenzoic acid) (DTNB) (Ellman, 1959) was added (1 mM final concentration) before the acid quench, with or without the addition of 6 M GuHCl.
-
2138 Biochemistry, Vol. 33, No. 8, 1994
Peng and Kim
Table 1: Summary of Physical Properties of a-DomainaXand a-LA at 4 ' C apparent mean residue mean residue tryptophan substantial molecular hydrodynamic ellipticity, ellipticity, fluorescence cooperative chemical weight diameter 222 nm 270 nm maximum thermal shift (kDa) (A) (10' deg cm2/dmol) (deg cm2/dmol) (nm) transition dispersion native a-LA (pH 8.5, 1 mM CaC12) molten globule a-LA (pH 2.0) a-Domainox (pH 8.5,0.5 mM EDTA) a-DomainoX(pH 8 . 5 , 6 M GuHCl)
13.6 13.6 10.1
35.3 37.5 36.8 -17
RESULTS A species denoted a-DomainoX,which has the same disulfide bonds (6-120 and 28-1 11) as native a-LA, exhibits properties of a molten globule under a broad range of conditions. Some physical properties of a-DomainoXare summarized in Table 1. Since molten globules have a tendency to aggregate (Goto & Fink, 1989; Kuwajima, 1989), it is especially important to establish that one is studying a monomeric species. Sedimentation equilibrium studies show that a-DomainoXis >90% monomeric at concentrations below 100 pM at pH 8.5 (10 mM Tris, 0.5 mM EDTA) with no additional salt. Nonideality was not observed at protein concentrations in the range of 5-30 pM, and the residuals revealed no systematic deviation. At the highest protein concentration (>60 pM), the apparent molecular weight decreases by 5-lo%, probably as a result of electrostatic interaction with the counterion gradient (Williams et al., 1958). All studies of a-DomainoX(including NMR) were carried out under these conditions. The CD and NMR spectra of a-DomainoXdo not change when EDTA is replaced by 1 mM CaC12. For comparison, intact human a-LA was studied both in the A-state (pH 2) and in native conditions (pH 8.5, 1 mM CaC12). Far-UV CD spectra indicate that a-DomainoXhas substantial a-helical secondary structure (Figure 2A), as expected for a molten globule. These spectra suggest that essentially all of the helices in a-LA are preserved in a-DomainoX. The near-UV CD spectrum of a-DomainoXlacks the characteristic features seen in spectra of native a-LA but is similar to the spectrum of the A-state of a-LA (Figure 2B), suggesting disrupted side-chain packing. (a-DomainoXretains two of the three tryptophans and three of the four tyrosines in human a-LA.) The temperature dependence of the CD signal for a-DomainoXparallels that of the A-state of a-LA; neither exhibits a cooperative thermal transition (Figure 2C,D). As with the A-state of a-LA, the fluorescence maximum of a-DomainoXis red-shifted compared to the maximum of native a-LA (Table l), presumably because the tryptophans are more solvent exposed than in the native protein. The proton NMR spectrum of a-DomainoXresembles the spectrum of the A-state of a-LA (Figure 3). Both spectra exhibit line broadening and considerably less chemical shift dispersion than is observed for native CY-LA,suggesting that a-DomainoXhas a slowly fluctuating structure and that the side chains in a-DomainoXare not in well-defined conformations (Dolgikh et al., 1985; Baum et al., 1989; Alexandrescu et al., 1993). Dynamic light-scattering studies show that a-DomainoXis compact, as compared to the unfolded form in 6 M GuHCl (Table 1). With a spherical approximation, the hydrodynamic diameter of a-DomainoXis -20% larger than the value predicted if it were as compact as native a-LA. This result is consistent with the hydrodynamic properties observed for other molten globules (Gast et al., 1986; Goto et al., 1990). Hypotheses about the structures of molten globules often fall into two categories (for reviews, see Kuwajima, 1989;
-7.9 -12.5 -11.7 --1
-220 -50 -20 -50
326 336 338 345
Yes no no
Yes no no
Ptitsyn, 1992): (i) a nonspecific assembly of secondary structure elements or (ii) an ensemble of slightly different structures sharing a common tertiary fold. For proteins containing disulfide bonds, the tertiary fold can be probed by studying the specificity of disulfide-bond formation. We examined each of the three possible forms of a-Domain with two disulfide bonds. Non-native disulfides force peptide segments that are distant in the native structure to be close in space, thereby imposing a non-native tertiary fold. FarUV CD spectra show that both of the species with non-native disulfide bonds, [6-28; 111-1201 and [6-111; 28-1201, have considerably less helical secondary structure than a-DomainoX (Figure 4). The helix content of a-DomainoXis higher than that of reduced a-Domain, which in turn is higher than the helix content of either of the non-native species. The relative stability of different two-disulfide species at equilibrium can be determined by disulfide-exchange experiments (Figure 5). Because the native disulfide bonds in a-Domain are between cysteines that are distant in sequence, a random-walk model predicts that only 7% of the two-disulfide species should have native disulfides (Kauzmann, 1959; Snyder, 1987). Instead, under native conditions, 90% of the two-disulfide species are a-DomainoX.As a control, in 6 M GuHCl, only 8% of the two-disulfide species have native disulfide bonds, in excellent agreement with the prediction of the random-walk model. In both conditions, identical ratios of the two-disulfide species are obtained using either a-DomainoXor reduced a-Domain as the starting material. We conclude that, under native conditions, a nativelike tertiary fold is the most stable fold for a-Domain.2
DISCUSSION Our results demonstrate that a-DomainoXis a molten globule with a nativelike tertiary fold (Le., the spatial arrangement of helices is similar to that of native a-LA). Complementary results from hydrogen4euterium exchange studies have shown that the helices in molten globules, including the A-state of a-LA, are formed by the same, or nearly the same, residues that form helices in the corresponding native proteins (Baum et al., 1989; Hughson et al., 1990; Jeng et al., 1990; Alexandrescu et al., 1993). Taken together, these results suggest that molten globules are stabilized by native interactions and contain structures that are closer to expanded native proteins than to nonspecific, collapsed polypeptides. Previous studies of disulfide exchange in the molten globule of a-LA suggested that, although there were some restrictions, a variety of disulfide pairings were favored (Ewbank & The free-energy differences between conformations with native and non-nativetertiary foldscan beestimated from the ratiosoftheequilibrium populations between a-DomainoXand each of the non-native disulfide species under native versus denaturing conditions;the results are 2.9 and 1.8 kcal/mol favoring the native disulfide species for [6-28; 111-1201 and [6-111; 28-1201, respectively. These values are comparable to the free energy of unfolding the molten globule of a-LA (Kuwajima et al., 1976; Ikeguchi et al., 1986b).
Biochemistry, Vol. 33, No. 8, 1994 2139
Protein Dissection of a Molten Globule
15
7-
10
E -
5
0
b
NE P
m
-5
-10 Y
-15 -20
190 200 210 220 230 240 250 260
250 260 270 280 290 300 310 320
Wavelength (nm)
Wavelength (nm)
L
E
'0
-100
N5
t
-250'
-1J
0
10 20 30 40 50 60 70 80
Temperature ("C)
I
0
I
I
I
'
I
I
'
10 20 30 40 50 60 70 80
Temperature ("C)
FIGURE 2: CD studies indicate that a-Domainoxforms a molten globule. (A) Far-UV CD spectra of a-DomainnX(0)and native a-LA (0) at pH 8.5. The ratio of the mean residue ellipticities at 222 nm of a-DomainoXand a-LA indicates that the two molecules have approximately the same number of residues in an a-helical conformation. (B) Near-UV CD spectra of a-DomainOX at pH 8.5 (@), native a-LA at pH 8.5 ( O ) , and molten globule a-LA at pH 2 (0). a-Domainoxretains two of the three tryptophans and three of the four tyrosines in human a-LA. (C) Temperature dependence of the CD signal at 222 nm and (D) at 270 nm for a-DomainoXat pH 8.5 (a),native a-LA at pH 8.5 (O), and molten globule a-LA at pH 2 (13). Creighton, 1991, 1993). One factor to be considered in the interpretation of these results, in light of our current observations, is that the molten globule of a-LA may have an a-helical domain with a nativelike fold but a relatively unstructured 0-sheet domain. More work is needed to determine if the molten globule of a-LA includes both ordered and disordered regions. The preference for native disulfides does not imply a rigid structure. The disulfide constraints still allow for substantial flexibility, including multiple side-chain conformations and small adjustments of helix crossing angles. Indeed, we refer to a-DomainoXas a molten globule, rather than a destabilized native species, because even with native disulfide bonds, a-Domainox lacks a native near-UV CD spectrum, lacks chemical shift dispersion in NMR studies, and is less compact than native a-LA. Short peptide fragments, including those corresponding to individual helices in a-LA (Alexandrescu et al., 1993), generally do not fold into stable structures in aqueous solution (Wright et al., 1988; Kim & Baldwin, 1990). By contrast, the helices in a-Domainox are stabilized cooperatively by forming a nativelike tertiary fold, even though folding does not proceed beyond the molten globule stage. In addition, we
find that non-native tertiary folds, imposed by non-native disulfides, decrease the helix content in a-Domain. Thus, formation of a high level of secondary structure in a-Domain is linked to formation of a nativelike tertiary fold. Remarkably, the nativelike tertiary fold in the a-DomainoX molten globule is formed without extensive, specific side-chain packing. It is possible that a few side-chain interactions, yet to be discovered, play a crucial role in determining the tertiary fold of this molten globule. Alternatively, the amino acid sequence of a protein could specify the tertiary fold in a manner that does not depend on the details of side-chain packing (Behe et al., 1991). In the latter case, properties such as the pattern of hydrophobic and hydrophilic residues, side-chain volumes, and secondary structure propensities are probably the important determinants of tertiary folds (e.g., Lau & Dill, 1990; Bowie et al., 1991). This would imply that the native fold of a protein could be predicted without detailed knowledge of tertiary packing interactions. a-DomainoXis an example of a compact, autonomous folding unit (Wetlaufer, 1973), sufficient for formation of a molten globule with properties similar to the A-state of a-LA. In the folding of multidomain proteins, individual domains often fold independently (Jaenicke, 1991). Our resultssuggest that some
[
2140 Biochemistry, Vol. 33, No. 8, 1994 (a)
a-Domainox
b
Peng and Kim Native Conditions [6-120;28-111]
(a-Do m a I no')
+
[6-28; 111-1201 [6-111; 28-1 201
Conditions
I t- [6-28;111-120]
[6-120;28-111] 1
"
8
'
1
~
6
"
/
'
~
4
'
/
'
"
2
1
'
'
0
(a-Domai nor)
(6-111;28-120]
PPm FIGURE 3: IH NMR spectra. The spectra of a-DomainoX are similar to those of the A-state of a-LA and lack the chemical shift dispersion observed in native a-LA: (a) a-DomainOx at pH 8.5, (b) molten globule a-LA at pH 2, and (c) native a-LA at pH 8.5.
15 10
-
"0
.-
8
+
-5
-10 -15
ACKNOWLEDGMENT
-20 190 200 210 220 230 240 250 260 Wavelength (nm)
FIGURE 4: Far-UV CD spectra. a-Domaina (a),reduced a-Domain (+), [6-28; 111-1201 (O), and [6-111; 28-1201 (0).The spectra indicate that non-nativetertiary folds, imposed by non-nativedisulfide bonds, significantly decrease the helix content of a-Domain. of these folded domains in the folding intermediates of multidomain proteins may correspond to molten globules with nativelike tertiary folds (see, also, Goldberg et al., 1990). Finally, even though they are equilibrium intermediates, molten globules appear to show characteristic properties of early kinetic folding intermediates (Kuwajima, 1989; Ptitsyn, 1992; Baldwin, 1993). Early formation of a nativelike tertiary fold within a molten globule intermediate will lead to a vast reduction of the conformational space to be searched at later stages of folding. At the same time, the potential energy surface near the molten globule state, which lacks extensive side-chain packing, should be smoother and shallower than that near the native state. Thus, the molten globule provides an approximate solution to the protein-folding problem while granting enough flexibility to make kinetic traps less likely. Our results, combined with other studies of molten globules, suggest that the process of protein folding depends on the
We thank J. Weissman for advice and suggestions, Prof. R. Langer for access to the dynamic light-scattering instrument, Dr. J. Okada for instruction in the use of this instrument, and D. Lockhart, J. Staley, R. Rutkowski, and members of the Kim group for discussions and comments on the manuscript. P.S.K. is a Pew Scholar in the Biomedical Sciences. REFERENCES Acharya, K. R., Stuart, D. I., Walker, N. P. C., Lewis, M., & Phillips, D. C. (1989) J . Mol. Biol. 208, 99-127. Acharya, K. R., Ren, J., Stuart, D. I., & Phillips, D. C. (1991) J . Mol. Biol. 221, 571-581. Alexandrescu, A. T., Evans, P. A., Pitkeathly, M., Baum, J., & Dobson, C. M. (1993) Biochemistry 32, 1707-1718. Baldwin, R. L. (1993) Curr. Opin. Struct. Biol. 3, 84-91. Baum, J., Dobson, C. M., Evans, P. A., & Hanley, C. (1989) Biochemistry 28, 7-13. Behe, M. J., Lattman, E. E., & Rose, G. D. (1991) Proc. Nutf. Acad. Sci. U.S.A. 88, 4195-4199. Bierzynski, A., Kim, P. S., & Baldwin, R. L. (1982) Proc. NutI. Acad. Sci. U.S.A. 79, 2740-2744. Bowie, J. U., Luthy, R., & Eisenberg, D. (1991) Science 253, 164-170. Carr, C., Aykent, S., Kimack, N. M., & Levine, A. D. (1991) Biochemistry 30, 1515-1 523.
Protein Dissection of a Molten Globule Christensen, H., & Pain, R. H. (1991) Eur. Biophys. J . 19,221229.
DeMarco, A. (1977) J . Magn. Reson. 26, 527-528. Doering, D. S. Functional and Structural Studies of a Small $actin Binding Domain, Massachusetts Institute of Technology, 1992. Dolgikh, D. A., Gilmanshin, R. I., Brazhnikov, E. V., Bychkova, V. E., Semisotnov, G. V., Venyaminov, S. Y., & Ptitsyn, 0. B. (1981) FEBS Lett. 311-315. Dolgikh, D. A., Abaturov, L. V., Bolotina, I. A., Brazhnikov, E. V., Bychkova, V. E., Gilmanshin, R. I., Lebedev, Y. O., Semisotnov, G. V., Tiktopulo, E. I., & Ptitsyn, 0. B. (1985) Eur. Biophys. J . 13, 109-121. Dyson, H. J., Cross, K. J., Houghton, R. A., Wilson, I. A,, Wright, P. E., & Lerner, R. A. (1985) Nature 318, 480-483. Edelhoch, H. (1967) Biochemistry 6 , 1948-1954. Ellman, G. L. (1959) Arch. Biochem. Biophys. 82, 70-77. Ewbank, J. J., & Creighton, T. E. (1991) Nature 350, 518-520. Ewbank, J. J., & Creighton, T. E. (1993) Biochemistry32,36773693.
Gast, K., Zirwer, D., Welfle, H., Bychkova, V. E., & Ptitsyn, 0. B. (1986) Int. J . Biol. Macromol. 8 , 231-236. Goldberg, M. E., Semisotnov, G. V., Friguet, B., Kuwajima, K., Ptitsyn, 0. B., & Sugai, S. (1990) FEBS Lett. 263, 51-56. Goto, Y., & Fink, A. L. (1989) Biochemistry 28, 945-952. Goto, Y .,Calciano, L. J., & Fink, A. L. (1990) Proc. Natl. Acad. Sci. U.S.A. 87, 573-577. Haynie, D. T., & Freire, E. (1993) Proteins: Struct. Funct. Genet. 16, 115-140.
Hughson, F. M., Wright, P. E., & Baldwin, R. L. (1990) Science 249, 1544-1 548. Ikeguchi, M., Kuwajima, K., Mitani, M., & Sugai, S. (1986a) Biochemistry 25, 6965-6972. Ikeguchi, M., Kuwajima, K., & Sugai, S. (198613) J . Biochem. (Tokyo) 99, 1191-1201. Jaenicke, R. (199 1) Biochemistry 30, 3 147-3 161. Jeng, M.-F., Englander, S. W., Elove, G. A., Wand, A. J., & Roder, H. (1990) Biochemistry 29, 10433-10437. Kauzmann, W. (1959) in Sulfur in Proteins (Benesch, R., et al., Eds.) pp 93-108, Academic Press, New York.
Biochemistry, Vol. 33, No. 8, 1994 2141
Kim, P. S., & Baldwin, R. L. (1990) Annu. Rev. Biochem. 59, 631-660.
Koppel, D. E. (1972) J . Chem. Phys. 57,4814-4820. Kuwajima, K. (1989) Proteins: Struct. Funct. Genet. 6,87-103. Kuwajima, K., Nitta, K., Yoneyama, M., & Sugai, S. (1976) J . Mol. Biol. 106, 359-373. Kuwajima, K., Hiraoka, Y., Ikeguchi, M., & Sugai, S. (1985) Biochemistry 24, 874-881. Lau, K. F., & Dill, K. A. (1990) Proc. Natl. Acad. Sci. U.S.A. 87, 638-642. Laue, T. M., Shah, B. D., Ridgeway, T. M., & Pelletier, S. L. (1992) in Analytical Ultracentrifugation in Biochemistry and Polymer Science (Harding, S. E., Rowe, A. J., & Horton, J. C., Eds.) pp 90-125, The Royal Society of Chemistry, Cambridge. Miranker, A., Radford, S. E., Karplus, M., & Dobson, C. M. (1991) Nature 349, 633-636. Nozaka, M., Kuwajima, K., Nitta, K., & Sugai, S. (1978) Biochemistry 17, 3753-3758. Oas, T. G., & Kim, P. S. (1988) Nature 336, 42-48. Priestle, J. P. (1988) J . Appl. Crystallogr. 21, 572-576. Ptitsyn, 0. B. (1987) J . Protein Chem. 6 , 273-293. Ptitsyn, 0 . B. (1991) FEBS Lett. 285, 176-181. Ptitsyn, 0 .B. (1992) in Protein Folding (Creighton, T. E., Ed.) pp 243-300, W. H. Freeman and Co., New York. Radford, S. E., Dobson, C. M., & Evans, P. A. (1992) Nature 358, 302-307. Snyder, G. H. (1987) Biochemistry 26, 688-694. Staley, J. P., & Kim, P. S. (1990) Nature 344, 685-688. Tasayco, M. L., & Carey, J. (1992) Science 255, 594-597. Vita, C., Dalzoppo, D., & Fontana, A. (1984) Biochemistry 23, 55 12-55 19.
Wetlaufer, D. B. (1973) Proc. Natl. Acad. Sci. U.S.A.70,697701.
Williams, J. W., Van Holde, K. E., Baldwin, R. L., & Fujita, H. (1958) Chem. Rev. 58, 715-806. Wright, P. E., Dyson, H. 51, & Lerner, R. A. (1988) Biochemistry 27. 7167-7175. Xie,D.,Bhakuni,V., & Freire,E. (1991) Biochemistry30,1067310678.