J. Phys. Chem. B 2002, 106, 13047-13057
13047
Long-Range Effects of Mutating R248 to Q/W in the p53 Core Domain Sergey Yu Noskov,†,§,| Jon D. Wright,†,| and Carmay Lim*,†,‡ Institute of Biomedical Sciences, Academia Sinica, 11529 Taipei, Taiwan, and Department of Chemistry, National Tsing-Hua UniVersity, 300 Hsinchu, Taiwan ReceiVed: September 25, 2002
The mutations of R248 to Trp and Gln in the core domain (CD) of the p53 protein are some of the most common mutations found in human cancer. Although the mutant 248Q and 248W p53-CDs retain the wildtype conformation and stability, they lack sequence-specific DNA binding and transactivation functions and the ability to suppress cell growth. The structural and energetic bases for the observed loss of DNA binding are unclear as the DNA-free and DNA-bound mutants are not available. Hence, we have generated threedimensional models of the wild type, 248Q, and 248W p53-CDs, free and bound to DNA, using molecular dynamics simulations in the presence of explicit water molecules. Based on the simulation structures, the free energies of wild type and mutant p53-CDs binding to DNA have been computed and decomposed into component energies (electrostatic vs van der Waals vs cavity) and contributions from the interface residues. The DNA-free mutant structures were found to be consistent with antibody-binding and NMR data. The predicted DNA binding losses of both mutants were also in accord with experimental data. The calculations revealed that mutating R248 in the minor groove yielded long-range changes in the major groove DNAbinding interface, and the extent of these changes differs depending on the mutation type. The DNA-binding loss of the 248Q p53-CD mutant is due mainly to the loss of major groove contacts from K120 (located ∼20 Å from the mutation site) as well as unfavorable interactions of D281. In contrast, the DNA-binding loss of the 248W p53-CD mutant is due mainly to the loss of minor groove contacts from the mutant residue itself. It is also due, to a lesser extent, to the loss of major groove contacts from loop L1 and to the poorer packing of the protein-DNA interface in the 248 W mutant complex relative to the wild-type. The results obtained here are important in deciding the rescue strategy of mutant p53 DNA binding.
Introduction The DNA-binding protein, p53, functions as a tumor supressor in human cells by activating transcription of genes that mediate cell cycle arrest and DNA damage repair,1-4 and by inducing apoptosis in response to DNA damage, or other genotoxic stress such as hypoxia, oncogene activation, and viral infection.5-7 Mutations of this genome guardian are associated with about 50% of human cancers,8 95% of lung cancers,9 and has also been associated with resistance to therapy.10,11 The human p53 protein comprises 393 amino acids that can be grouped4 into (i) a transcription/transactivation region (residues 1-42), (ii) a sequence-specific DNA-binding core domain (residues 102292), (iii) a tetramerisation domain (residues 323-356), and (iv) a C-terminal region (residues 363-393) that acts as a negative regulator of p53 sequence-specific DNA binding12 and can also weakly bind DNA/RNA. The most common type of p53 mutations are missense mutations that result in full-length, albeit mutant, proteins. This fraction of mutations is much higher than in other tumor supressor genes such as APC, ATM, and BRCA-1.13 The majority of the p53 mutations found in human cancers map to the p53 core domain14 (referred to as p53-CD) and ∼40% of * Corresponding author. E-mail:
[email protected]. Phone: 886-2-2652-3031. Fax: 886-2-2788-7641. † Academia Sinica. ‡ National Tsing-Hua University. § Permanent address: Institute of Solution Chemistry, Russian Academy of Sciences, Akademicheskaya str., 1, 153045, Ivanovo, Russia. | The first two authors should be regarded as joint first authors.
them occur at R175, G245, R248, R249, R273, and R282.8 Based on the 2.2 Å X-ray structure of a 21-base pair DNA duplex and three p53-CDs,15 mutations of R175, G245, R249, and R282 have been classified as conformational because these four residues do not contact the DNA directly, whereas mutations of R248 and R273 have been classified as contact because the side chains of these two residues contact (within 4 Å) phosphate oxygens in the minor and major DNA groove, respectively. R248 is located in loop L3 (residues 237-250), while R273 is near the C-terminus of β-strand S10 (residues 264-274), which is part of the loop-sheet-helix motif consisting of loop L1 (residues 112-124), a three-stranded β sheet (residues 124-141 and 271-274), and helix H2 (residues 278286). Because the R248 and R273 contact mutants account for about one fifth of all p53 mutations and are assumed to retain a native-like three-dimensional (3D) structure, they have attracted considerable interest as targets for mutant p53 rescue.8,16,17 Here, we have chosen to study the R248 f W and R248 f Q p53-CDs because they are two of the most common tumor-derived mutants15 and have been found in brain, breast, colon, esophageal, head and neck, skin, lung, and liver cancer.13 The p53-CD‚DNA X-ray structure shows that on the DNA side, the R248 side chain has “close” contacts (defined by a heavy donor-heavy acceptor distance >3.5 Å but e4 Å) to the Thy 12′ and Thy 14 phosphate oxygens in the minor groove, while on the protein side, it is hydrogen bonded to S240, which is flanked by two zinc ligands, C238 and C242. When R248 is mutated to Gln or Trp in full-length p53 the wild-type
10.1021/jp022140w CCC: $22.00 © 2002 American Chemical Society Published on Web 11/19/2002
13048 J. Phys. Chem. B, Vol. 106, No. 50, 2002 conformation18 (i.e., PAb1620+, PAb240-, Hsp 70-) is retained. However, both 248Q and 248W p53 mutants lack sequence-specific DNA binding and sequence-specific transactivation function and growth suppression.3,19-21 Our goal here is to identify the key structural changes and residues governing the loss of DNA binding upon mutating R248 to a neutral Gln or Trp in the p53-CD. Because the DNAfree and DNA-bound 3D structures of these two 248 mutants have not been reported, their 3D solution structural models, free and DNA bound, were obtained using molecular dynamics (MD) simulations with explicit water molecules. The simulation protocol (see Methodology) had been calibrated in our previous work,22 which showed that the free and DNA-bound wild-type p53-CD simulations preserved the key X-ray protein-protein and protein-DNA interactions as well as the qualitative features of the X-ray B-factors. As another validation check of the simulation protocol, albeit indirect, our previous work also showed that the DNA binding affinity and specificity of the p53-CD were lost upon mutating R273 f H, but restored by a second mutation of T284 f R, in accord with experimental findings.22 Here the computed DNA-free mutant structures were verified by their consistency with antibody-binding and NMR data. Based on the average MD structures of wild-type and mutant p53-CDs, the DNA-binding free energy of the 248 mutant relative to that of the wild-type protein was computed and compared to experiment. By decomposing the free energy change upon mutation into changes in the component energies and individual residue contributions, and correlating the energy changes with changes in the hydrogen bonding interactions and solvent accessibility, the key contributions to the loss of p53CD binding upon mutating R248 f W/Q were identified. The implications of the results obtained here to the strategy of identifying potential linker molecules that may restore the DNA binding affinity and specificity of the two p53-CD mutants studied are discussed. Methodology MD Simulations. The core domains of wild-type p53, mutant 248Q, and 248W, free and bound to a 15-base-pair DNA duplex, 5′-3AATTGGGCAAGTCTA17-3′, were subjected to MD simulation using the CHARMM23 program at a mean temperature of 300 K. The total numbers of atoms in the simulations of the core domain of wild-type p53, mutant 248Q, and 248W p53, free and DNA-bound, are 37615, 37618, 37626, 58389, 58383, and 58385, respectively. The net charge of the wild-type protein (+5) bound to zinc (+2), 20 counterions (+20) (see below), and the 15-base-pair DNA duplex (-30) is -3. Starting Structures. The initial 3D structure for the wild-type p53-CD-DNA simulation was taken from chain B of PDB entry 1TSR, in which the core domain (residues 96-289) is bound to a consensus DNA binding site, GGGCA AGTCT; the second p53-CD is bound to a nonconsensus DNA binding site, while the third p53-CD is DNA-free.15 Initial 3D structures for the mutant p53-CD‚DNA simulations were prepared by a side chain rotamer library method24 using a locally modified version of the SCWRL program.25,26 The most common χ1 and χ2 angles for the mutant residue with respect to the wild-type backbone Φ and Ψ angles at that position were identified and the side chain without hydrogen atoms was built using the AMBER94 force field.27 Subsequently, the side chain was checked for steric clashes against the backbone using a simple distance-dependent potential that approximates the van der Waals (vdW) potential. If there were unfavorable interactions with the backbone, the
Noskov et al. rotamer was rejected and the next most common rotamer was used. This was repeated until no clashes or no more rotamers were left. The built rotamer was then checked for side chainside chain interactions using the same distance-dependent potential as before. If there were side chain to side chain clashes, then more rotamers were searched and the one with the lowest conflict was employed. For the DNA-free proteins, the starting structure was extracted from the average MD structure of the respective complex. Simulation Protocol. Each simulation was carried out at physiological pH (pH 7) using the all-hydrogen CHARMM27 force field28 and the TIP3P model of water29 with spherical boundary conditions.30 The protonation state of the ionizable residues and the treatment of nonbonded vdW and electrostatic interactions are as described in our previous work.22 Hydrogen atoms were added to the initial protein-DNA complex and the resulting structure was minimized in the presence of strong harmonic constraints on all heavy atoms to remove bad contacts. Next, the structure was immersed in the center of a previously equilibrated TIP3P water sphere of 50 Å radius. The size of the water sphere was chosen such that it provided at least three layers of solvation around the complex. Water molecules within 2.6 Å of any of the protein or DNA heavy atoms were deleted. The water sphere was rotated four times and overlaid on the previous system to ensure that no cavities were formed in the system. Twenty sodium ions were added to the protein-DNA system by replacing water molecules with the highest electrostatic energies on the oxygen atom and located g7 Å apart from each other and g4 Å apart from the protein or DNA atoms. The fully solvated protein-DNA complex was then further minimized (with the constraints on heavy atoms reduced to zero in successive rounds of minimization) to relieve bad vdW contacts. It was then equilibrated for 60 ps with heavy constraints on only the terminal DNA residues to prevent the DNA strands from unwinding.31 These constraints were reduced and each simulation was continued for 250 ps, during which coordinates were saved every 100 fs. As the X-ray structure shows that DNA binding does not alter the protein structure significantly,15 the p53 protein was extracted from the average MD structure of each complex and subjected to 250 ps of MD simulation, assuming the same protonation state of the ionizable residues, force field, and boundary conditions as the bound-state simulations. The length of each trajectory appears to be sufficiently long to obtain equilibrium structures (see Results, also Figures 1 and 2). Furthermore, the same simulation length was employed in our previous work,22 which showed that the free and DNA-bound wild-type p53-CD simulations generally preserved the key features of the respective X-ray structures and B-factors, while the DNA-bound R273H + T284R double mutant simulation could restore the native-like conformation of the protein-DNA interface, in accord with experimental findings. Analyses. For each trajectory the root-mean-square deviations (RMSDs) of the backbone protein and DNA atoms (excluding the terminal DNA residues) from the initial structure as well as the B-factors for the protein backbone atoms were computed. The B-factors for the protein backbone atoms were computed over the sampled phase of the simulations with error bars being calculated by splitting the sampled phase into three parts of 50 ps each and recomputing the B-factors for each part. Hydrogen bond analyses were performed by computing the average distance between a light/heavy donor and heavy acceptor from the last 150 ps portion of the trajectory. A hydrogen bond was defined by a heavy donor-heavy acceptor distance e3.5 Å, a
Long-Range Effects of Mutating R248 to Q/W
J. Phys. Chem. B, Vol. 106, No. 50, 2002 13049
Figure 1. Plot of the RMSD of the backbone atoms from the initial structure as a function of simulation time in (a) wild-type, (b) R248fQ, and (c) R248fW p53-CDs.
light donor-heavy acceptor distance e2.5 Å, and a deviation of less than (60° from linearity. An average MD structure was obtained by averaging over the last 150 ps portion of the trajectory (see Results). Solvent accessible surface area (SASA)32 calculations were performed on the MD structures using the GEPOL program33 and a solvent probe radius of 1.4 Å for absolute values or using the MolMol program34 for a percentage residue accessibility; i.e., the percentage ratio of the side chain X water-accessible surface area in the protein to that in the tripeptide -Gly-X-Gly-. Free Energy Decomposition. RelatiVe Binding Free Energy. The free energy difference between the wild-type and mutant p53-CD binding to DNA in solution, ∆∆G°, is given by wt ∆∆G° ) (∆Gmut gas - ∆Ggas) +
{Gsolv(p53
mut
‚DNA) - Gsolv(p53 ‚DNA)} wt
{Gsolv(p53
) - Gsolv(p53 )} (1)
mut
wt
∆Ggas is the standard free energy change per mole for the noncovalent association of the p53-CD and DNA in the gas phase at 300 K, and -Gsolv corresponds to the work of transferring the molecule in its solution conformation to the same conformation in the gas phase at 300 K. In the following, a single ∆ denotes the difference in energy/free energy between the DNA-bound and DNA-free state of wild-type or mutant p53CD, and ∆∆ represents the DNA-binding free energy of the mutant p53-CD relative to the wild-type protein. The ∆∆G° calculation was based on “snapshot” configurations taken every
Figure 2. Plot of the RMSD of the protein and DNA backbone atoms from the initial structure as a function of simulation time in (a) wildtype p53-CD‚DNA, (b) 248Q p53-CD‚DNA, and (c) 248W p53CD‚DNA.
5 ps from the last 100 ps of the MD trajectories.35,36 All water molecules were removed from the MD trajectories to avoid boundary problems for the different sized systems, as the DNAbound and free proteins had the same number of solute atoms but different numbers of solvent atoms. Previous works have shown that the free energy can be decomposed into physically meaningful components, even though the components are path dependent.37-40 Here, ∆Ggas and Gsolv are computed assuming that their energy/free energy components are additive.41,42 Gas-Phase Binding Free Energy Difference, ∆∆Ggas. The free energy difference between wild-type and mutant p53-CD binding to DNA in the gas phase was estimated by elec ∆∆Ggas ∼ ∆∆EvdW gas + ∆∆Egas )
- ∆EvdW,wt ) + (∆Eelec,mut - ∆Eelec,wt ) (2) (∆EvdW,mut gas gas gas gas Equation 2 assumes similar conformational changes as well as similar vibrational entropy changes between wild-type and mutant p53-CDs binding to DNA. Based on the MD structures of wild type and mutant p53-CDs (see above), the changes in vdW and electrostatic energies upon binding were calculated using the CHARMM program23 and an all-hydrogen force field43 with unit dielectric constant and no cutoffs. SolVation Free Energy, GsolV. The solvation free energy, Gsolv, can be expressed as elec Gsolv ) Gnonel solv + Gsolv
(3)
13050 J. Phys. Chem. B, Vol. 106, No. 50, 2002
Noskov et al.
The nonelectrostatic contribution to the solvation free energy, Gnonel solv , was approximated by a linear function of the SASA; i.e.,
Gnonel solv ) γ × SASA
(4)
where γ was set to 7.2 cal/mol/Å2, as rationalized in previous works,42,44 and the SASA of the free or bound molecule was computed using the GEPOL program (see Analyses above). Hence ∆Gnonel solv is proportional to the loss in SASA at the protein-DNA interface. The electrostatic contribution to the solvation free energy (Gelec solv) was calculated using the finite-difference PoissonBoltzmann (FDPB) method,45 as implemented in CHARMM.23 The protein was treated as a low dielectric medium (in ) 1) surrounded by a high dielectric solvent (out ) 80 for water). The ionic strength was set to 0.145 M. The low-dielectric region of the protein was defined as the region inaccessible to contact by a 1.4 Å sphere rolling over the molecular surface, defined by atomic coordinates of the MD structure and vdW radii taken from the all-hydrogen CHARMM27 force field. Due to the large dimension of the system and the long-range Coulombic forces, several focusing steps were performed to define proper boundary conditions. Initial electrostatic potential calculations for the wildtype and mutant structures employed a 200 × 200 × 200 grid with a spacing of 1.6 Å. Two focusing steps were performed to reach a final grid spacing of 0.4 Å.46 The difference between the electrostatic potential calculated in solution and in the gas phase yielded the electrostatic contribution to the solvation free energy. Results Figures 1 and 2 show plots of the RMSDs of the backbone atoms from the initial coordinates of the p53-CD, free and bound to DNA, respectively, as a function of simulation time. For the free-state simulations (Figure 1), the protein backbone RMSDs of the wild-type, 248Q, and 248W p53-CDs plateaued after ∼160 ps and fluctuated around a mean of 1.55, 1.25, and 1.50 Å, respectively. For the bound-state simulations (Figure 2), the protein backbone RMSDs of the p53-CD‚DNA complexes also plateaued after ∼160 ps, and fluctuated around a mean of 1.3 Å for the wild-type protein, and at a higher mean of 1.5 and 1.6 Å for the 248Q and 248W mutants, respectively. The protein backbone RMSDs of the MD structures from their respective initial ones are similar to those of NMR solution structures from their corresponding X-ray crystal structures47 and also similar to analogous values reported for simulations of other proteins.48-51 Relative to the protein backbone RMSD, the DNA backbone RMSD (not including the terminal bases) fluctuated around a higher mean of 1.8 to 2.0 Å, reflecting the greater mobility of the DNA compared to the protein. The last 150 ps of each trajectory were employed in analyses (see Methodology). Changes in Mobility. Figure 3a shows that the mutant 248Q main-chain B-factors (averaged for the N, CR, and C(dO) atoms) are generally higher than the corresponding wild-type values, indicating a relatively flexible structure. The mutant 248W main-chain B-factors are also generally higher than the corresponding wild-type values, except near the N-terminus and in the three loops connecting β-strands S3 and S4 (residues 147-155), S7 and S8 (residues 220-229), and S9 and S10 (residues 259-263). However, it is more rigid than the 248Q mutant, as its main-chain B-factors are generally lower than those of the 248Q mutant. When the protein is bound to DNA,
Figure 3. Plot of the B-factors averaged over the protein backbone atoms as a function of residue number in the simulations of (a) DNAfree and (b) DNA-bound p53-CDs. The solid, dotted, and dashed lines correspond to wild-type, 248Q, and 248W p53-CDs, respectively. The standard errors (not plotted) were 5.0, 6.4, and 3.7 Å2 for the DNAfree wild-type, 248Q, and 248W p53-CDs, respectively, and 4.6, 2.5, and 3.4 Å2 for the DNA-bound wild-type, 248Q, and 248W p53-CDs, respectively.
the two mutants and the wild-type protein exhibit similar trends in the main-chain B-factors, except that part of the L2 loop (residues 175-190) and the loop connecting β-strands S7 and S8 are disordered in the 248Q p53-CD‚DNA complex (Figure 3b). Figure 4 shows that in the 248Q mutant, the backbone RMSDs from the wild-type simulation structure for base-pairs 3-7, 9, and 10 are higher than the respective X-ray RMSDs, whereas those for base-pairs 11, 12, 14-17 are lower than the respective X-ray RMSDs. This indicates that the DNA major groove (base-pairs 6-10) in the 248Q p53-CD‚DNA complex seems to be more flexible than that in the wild-type X-ray complex, whereas the DNA minor groove (especially base pairs 11 and 12) is more rigid. In the 248W mutant, the DNA major groove as well as minor groove are overall more flexible than those in the wild-type complex, except for base pairs 8, 11, and 12. Changes in Component Free Energies Upon Mutation. The contributions to ∆∆G° (eq 1) from the protein-DNA interface residues were decomposed as follows: vdW elec nonel + ∑∆∆Gi,gas + ∑ ∆∆Gi,solv + ∑i ∆∆Gi ) ∑i ∆∆Gi,gas i i elec (5) ∑i ∆∆Gi,solv
Long-Range Effects of Mutating R248 to Q/W
J. Phys. Chem. B, Vol. 106, No. 50, 2002 13051
Figure 4. Plot of the RMSD of the DNA backbone atoms (averaged over the base pair) from the wild-type p53-CD‚DNA simulation structure as a function of base-pair number for the crystal structure (solid line), 248Q p53 (dotted line) and 248W p53 (dashed line).
TABLE 1: Changes in the DNA-binding Free Energy Components of the Mutant 248Q p53-CD Relative to Wild-typea interface residue i
∆∆Gi
elec ∆∆Gi,gas
elec ∆∆Gi,solv
vdw ∆∆Gi,gas
nonel ∆∆Gi,solv
L1: A119 L1: K120 L1: S121 Q136 L137 L3: N239 L3: S240 L3: S241 L3: C242 L3: M243 L3: N247 L3: 248Q L3: R249 S10: V272 S10: R273 S10: V274 C275 A276 C277 H2: R280 H2: D281 H2: R283 ∑ib
5.2 18.6 -11.7 -0.2 -0.1 2.0 0.7 -1.1 0.0 0.7 4.3 5.5 -0.7 0.0 -5.4 0.6 -0.1 -0.9 -0.2 -2.8 14.6 6.8 35.8
4.2 30.6 -6.9 1.9 0.6 9.6 2.9 -2.7 3.2 -0.0 5.2 143.4 -10.2 0.0 -5.0 0.2 0.3 1.0 0.9 -6.1 6.3 -0.1 179.2
-0.6 -14.5 -2.2 -2.1 -0.6 -7.4 -2.2 1.8 -3.2 0.4 -2.2 -139.6 9.5 0.0 0.1 0.5 -0.4 -2.6 -1.0 3.1 8.6 6.9 -147.7
1.4 2.2 -2.0 0.0 -0.0 -0.2 -0.0 -0.2 0.0 0.4 1.1 1.4 -0.0 0.0 -0.5 -0.2 -0.0 0.6 -0.0 0.3 -0.2 0.0 4.1
0.2 0.4 -0.6 0.0 -0.0 -0.0 0.0 -0.0 0.0 -0.1 0.2 0.3 0.0 0.0 0.0 0.0 0.0 0.1 -0.2 -0.1 -0.1 -0.0 0.2
a Values greater than 7 kcal/mol are highlighted in bold. b Free energy contributions from all interface residues.
where the summation is over all interface residues (defined as residues whose gas-phase interaction energy changes by >1 kcal/mol upon DNA binding). The relative free energies for 248Q (Table 1) and 248W (Table 2) p53-CDs are distinguished by ∆∆G and ∆∆′G, respectively. The subscript “gas” in eq 5 denotes protein-protein or protein-DNA interactions (collectively known as solute-solute interactions), whereas the subscript “solv” denotes protein-solvent interactions. In what follows, we first identify the interface residues whose |∆∆Gi| g (1/5)|∑i∆∆Gi|. We then correlate the changes in the respective residue-protein/DNA/water hydrogen bonding interactions with the changes in the corresponding interaction energies upon mutation. In interpreting the results below, we emphasize the trend (sign) of the large free energy changes
(given to g2 significant figures below) upon each mutation rather than their absolute values. R248 f Q. When R248 is mutated to Gln, the p53-CD mutant is predicted not to bind to DNA as ∆∆G° is large and positive (37 kcal/mol), stemming mostly from a net unfavorable free energy contribution of the interface residues (∑i∆∆Gi ) 36 kcal/ mol, Table 1). The loss of binding free energy upon mutating R248 f Q is due mainly to unfaVorable solute-solute elec ) 179 kcal/mol), which electrostatic effects (∑i∆∆Gi,gas outweigh favorable solute-solvent electrostatic changes elec (∑i∆∆Gi,solv ) -148 kcal/mol) by ∼31 kcal/mol. SolutevdW ) 4.1 kcal/mol) and nonelecsolute vdW energy (∑i∆∆Gi,gas nonel ) 0.2 kcal/mol) trostatic solvation free energy (∑i∆∆Gi,solv changes of the interface residues, however, do not make significant contributions to the observed loss of binding affinity. Table 1 shows that as expected, the mutant 248Q contributes less faVorably than the wild-type R248 to DNA binding (∆∆G248Q ) 5.5 kcal/mol); however, it is not the major contributor to the observed loss of binding affinity. Instead, the main contributors to the loss of DNA binding upon mutating R248 f Q in the minor groove are two charged residues in the major groove, K120 (∆∆GK120 ) 19 kcal/mol) and D281 (∆∆GD281 ) 15 kcal/mol). The R248 f Q mutation resulted in a loss of two “close” contacts (defined in Introduction) to the Thy 12′ and Thy 11′ phosphate oxygens and a water-bridged hydrogen bond to the Thy 12′ sugar oxygen (compare Figure 6 with Figure 5), hence ∆∆G248Q,gaselec is large and positive (143 kcal/mol, Table 1). However, it is nearly compensated by a elec similarly large but negative ∆∆G248Q,solv (-140 kcal/mol) because in the unbound state, the neutral glutamine is poorly solvated compared to the positively charged arginine (Gelec solv ) -6 kcal/mol for 248Q vs. -100 kcal/mol for R248, Table 3). Thus, the mutant 248Q incurred a smaller desolvation penalty upon binding DNA compared to the wild-type R248 (∆Gelec solv ) 22 kcal/mol for 248Q vs 162 kcal/mol for R248, Table 3). To compensate for the loss of charge-charge interactions with 248Q, the negatively charged Thy 11′ phosphate O2P oxygen formed two new “close” contacts with the nearby positively charged R273 guanidinium nitrogens (the average distance between Thy 11′ O2P to the two R273 amino nitrogens decreased from 4.92 ( 0.44 Å in the p53-CD‚DNA simulation to 3.67 ( 0.42 Å in the respective 248Q mutant simulation). Furthermore, the Thy 11′ O5′‚‚‚R273 NΗ2 distance in the wildtype complex simulation (3.95 ( 0.40 Å) decreased to 3.27 ( 0.24 Å in the mutant complex simulation, but the Thy 11′ O5′‚‚‚R273 HΝΗ2 distance (2.70 ( 0.47 Å) indicates that this is not a hydrogen bond. The favorable R273-Thy 11′ chargecharge interactions in the mutant complex likely result in a negative ∆∆Gelec R273,gas (-5.0 kcal/mol, Table 1), which dominates ∆∆GR273 (-5.4 kcal/mol). Thus, R273 contributes favorably to DNA binding upon substituting R248 for Gln, effectively canceling the unfavorable contribution from the mutant residue (∆∆G248Q ) 5.5 kcal/mol). The proximity of the R273 side chain to the Thy 11′ phosphate in turn drew the D281 side chain, to which it is hydrogen bonded, to the Thy 11′ phosphate oxygen in the mutant structure (D281 Oδ2‚‚‚Thy 11′ O1P decreased from 5.79 ( 0.52 Å in the wild-type complex simulation to 3.85 ( 0.33 Å in the mutant complex simulation). The proximity of these two elec negatively charged groups resulted in a positive ∆∆GD281,gas + (6.3 kcal/mol, Table 1). Although the R273 -D281 -R280+ hydrogen-bonded network observed in the free and DNA-bound wild-type p53-CD simulations is preserved in the 248Q p53CD‚DNA simulation, D281 interacted with water molecules
13052 J. Phys. Chem. B, Vol. 106, No. 50, 2002
Figure 5. Hydrogen-bonding network in the average wild-type p53CD‚DNA structure in the vicinity of the major and minor grooves. Hydrogen bonds, as defined in Analyses, are depicted by dotted lines, whereas “close” contacts, as defined in Introduction, are depicted by dashed lines. Only certain key residues (discussed in text) and their hydrogen bonding partners are shown for the sake of clarity. The loopsheet-helix motif is in magenta while the L3 loop is in yellow. Oxygen, phosphate, hydrogen, and nitrogen atoms are colored red, orange, green, and blue, respectively.
rather than the R280 side chain but maintained its salt bridge with R273 in the free 248Q mutant simulation. Hence, the electrostatic solvation of the D281 side chain in the free mutant (Gelec solv ) 75 kcal/mol, Table 3) is not as unfavorable as that in the wild-type protein (Gelec solv ) 82 kcal/mol, Table 3), so elec is positive (9 kcal/mol, Table 1). Consequently, ∆∆GD281,solv both D281-DNA phosphate and D281-water interactions contribute unfavorably to DNA binding upon mutating R248 f Q in the wild-type p53-CD. The proximity of the D281 side chain to the Thy 11′ phosphate in turn attracted the R280 side chain to hydrogen bond to a Gua 10′ phosphate oxygen (O1P) in the mutant complex simulation, instead of a Gua 10′ base oxygen (O6) seen in the wild-type complex simulation (R280 NH2‚‚‚Gua 10′ O1P (3.80 ( 0.74 Å) decreased to 2.81 ( 0.16 Å in the 248Q mutant complex, whereas R280 NH1‚‚‚Gua 10′ O6 (3.53 ( 0.50 Å) increased to 4.49 ( 0.51 Å; compare Figure 6 with Figure 5). The gain in the R280-phosphate salt bridge outweighed the loss of the R280-base hydrogen bond in the mutant complex, resulting in a negative ∆∆Gelec R280,gas (-6.1 kcal/mol, Table 1) elec contribution that almost canceled the unfavorable ∆∆GD281,gas (6.3 kcal/mol). The observed changes in the R280-Gua 10′ interactions also caused R280 in the 248Q mutant to be less well solvated, especially when bound to the DNA. Consequently, upon binding DNA the R280 desolvation penalty in the 248Q mutant is slightly larger than that in the wild-type protein
Noskov et al.
Figure 6. Hydrogen-bonding network in the average 248Q p53CD‚DNA structure in the vicinity of the major and minor grooves. S121 Oγ hydrogen bonds to Thy 6 O4′ (atom hidden from view) and Thy 6 O5′. See also legend to Figure 5.
(∆Gelec R280,solv ) 174 vs 171 kcal/mol, Table 3). The positive ∆∆Gelec R280,solv (3.1 kcal/mol, Table 1) partially offsets the negative ∆∆Gelec R280,gas, hence R280 makes a small but favorable contribution to DNA binding upon mutating R248 f Q in the wild-type protein (∆∆GR280 ) -2.8 kcal/mol, Table 1). The observed changes in the network of interactions among Thy 11′, R273, D281, R280, and Gua 10′ were accompanied by a distortion of the major groove conformation. This is suggested by the different behavior of the RMSDs of the major groove base pairs (6-10) in the 248Q mutant complex simulation and wild-type X-ray structure (compare dotted and solid lines in Figure 4). In particular, the RMSDs of base pairs 7-9 showed a V shape in the 248Q mutant complex simulation but an inverted V shape in the wild-type X-ray structure. This distortion in the Gua 7-9 conformation correlates with the loss of native hydrogen bonds from Gua 7-9 to K120: the Gua 7 O1P‚‚‚K120 N (3.21 ( 0.30 Å), Gua 8 N7‚‚‚K120 Nζ (3.01 ( 0.21 Å), and Gua 9 O6‚‚‚K120 Nζ (2.90 ( 0.21 Å) distances lengthen to 5.81 ( 0.42 Å, 3.81 ( 0.42 Å, and 5.34 ( 0.42 Å, respectively, in the mutant complex simulation. In addition, the close contact between Thy 6 O3′ and K120 N (3.74 ( 0.39 Å) is lost (Thy 6 O3′‚‚‚K120 N ) 7.46 ( 0.41 Å). Hence elec ∆∆GK120,gas is large and positive (31 kcal/mol, Table 1). The observed loss in K120-DNA electrostatic interactions is partially compensated by a gain in K120-solvent electrostatic interactions in the 248Q mutant complex relative to the wild-type complex; hence, K120 incurred a smaller desolvation penalty upon binding
Long-Range Effects of Mutating R248 to Q/W
J. Phys. Chem. B, Vol. 106, No. 50, 2002 13053
TABLE 2: Changes in the DNA-Binding Free Energy Components of the Mutant 248W p53-CD Relative to Wild-type interface residue i
∆∆′Gi
elec ∆∆′Gi,gas
elec ∆∆′Gi,solv
vdw ∆∆′Gi,gas
nonel ∆∆′Gi,solv
L1: A119 L1: K120 L1: S121 Q136 L137 L3: N239 L3: S240 L3: S241 L3: C242 L3: M243 L3: N247 L3: 248W L3: R249 S10: V272 S10: R273 S10: V274 C275 A276 C277 H2: R280 H2: D281 H2: R283 ∑ib
4.7 2.2 -0.3 -0.2 0.1 0.0 0.6 3.4 -1.1 2.9 4.4 9.3 1.9 0.0 -4.3 -0.4 -0.8 0.9 0.5 -9.6 12.0 4.7 30.9
2.1 -3.5 0.3 0.8 -0.6 1.4 -0.6 3.4 -7.9 2.0 3.9 155.6 5.5 0.1 -2.5 -1.4 -1.4 1.8 1.3 -11.1 11.3 -1.4 159.2
1.8 4.2 -0.2 -1.1 0.7 -1.4 1.2 -0.5 6.6 -1.4 -1.1 -147.7 -3.8 -0.1 -1.6 0.9 0.7 -1.0 -0.8 0.8 1.0 6.0 -136.7
0.8 1.5 -0.3 0.0 0.0 0.0 0.1 0.5 0.1 2.4 1.4 1.0 0.0 0.0 -0.2 0.0 -0.0 0.1 -0.1 0.7 -0.4 0.1 7.8
0.1 0.1 -0.1 0.0 0.0 0.0 -0.0 0.0 0.1 0.0 0.2 0.3 0.1 0.0 0.0 0.1 0.0 -0.0 0.0 -0.0 -0.0 -0.1 0.7
a Values greater than 6 kcal/mol are highlighted in bold. b Free energy contributions from all interface residues.
DNA by the 248Q mutant compared to the wild-type protein elec ) 177 vs 191 kcal/mol, Table 3). (∆GK120,solv The loss of the K120 N‚‚‚Thy 6 “close” contact (see above) is compensated by the introduction of two hydrogen bonds from the serine side chain to Thy 6 in the mutant complex (the S121 Oγ to Thy 6 O4′ and Thy 6 O5′ distances decreased from 9.44 ( 0.48 and 7.42 ( 0.51 Å to 3.16 ( 0.31 and 3.04 ( 0.19 Å, elec respectively, Figure 6); hence, ∆∆GS121,gas is negative (-6.9 kcal/mol). Like R273 and R280, S121 contributes more favorably to DNA binding in the 248Q mutant than in the wild-type protein (∆∆GS121 ) -12 kcal/mol, Table 1). However, the formation of two hydrogen bonds from S121 to a single base (Thy 6) in the DNA major groove is not sufficient to compensate for the loss of hydrogen bonds from K120 to three bases (Gua 7, Gua 8, and Gua 9). The observed changes in the hydrogen bonding interactions and interaction energies indicate that mutating R248 f Q affects not only the interactions of the mutant side chain with DNA but also those of distant major groove-contact residues from the L1 loop (K120) and H2 helix (R280 and D281) as well as the phosphate-contact residue (R273). Thus, replacing R248 with Gln would significantly perturb binding of both DNA minorand major-grooves. R248 f W. As for the 248Q mutant, the 248W p53-CD is predicted not to bind to DNA as ∆∆G° ) 30 kcal/mol, similar to the sum of the free energy contributions from the interface residues (∑i∆∆′Gi ) 31 kcal/mol, Table 2). The loss of binding free energy upon mutating R248 f W is due mainly to unfavorable solute-solute electrostatic interactions elec (∑i∆∆′Gi,gas ) 159 kcal/mol) that are partially compensated elec by favorable solute-solvent electrostatic changes (∑i∆∆′Gi,solv ) -137 kcal/mol). It is also due, to a much lesser extent, to unfavorable protein-protein/DNA vdW interactions vdW (∑i∆∆′Gi,gas ) 7.8 kcal/mol). As for the 248Q mutant, the nonelectrostatic solvation free energy changes of the interface residues do not seem to contribute to the loss in binding affinity nonel (∑i∆∆′Gi,solv ) 0.7 kcal/mol).
Figure 7. Hydrogen-bonding network in the average 248W p53CD‚DNA structure in the vicinity of the major and minor grooves. See also legend to Figure 5.
Table 2 shows that the key residues contributing to the loss of DNA binding upon mutating R248 f W are the mutant residue itself and D281 (∆∆′Gi > 6 kcal/mol for i ) 248W and D281, Table 2). Like the mutation of R248 f Q, the R248 f W substitution resulted in a loss of native proteinphosphate interactions at the mutation site (compare Figure 7 with Figure 5), hence ∆∆′Gelec 248W,gas is positive (156 kcal/mol, Table 2). The loss of 248W-DNA electrostatic interactions is partially compensated by a much smaller desolvation penalty upon binding DNA for the neutral 248W mutant residue (∆Gelec solv ) 14 kcal/mol, Table 3) compared to the positively charged R248 wild-type residue, resulting in a negative ∆∆′Gelec 248W,solv (-148 kcal/mol, Table 2). As observed in the 248Q mutant complex, to compensate for the loss of 248W-DNA electrostatic interactions Thy 11′ O2P formed two new “close” contacts with the nearby R273 amino nitrogens in the 248W p53-CD‚DNA simulation. Hence ∆∆′Gelec R273,gas is negative (-2.5 kcal/mol, Table 2), but its magnitude is half of that found for the 248Q mutant in Table 1, probably because of the reduced charge-charge interactions between Thy 11′ O5′ and R273 as the Thy 11′ O5′‚‚‚R273 NΗ2 distance (3.67 ( 0.40 Å) is longer than that in the 248Q complex simulation (3.27 ( 0.24 Å). The proximity of the R273 guanidinium side chain to Thy 11′ drew its hydrogen bonding partner, the D281 side chain, to within “close” contact of the Thy 11′ phosphate group in the 248W mutant complex (D281 Oδ2‚‚‚Thy 11′ O1P ) 3.91 ( 0.27 Å compared to 5.79 ( 0.52 Å in the wild-type complex). In addition, the D281 backbone amide proton and carbonyl oxygen lost two hydrogen bonds seen in the wild-type and 248Q complex
13054 J. Phys. Chem. B, Vol. 106, No. 50, 2002
Noskov et al.
TABLE 3: Electrostatic Contributions to DNA-Binding Free Energies of Wild-type and Mutant p53-CDs elec ∆Gi,gas
elec a ∆Gi,solv
elec ∆Gi,solv
interface residue i
WT
248Qb
248Wc
WT
248Qb
248Wc
WT
K120
-216.3
-186.1
-219.7
-100.2 90.7 -1.3 0.2 -2.8 5.3 -100.1 61.4 -79.1 66.8 -0.4 7.7 -100.9 70.2 81.7 -74.5
-95.7 80.8 -0.15 0.55 -1.6 -0.9 -5.9 15.9 -81.5 73.9 0.5 7.6 -97.2 77.0 74.8 -72.8
-98.7 96.4 -1.0 0.3 -1.9 4.8 -0.1 13.7 -72.8 69.3 0.1 7.4 -95.1 76.8 72.5 -82.7
a
S121
0.16
-6.7
0.4
N239
-10.48
-0.9
-9.1
248
-152.3
-10.7
3.3
R249
-128.0
-138.1
-122.4
C277
1.5
2.5
2.8
R280
-172.9
-178.9
-184.0
D281
129.9
136.1
141.2
248Qb
248Wc
191.0
176.5
195.1
1.5
-0.7
1.3
8.1
0.7
6.7
161.5
21.8
13.8
145.9
155.4
142.1
8.1
7.1
7.3
171.1
174.2
171.9
-156.2
-147.6
-155.2
Numbers in top row are for the free state and those in the row below are for the DNA-bound state. b R248 f Q p53-CD. c R248 f W p53-CD.
simulation to the C277 carbonyl oxygen and the E285 amide proton, respectively. The additional loss of these backbonebackbone hydrogen bonds probably resulted in a more positive elec elec (11 kcal/mol, Table 2) than the ∆∆GD281,gas for ∆∆′GD281,gas the 248Q mutant (6.3 kcal/mol, Table 1). As found in the 248Q mutant complex, the R280 amino nitrogen that is hydrogen bonded to a D281 carboxylate oxygen is also hydrogen bonded to a Gua 10′ phosphate oxygen (R280 NH2‚‚‚Gua 10′ O1P decreased from 3.80 ( 0.74 Å in the wildtype complex to 2.82 ( 0.18 Å), at the expense of the other R280 amino nitrogen losing a hydrogen bond with a Gua 10′ base oxygen (R280 NH1‚‚‚Gua 10′ O6 increased from 3.53 ( 0.50 to 4.22 ( 0.58 Å, compare Figure 7 with Figure 5). The substitution of an R280-base charge-dipole interaction with an R280-phosphate charge-charge salt-bridge resulted in a negative ∆∆′Gelec R280,gas (-11 kcal/mol, Table 2) that almost canceled elec (11 kcal/mol), as found in the 248Q the positive ∆∆′GD281,gas mutant (see above and Table 1). In analogy to the 248Q mutant complex, the interaction changes in the Thy 11′‚‚‚R273‚‚‚D281‚‚‚R280‚‚‚Gua 10′ network correlated with a distortion in the major groove conformation, as evidenced by the different behavior of the RMSDs as a function of major groove base pairs (6-10) in the 248W mutant complex simulation and wild-type X-ray structure (compare dashed and solid lines in Figure 4). In contrast to the 248Q mutant complex, however, the Gua 9 O6 and Gua 8 N7 are e3.0 Å to the K120 side chain nitrogen, but their distances to the K120 ammonium protons (>2.5 Å) indicate an absence of hydrogen bonds. The Gua 7 O1P‚‚‚K120 N hydrogen bond and the Thy 6 O3′‚‚‚K120 N “close” contact found in the wildtype complex simulation are also lost in the 248W mutant complex simulation (compare Figure 7 with Figure 5). Although K120 contributes unfavorably to DNA binding upon mutating R248 f Trp in the wild-type protein, its relatively small ∆∆′GK120 (2.2 kcal/mol) suggests that it does not play a major role in the observed DNA affinity loss as compared to its role in the 248Q mutant (∆∆GK120 ) 19 kcal/mol, Table 1). The above results indicate that like the R248 f Q mutation, the R248 f W mutation affects not only the interactions of the mutant side chain with DNA but also those of distant major groove-contact residues. However, mutating R248 to Trp seems to have fewer long-range, extensive effects than mutating it to Gln (compare Figure 7 and Figure 6).
Discussion Comparison with Available Experimental Data. Mutant Structures. The two mutant structures obtained from the simulations show an overall wild-type conformation despite structural changes at the protein-DNA interface. The predicted wild-type conformation is in accord with the findings that the 248Q mutant is 86% folded at 37 °C21 and the R248 f W mutant yielded a stable wild-type-like core domain upon proteolytic digestion.19 Although 3d structures of the 248Q and 248W p53-CD mutants in either the DNA-bound or free state are unavailable, we can employ antibody binding data to verify the average mutant structures generated by the simulations. The wild-type p53 and the two mutants are recognized by the monoclonal antibody PAb1620 but not by the monoclonal antibody PAb240.18,52 PAb1620 is specific for p53 native-like conformations and recognizes, in particular, R209 and N210.52 These two residues are solvent exposed in the X-ray structure as well as in the free simulations reported here with SASA values greater than 44% (Table 4), indicating that they would be accessible to PAb1620. PAb240, however, recognizes a denatured conformation of the p53-CD that exposes its epitope, residues 212-217, located in the S7 β-strand.53 In the wild-type p53 X-ray structure, this region is folded against the hydrophobic core of the β-sandwich and is not solvent exposed (average SASA of residues 212-217 is 14%). The latter is comparable to the average SASA values for these residues in the wild-type (20%), 248Q (17%), and 248W (22%) p53-CD simulations, indicating that they are relatively buried and would therefore be inaccessible to PAb240. Thus, in the mutant simulation structures R209 and N210 can bind to PAb1620, whereas residues 212-217 cannot bind to PAb240, in accord with the experimental antibody binding results. NMR and equilibrium studies21,54 of the free 248Q p53-CD showed that 248Q is an “intermediate mutant-type”55 with an altered interface structure but an overall wild-type conformation. Chemical-shift changes of the 248Q p53-CD relative to the wildtype molecule were seen not only for residues in the vicinity of the mutation site but also for residues far from the mutation site, in helix H2 (containing R280 and D281), loop L1 (containing K120), loop L2, strand 2, and strand 2′.54 This is consistent with the change in the hydrogen-bonding network involving these residues in the simulation of the DNA-free 248Q p53-CD relative to the wild-type protein. Upon mutating
Long-Range Effects of Mutating R248 to Q/W
J. Phys. Chem. B, Vol. 106, No. 50, 2002 13055
TABLE 4: % SASA in Wild-type and Mutant p53-CD, Free and Bound to DNAa interface residue i
WTb
248Qc
248Wd
WT + DNAb
248Qc + DNA
248Wd + DNA
R156 L206 R209 N210 212-217 K120 S121 V122 248 R249 R273 C277 R280 D281 R283 T284 E285
(21.0) 33.0 (25.6) 18.2 (60.8) 73.1 (46.1) 50.5 (14.3) 20.4 (55.3) 49.5 (49.7) 57.7 (11.6) 19.1 (52.6) 67.0 (13.3) 21.1 (23.1) 22.0 (21.8) 32.2 (41.7) 38.2 (3.5) 11.4 (39.6) 37.5 (22.8) 31.6 (19.1) 34.2
26.0 27.3 64.7 44.1 16.6 38.3 40.8 8.3 48.6 21.6 28.1 33.1 47.1 19.3 38.0 42.8 35.3
21.3 22.7 65.0 50.5 21.6 48.2 34.3 18.6 56.4 23.8 22.7 33.5 41.7 7.5 39.1 34.6 24.7
(18.4) 25.2 (21.3) 49.7 (61.3) 65.8 (52.3) 55.9 (12.1) 18.8 (5.9) 13.8 (20.3) 23.3 (13.1) 12.6 (21.9) 42.4 (13.5) 22.0 (13.2) 18.6 (3.1) 6.6 (10.9) 19.1 (8.9) 8.3 (31.4) 40.3 (27.3) 42.1 (21.9) 31.3
27.8 23.3 68.3 51.2 19.1 28.6 45.6 10.1 45.0 22.4 13.3 23.3 18.5 9.6 43.0 44.4 25.7
22.7 22.9 66.2 51.4 19.2 22.3 38.6 11.7 44.4 23.3 14.8 6.3 20.4 7.8 42.8 48.0 35.3
a Percentage ratio of the water-accessible surface area of the side chain X in the protein to the accessible surface area of X in the tripeptide -Gly-X-Gly- where 50% indicates solvent exposed.61 b Numbers with and without brackets are based on the X-ray and average MD structures, respectively. c 248Q p53-CD. d 248W p53-CD.
R248 f Q in the free p53-CD, R249, located next to the mutation site, lost a backbone-backbone hydrogen bond with M246; D281 in helix H2 lost its salt-bridge with R280 and backbone-backbone hydrogen bonds with C277 O and E285 N; G117 in loop L1 lost its hydrogen bond with the T125 side chain. New hydrogen bonds linking helix H2 with β-strand S2 (via the R282 and T125 side chains) and with the L1 loop (via the G279 and K120 backbones) were formed during the 248Q p53-CD simulation. DNA Binding Affinity. Sequence-specific DNA binding is essential for the p53 tumor suppression function.3,56,57 The DNAbinding site for p53 contains a repeat of two consensus pentamer sequences: 5′-Pu-Pu-Pu-C-(A/T) (T/A)-G-Py-Py-Py-3′ (where Pu represents a purine (Gua/Ade), Py denotes a pyrimidine (Cyt/ Thy), and invariant bases are in bold).58 For the p53-CD and the DNA sequence studied here, DNA-binding specificity stems from the K120, C277, and R280 side chains hydrogen bonding to the Gua 8, Gua 9, Cyt 9′, and Gua 10′ base oxygen or nitrogen in the major groove, although K120 and C277 could also hydrogen bond to Ade 8 and Thy 9′, respectively. Nonspecific binding affinity is due to K120, S241, R273, A276, and R283 forming salt-bridges with the DNA phosphate. The binding affinity of p53 to a DNA consensus site depends on the precise p53 and DNA sequences as well as on the temperature. For instance, p53 with the last 30 amino acids removed or masked by an antibody such as PAb421 has a much higher DNA binding affinity than full length p53,56,59 and mutant p53-CDs often have higher DNA-binding affinity at 25 °C than at 37 °C.3,55,56 Consequently, the predicted change in DNA binding affinity upon R248 f Q or R248 f W substitution was compared with experimental data corresponding to the p53 core-domain (if available) binding to a site similar to the consensus site studied here, 7GGGCA11 12AGTCT16 at around 25°C. The 248Q p53-CD mutant failed to bind to the gadd45 oligonucelotide (GAACA TGTCT, where bases that differ from the consensus DNA sequence studied here are underlined) at 20 °C.21 Although data is not available for the binding affinity of the 248W p53-CD, the full-length mutant, in the absence or presence of monoclonal antibody PAb421, which binds to the p53 C-terminus, showed no detectable binding to the highaffinity DNA site, GGGCA TGTCC, at 25°C.55,56,60 Furthermore, a peptide derived from the C-terminus of full-length p53 that included the binding site for monoclonal antibody PAb421
restored sequence-specific DNA binding to various degrees for a subset of p53 mutants, but not for the 248W mutant.57 The nondetectable binding of the 248Q p53-CD or 248W full-length p53 mutant is consistent with the large positive free energy change computed for the 248Q and 248W mutants (g30 kcal/ mol). Factors Affecting Loss of DNA Binding upon Mutating of R248 f Q in p53. In previous work it was believed that the DNA binding loss of 248 mutants was due to the loss of critical minor groove DNA contacts from the positively charged R248 side chain.15 The results here show that indeed the 248Q mutant has lost two native close contacts with the DNA phosphate groups in the minor groove, but this is not the only effect the mutation has on DNA binding. The R248 f Q substitution also leads to significant structural changes in regions important for DNA binding distant from the mutation site, in accord with chemical shift observations (see above).54 In particular, one of the structural changes involves a loss of critical major groove DNA interactions from K120, which is predicted here to contribute more to the observed loss of binding affinity than the mutant 248Q residue (∆∆GK120 ) 19 kcal/mol vs ∆∆G248Q ) 6 kcal/mol, Table 1). The net DNA-binding free energy contribution from the three loop L1 interface residues (A119, K120 and S121) in the 248Q p53-CD is more unfavorable than that in the wild-type protein (∆∆GA119 + ∆∆GK120 + ∆∆GS121 ) 12 kcal/mol, Table 1). The motion of 248Q appears to be anticorrelated with that of the nearby R273: as 248Q moved away from the DNA phosphate, R273 moved closer to the Thy 11′ phosphate to compensate for the loss of 248Q-DNA phosphate contacts. Because R273 is salt-bridged to D281, which in turn is linked to R280, the proximity of R273 to the Thy 11′ phosphate drew the negatively charged D281 and the positively charged R280 side chains closer to the negatively charged DNA phosphate, resulting in a positive ∆∆′GD281 and a negative ∆∆′GR280, respectively. The net DNA-binding free energy contribution from the three hydrogen-bonded interface residues (R273, D281, and R280) in the 248Q p53 mutant is more unfavorable than that in the wild-type protein (∆∆GR273 + ∆∆GD281 + ∆∆GR280 ) 6 kcal/mol, Table 1). Thus, the loss of DNA binding affinity upon mutating R248 f Q in the wild-type p53-CD is due mainly to the loss of major groove contacts from K120 in loop L1 as well as to
13056 J. Phys. Chem. B, Vol. 106, No. 50, 2002 unfavorable interactions of D281 in helix H2. It is also due, to a lesser extent, to the loss of minor groove contacts from the mutant residue itself. Factors Affecting Loss of DNA Binding upon Mutating of R248 f W in p53. As for the R248 f Q substitution, the mutation site has lost contact with the DNA minor groove, resulting in an unfavorable DNA-binding free energy contribution (∆∆′G248W ) 9 kcal/mol, Table 2). The net binding free energy contribution from the three loop L1 interface residues in the 248W p53 mutant is unfavorable compared to that in the wild-type protein (∆∆′GA119 + ∆∆′GK120 + ∆∆′GS121 ) 6.6 kcal/mol), but favorable relative to that in the 248Q mutant (by 5.5 kcal/mol, see above). This is probably because, unlike the 248Q mutant, the K120 side chain nitrogen remains within hydrogen bonding distance of the Gua 8 and Gua 9 base oxygens, even though it does not form hydrogen bonds seen in the wild-type complex. Again in common with the R248 f Q mutation, to compensate for the loss of 248W-DNA phosphate contacts, R273 moved closer to the Thy 11′ phosphate, dragging with it the salt-bridged D281, which in turn drew its hydrogen-bonding partner, R280, to interact with the Gua 10′ phosphate instead of the base oxygen. Consequently, the positively charged R273 and R280 contribute favorably, but the negatively charged D281 contributes unfavorably to binding the negatively charged DNA upon mutating R248 f W. The net binding free energy contribution from these three hydrogen-bonded residues, R273, D281, and R280, in the 248W p53 mutant is comparable to that in wildtype p53 (∆∆′GR273 + ∆∆′GD281 + ∆∆′GR280 ) -1.9 kcal/ mol) and more favorable than that in the 248Q mutant (by 8.3 kcal/mol, see above). Thus, the loss of DNA binding affinity upon mutating R248 f W in the wild-type p53-CD is due mainly to the loss of minor groove contacts from the mutant residue itself. It is also due, to a lesser extent, to the loss of major groove contacts from loop L1 as well as to the poorer packing of the proteinDNA interface in the 248W mutant complex relative to the wildtype one (∆∆′GvdW i,gas ) 7.8 kcal/mol). Implications for Structure-Based Linker Rescue. The mutant structures generated in this study can be used with molecular docking methods to screen databases of small molecule compounds to identify potential ligands that may restore the binding affinity of mutant p53-CD toward DNA. The energetic and structural analyses suggest plausible sites that are not necessary at the mutation site to dock such ligands. They show that the major groove binding residue, K120, is a key contributor to the loss of DNA binding upon mutating R248 f Q in the wild-type p53-CD. Furthermore, experimental studies on the rescue of p53 mutants by second-site mutants found that the additional mutation of T284 f R in the DNA major groove enhanced binding of the 248Q full-length p53 mutant, with the last 30 residues deleted, to the high-affinity DNA site, GGGCA TGTCC, and rescued the tumor suppressor functions.16 This together with the present results suggest that it may be possible to rescue the DNA binding affinity of the 248Q mutant using an appropriate small ligand in the vicinity of the 284 site, instead of the mutation site at position 248 to provide a bridge to the DNA. This would bring the DNA closer to the protein and force the removal of competing water molecules from the proteinDNA interface, which may allow K120 to reestablish its native major groove contacts. Such a rescue mechanism was found in our previous work, in which mutating T284 f R in the mutant 273H p53-CD did not play a role by acting simply in place of R280 to bind to the DNA major groove. Instead, it helped to
Noskov et al. restore a “native-like” conformation of the protein-DNA interface, thus enabling R280 and K120 to regain their major groove contacts.22 However, for the 248W p53-CD, the mutant residue itself is the major contributor to the loss of DNA binding upon mutating R248 f W in the wild-type protein. In this case it may be possible to rescue the DNA binding affinity of the 248W mutant using an appropriate small ligand in the vicinity of the mutation site itself to provide a bridge to the Thy 11′ phosphate. Apart from the hope of regaining favorable interactions with the Thy 11′ phosphate, this may inhibit Thy 11′ phosphate from contacting the nearby R273, which in turn would help to maintain the native network of interactions among Thy 11′, R273, D281, R280, and Gua 10′, and thus a “native-like” conformation of the protein-DNA interface. We are carrying out calculations to dock ligands at the proposed sites of the 248Q and 248W p53 mutants. Acknowledgment. We thank Drs. C. S. Babu, and Y.-S. Lin for helpful discussions. We are grateful to Professor M. Karplus for the CHARMM program. S.N. is supported by a postdoctoral fellowship from Academia Sinica. This work is supported by grant # 91-2311-B-001-063 from the National Science Council, the Institute of Biomedical Sciences at Academia Sinica, and the National Center for High Performance Computing in the Republic of China. References and Notes (1) Harper, J. W.; Adami, G. R.; Wei, N.; Keyomarsi, K.; Elledge, S. J. Cell 1993, 75, 805-816. (2) El-Deiry, W. S.; Tokino, T.; Velculescu, V. E.; Levy, D. B.; Parsons, R.; Trent, J. M.; Lin, D.; Mercer, W. E.; Kinzler, K. W.; Vogelstein, B. Cell 1993, 75, 817-825. (3) Pietenpol, J. A.; Tokino, T.; Thiagalingam, S.; El-Deiry, W. S.; Kinzler, K. W.; Vogelstein, B. Proc. Natl. Acad. Sci. U.S.A. 1994, 91, 19982002. (4) May, P.; May, E. Oncogene 1999, 18, 7621-7636. (5) Wu, W.; Levine, A. J. Proc. Natl. Acad. Sci. U.S.A. 1994, 91, 36023606. (6) Ko, L. J.; Prives, C. Genes DeV. 1996, 10, 1054-1072. (7) Levine, A. Cell 1997, 88, 323-331. (8) Hollstein, M.; Rice, K.; Soussi, T.; Fuchs, R.; Sorlie, T.; Hovig, E.; Smith-Sorensen, B.; Montesano, R.; Harris, C. C. Nucl. Acids Res. 1994, 22, 3551-3555. (9) Bullock, A. N.; Henckel, J.; DeDecker, B. S.; Johnson, C. M.; Nikolova, P. V.; Proctor, M. R.; Lane, D. P.; Fersht, A. R. Proc. Natl. Acad. Sci. U.S.A. 1997, 94, 14338-14342. (10) Fisher, D. E. Cell 1994, 78, 539-542. (11) Morgan, S. E.; Kim, R.; Wang, P. C.; Bhat, U. G.; Kusumoto, H.; Lu, T.; Beck, W. T. Oncogene 2000, 19, 5010-5019. (12) Wang, Y.; Prives, C. Nature 1995, 376, 88-91. (13) Hussain, S. P.; Harris, C. C. Cancer Research 1998, 58, 40234037. (14) Hollstein, M.; Sidransky, D.; Vogelstein, B.; Harris, C. C. Science 1991, 253, 49-53. (15) Cho, Y.; Gorina, S.; Jeffrey, P. D.; Pavletich, N. P. Science 1994, 265, 346-355. (16) Wieczorek, A. M.; Waterman, J. L.; Waterman, M. J.; Halazonetis, T. D. Nat. Med. 1996, 2, 1143-1146. (17) Selivanova, G.; Kawasaki, T.; Ryabchenko, L.; Wiman, K. G. Seminars Cancer Biol. 1998, 8, 369-378. (18) Ory, K.; Legros, Y.; Auguin, C.; Soussi, T. EMBO J. 1994, 13, 3496-3504. (19) Bargonetti, J.; Manfredi, J.; Chen, X.; Marshak, D. R.; Prives, C. Genes DeV. 1993, 7, 2565-2574. (20) Park, J. D.; Nakamura, H.; Chumakov, A. M.; Said, J. W.; Miller, C. W.; Chen, D. L.; Koeffler, H. P. Oncogene 1994, 9, 1899-1906. (21) Bullock, A. N.; Henckel, J.; Fersht, A. R. Oncogene 2000, 19, 1245-1256. (22) Wright, J. D.; Noskov, S. Y.; Lim, C. Nucl. Acids Res. 2002, 30, 1563-1574. (23) Brooks, B. R.; Bruccoleri, R. E.; Olafson, B. D.; States, D. J.; Swaminathan, S.; Karplus, M. J. Comput. Chem. 1983, 4, 187-217. (24) Dunbrack, R. L. J.; Karplus, M. Nat. Struct. Biol. 1994, 1, 334340.
Long-Range Effects of Mutating R248 to Q/W (25) Bower, M. J.; Cohen, F. E.; Dunbrack, R. L. J. Mol. Biol. 1997, 267, 1268-1282. (26) Wright, J. D.; Lim, C. Prot. Eng. 2001, 14, 479-486. (27) Cornell, W. D.; Cieplak, P.; Bayly, C. I.; Gould, I. R.; Merz, K. M.; Ferguson, D. M.; Spellmeyer, D. C.; Fox, T.; Caldwell, J. W.; Kollman, P. A. J. Am. Chem. Soc. 1995, 117, 5179-5197. (28) MacKerell, J. A.; Wiorkiewicz-Kuczera, J.; Karplus, M. J. Am. Chem. Soc. 1995, 117, 11946-11975. (29) Jorgensen, W. L.; Chandrasekhar, J.; Madura, J. D.; Impey, R. W.; Klein, M. L. J. Chem. Phys. 1983, 79, 926-923. (30) Berkowitz, M.; McCammon, J. A. Chem. Phys. Lett. 1982, 90, 215217. (31) Eriksson, M. A. L.; Hard, T.; Nilsson, L. Biophys. J. 1995, 68, 402-426. (32) Lee, B.; Richards, F. M. J. Mol. Biol 1971, 55, 379-400. (33) Pascual-Ahuir, J. L.; Silla, E. J. Comput. Chem. 1990, 11, 10471060. (34) Koradi, R.; Billeter, M.; Wuthrich, K. J. Mol. Graphics 1996, 14, 51-55. (35) Nifosi, R.; Reyes, C. M.; Kollman, P. A. Nucl. Acids Res. 2000, 28, 4944-4955. (36) Wang, W.; Lim, W. A.; Jakalian, A.; Wang, J.; Wang, J.; Luo, R.; Bayly, C. I.; Kollman, P. A. J. Am. Chem. Soc. 2001, 123, 3986-3994. (37) Boresch, S.; Archontis, G.; Karplus, M. Proteins: Struct. Funct. Genet. 1994, 20, 25-33. (38) Boresch, S.; Karplus, M. J. Mol. Biol. 1995, 254, 801-807. (39) Brady, G. P.; Sharp, K. A. J. Mol. Biol. 1995, 254, 77-85. (40) Brady, G. P.; Szabo, A.; Sharp, K. A. J. Mol. Biol. 1996, 263, 123-125. (41) Gilson, M. K.; Given, J. A.; Bush, B.; McCammon, J. A. Biophys. J. 1997, 72, 1047-1069. (42) Noskov, S.; Lim, C. Biophys. J. 2001, 81, 737-750. (43) MacKerell, J. A. D.; Bashford, D.; Bellott, M.; Dunbrack, R.; Evanseck, J. D.; Field, M. J.; Fischer, S.; Gao, J.; Guo, H.; Ha, S.; JosephMcCarthy, D.; Kuchnir, L.; Kuczera, K.; Lau, F. T. K.; Mattos, C.;
J. Phys. Chem. B, Vol. 106, No. 50, 2002 13057 Michnick, S.; Ngo, T.; Nguyen, D. T.; Prodhom, B.; Reiher, W. E. I.; Roux, B.; Schlenkrich, M.; Smith, J. C.; Stote, R.; Straub, J.; Watanabe, M.; Wiorkiewicz-Kuczera, J.; Yin, D.; Karplus, M. J. Phys. Chem. B 1998, 102, 3586-3616. (44) Qiu, D.; Shenkin, P. S.; Hollinger, F. P.; Still, W. C. J. Phys. Chem. 1997, 101, 3005-3014. (45) Gilson, M. K.; Honig, B. Proteins: Struct. Func. Genet. 1988, 4, 7-18. (46) Sharp, K. Biophys. Chem. 1996, 61, 37-49. (47) Billeter, M. Quart. ReV. Biophys. 1992, 25, 325-377. (48) Chandrasekhar, I.; Clore, G. M.; Szabo, A.; Gronenborn, A. M.; Brooks, B. R. J. Mol. Biol. 1992, 226, 239-250. (49) Daggett, V.; Levitt, M. J. Mol. Biol. 1993, 232, 600-619. (50) Smith, P. E.; van Schaik, R. C.; Szyperski, T.; Wuthrich, K.; van Gunsteren, W. F. J. Mol. Biol. 1995, 246, 356-365. (51) Philippopoulos, M.; Lim, C. J. Mol. Biol. 1995, 254, 771-792. (52) Wang, P. L.; Sait, F.; Winter, G. Oncogene 2001, 20, 2318-2324. (53) Gannon, J. V.; Greaves, R.; Lane, D. P. EMBO J. 1990, 9, 15951602. (54) Wong, K.-B.; DeDecker, B. S.; Freund, M. V.; Proctor, M. R.; Bycroft, M.; Fersht, A. R. Proc. Natl. Acad. Sci. U.S.A. 1999, 96, 84388442. (55) Rolley, N.; Butcher, S.; Milner, J. Oncogene 1995, 11, 763-770. (56) Friedlander, P.; Legros, Y.; Soussi, T.; Prives, C. J. Biol. Chem. 1996, 271, 25468-25478. (57) Abarzua, P.; LoSardo, J. E.; Gubler, M. L.; Lu, Y.-A.; Felix, A.; Neri, A. Oncogene 1996, 13, 2477-2482. (58) El-Deiry, A. A.; Kern, S. E.; Pietenpol, J. A.; Kinzler, K. W.; Vogelstein, B. Nature Genet. 1992, 1, 45-49. (59) Hupp, T. R.; Meek, D. W.; Midgley, C. A.; Lane, D. P. Cell 1992, 71, 875-886. (60) Zhang, W.; Funk, W. D.; Wright, W. E.; Shay, J. W.; Deisseroth, A. B. Oncogene 1993, 8, 2555-2559. (61) Gromiha, M. M.; Oobatake, M.; Kono, H.; Uedaira, H.; Sarai, A. Prot. Eng. 1999, 12, 549-555.