Parametrization of Backbone−Electrostatic and ... - ACS Publications

Parametrization of Backbone−Electrostatic and Multibody Contributions to the UNRES Force Field for Protein-Structure Prediction from Ab Initio Energ...
0 downloads 0 Views 627KB Size
J. Phys. Chem. B 2004, 108, 9421-9438

9421

Parametrization of Backbone-Electrostatic and Multibody Contributions to the UNRES Force Field for Protein-Structure Prediction from Ab Initio Energy Surfaces of Model Systems† Adam Liwo,‡,§,| Stanisław Ołdziej,‡,§ Cezary Czaplewski,‡,§ Urszula Kozłowska,‡,§ and Harold A. Scheraga*,§ Faculty of Chemistry, UniVersity of Gdan´ sk, Sobieskiego 18, 80-952 Gdan´ sk, Poland, Baker Laboratory of Chemistry and Chemical Biology, Cornell UniVersity, Ithaca, New York 14853-1301, and Academic Computer Center in Gdan´ sk TASK, Narutowicza 11/12, 80-952 Gdan´ sk, Poland ReceiVed: July 14, 2003; In Final Form: October 3, 2003

The multibody terms pertaining to the correlation between backbone-local and backbone-electrostatic interactions in the UNRES force field for energy-based protein-structure prediction, developed in our laboratory, were reparametrized on the basis of the results of high-level ab initio calculations on relevant model systems. MP2/6-31G(d,p) ab initio calculations were carried out to evaluate the energy surfaces of pairs consisting of N-acetyl-N′-methylacetamide molecules (AcNHMe, which model a regular peptide group) and N-acetylN′,N′-dimethylacetamide molecules (AcNMe2, which model a peptide group preceding proline) at various intermolecular distances and orientations. For each pair, the calculated ab initio energy surface was subsequently fitted by a sum of Coulombic and Lennard-Jones components. Then, the restricted free-energy (RFE) surfaces of pairs of free peptide groups as well as the RFE factors corresponding to the coupling of backbone-local and backbone-electrostatic interactions in model tetrapeptides were calculated by numerical integration, with the use of the ab initio-fitted simplified energy functions and the ab initio energy maps of model terminally blocked amino acid residues calculated recently (Ołdziej, S.; Kozłowska, U.; Liwo, A.; Scheraga, H. A. J. Phys. Chem. B, in press, 2003). Next, analytical expressions based on Kubo’s generalized cumulant theory from our previous work were fitted to the resulting RFE surfaces to parametrize the backbone-electrostatic and multibody terms in the UNRES force field. The computed coefficients of the cumulant-based expressions are different from those derived earlier, which had been based on the ECEPP/3 force field. To complete the force-field parametrization, the weights of the energy terms were determined, and the coefficients of the cumulant-based expressions were refined simultaneously by using our recently developed method of hierarchical optimization of a protein energy landscape using the protein 1IGD. The resulting force field was able to predict significant portions of the structures of proteins with R, β as well as both R and β structure correctly.

1. Introduction In our continuing effort to develop a physics-based method for protein-structure prediction from amino acid sequence alone and to understand the mechanism of protein folding,1-7 in the past few years we have been developing a physics-based unitedresidue force field, hereafter referred to as UNRES.8-16 The force field is derived as a restricted free energy (RFE) function, which corresponds to averaging the energy over the degrees of freedom that are neglected in the united-residue model.8-10,12-14,17-19 An inherent feature of such force fields, resulting from the fact that they correspond to an averaged energy, is the appearance of correlation or multibody terms, the inclusion of which is necessary to describe the system of interest.10,14,20-22 The early version of the UNRES force field contained only a limited number of multibody terms corresponding to the correlation between hydrogen-bonding interactions of pairs of adjacent peptide groups.10 Nevertheless, this †

Part of the special issue “Jack H. Freed Festschrift”. * Corresponding author. E-mail: [email protected]. Tel: (607) 255-4034. Fax: (607) 254-4700. ‡ University of Gdan ´ sk. § Cornell University. | Academic Computer Center in Gdan ´ sk TASK.

version was capable of predicting the structures of helical proteins, as assessed during the CASP3 experiment,23,24 although it was unable to model β structures.12 Recently,14,19 we developed a general theory for deriving united-residue force fields from the all-atom energy surfaces through the factorization of the RFE function and the expansion of the factors in the series of Kubo’s generalized cumulants,25 which enabled us to derive analytical expressions for the multibody terms corresponding to the coupling between backbone-local and backbone-electrostatic interactions. With these terms, the UNRES force field is now able to treat proteins containing both R and β structure, as assessed in the CASP4 experiment.26 Until now, the multibody and also some other terms in the UNRES force field have been parametrized by averaging allatom energy surfaces of model systems14,18,19 computed with the ECEPP/327 force field. In general, empirical all-atom force fields are far from a perfect description of the details of interactions; additionally, the ECEPP/3 force field assumes rigid valence geometry, which can lead to an even worse description of the regions of the conformational space outside energy minima, which are included in the averaging. Therefore, and also encouraged by the fact that the model systems necessary

10.1021/jp030844f CCC: $27.50 © 2004 American Chemical Society Published on Web 03/26/2004

9422 J. Phys. Chem. B, Vol. 108, No. 27, 2004

Liwo et al.

to derive RFE surfaces are relatively small, we recently decided to reparametrize crucial terms that involve interactions within a protein backbone by using ab initio methods of molecular quantum mechanics. In our most recent study,16 we computed ab initio energy surfaces of terminally blocked glycine, proline, and alanine and used them to reparametrize torsional and doubletorsional potentials. The goal of our present work was to reparametrize backbone-electrostatic terms and the multibody terms corresponding to the coupling between backbone-local and backbone-electrostatic interactions. 2. Methods 2.1. UNRES Force Field. In the UNRES model,8-16 a polypeptide chain is represented as a sequence of R-carbon (CR) atoms linked by virtual bonds with attached united side chains (SC) and united peptide groups (p). Each united peptide group is located in the middle of two consecutive R carbons. Only these united peptide groups and the united side chains serve as interaction sites, the R carbons serving only to define the chain geometry (Figure 1). All virtual bond lengths (i.e., CR-CR and CR-SC) are fixed; the distance between neighboring CR atoms is 3.8 Å, corresponding to trans peptide groups, and the sidechain angles (RSC and βSC) and virtual-bond (θ) and dihedral (γ) angles can vary. The energy of the virtual-bond chain is expressed by eq 1.

U)

USC SC + wSCp∑USC p ∑ i