
ARTICLE pubs.acs.org/JACS

Addressing the Stereochemistry of Complex Organic Molecules by Density Functional Theory-NMR: Vannusal B in Retrospective

Giacomo Saielli,† K. C. Nicolaou,‡,§ Adrian Ortiz,‡ Hongjun Zhang,‡ and Alessandro Bagno*,^

† Istituto per la Tecnologia delle Membrane del CNR, Unità di Padova, Via Marzolo 1, 35131 Padova, Italy
‡ Department of Chemistry and The Skaggs Institute for Chemical Biology, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, California 92037, United States
§ Department of Chemistry and Biochemistry, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093, United States
^ Dipartimento di Scienze Chimiche, Università di Padova, via Marzolo 1, 35131 Padova, Italy

Supporting Information

ABSTRACT: We have employed density functional theory (DFT) protocols to calculate the NMR properties of the vannusals, a class of natural products whose structures have been the subject of recent investigations. The originally assigned structure of vannusal B was revised after a long synthetic journey which generated a series of closely related diastereomers. In this work we show how DFT calculations based on density functionals and basis sets designed for the prediction of NMR spectra (M06/pcS-2 level of theory) can be used to reproduce the observed parameters, thereby offering to the synthetic chemist a useful tool to discard or accept putative structures of unknown organic molecules.

INTRODUCTION

The domain of naturally occurring substances is a bottomless trove of intriguing molecules. Many of them are characterized not only by interesting biological activities (which is the reason why they are often chased) but also, from a more fundamental point of view, by their unusual structures. Indeed, the isolation of new molecules from natural extracts marks the starting point of structural investigations which generally culminate with a proposed structure. Such endeavors take advantage of a variety of spectroscopic methods, among which NMR spectroscopy plays an undisputed pivotal role. Thus, the isolated fractions are submitted to the large array of available experimental NMR methods, which ultimately (should) lead to stringent constraints on molecular structure and conformation. Nevertheless, even the wealth of information that can be so gathered may not be sufficient to arrive at an unambiguous structural proposal. It is not infrequent in the literature to see that an original structure is revised to one that better fits further spectroscopic data, or after total synthesis of the proposed structure has revealed that the spectra do not match.1

© 2011 American Chemical Society

However, the total synthesis of a natural product can only be undertaken once its structure has been narrowed down to a limited manifold, owing to the substantial cost of such work. It is then apparent that determining the structure of complex naturally occurring molecules involves a multifaceted investigation that exploits several resources and expertise, and often results in a long and winding path to the final target. Given such complexity, it is desirable to devise novel avenues that help to sort out candidate structures, especially at the spectroscopic stage.

Modern computational chemistry methods, especially DFT, have proven to be excellent tools for determining molecular structures. Recently, such capabilities have been broadened to span spectroscopic properties such as NMR chemical shifts and couplings,2–7 possibly aided by empirical methods.8 These developments have been widely exploited to determine molecular structures, including those of natural substances,9 with little if any recourse to empirical evidence; the latter strength is particularly critical whenever molecules with unusual or unprecedented constituents or connectivities are considered. Thus, high-level DFT calculations have aided the structure determination of arsenicin A10 and clarified issues on the structure of hexacyclinol,11,12 as well as helped in the structural revision of several other natural substances.13 However, such achievements have involved the comparison of constitutional isomers, which is just one of the issues at stake when complex organic molecules are involved. Indeed, most of their complexity arises from the possibility of many stereoisomers having very similar connectivities and magnetic environments of each nucleus. Goodman and co-workers have extensively investigated this issue, defining a novel statistical approach to select the correct diastereomer to be assigned to a single set of experimental NMR shifts, a common occurrence when dealing with natural substances.6b Not surprisingly, the issue of stereoisomerism is precisely the ground where many structural revisions have been undertaken.

One such case is provided by the vannusals. Vannusal B is a marine natural product that was isolated from the tropical interstitial ciliate Euplotes vannus. The originally assigned structure (structure 2-1 in Figure 1), a rather unusual molecular architecture consisting of a C30 molecular framework, seven rings, and thirteen stereogenic centers, was proposed on the basis of spectroscopic data, mainly NMR.14 In this study, we retrace the path that has, with time and effort, led to a revision of the originally proposed structure of vannusal B15 to the correct structure of this natural product.16 During this re-examination, we shall indicate the stages at which DFT calculations would have provided critical information to that effect.

Figure 1. The eight diastereomeric structures of the vannusals investigated in ref 15.

Received: February 4, 2011
Published: March 25, 2011
dx.doi.org/10.1021/ja201108a | J. Am. Chem. Soc. 2011, 133, 6072–6077

COMPUTATIONAL SECTION

All structures were optimized at the B3LYP/6-31G(d,p) level of theory, which we found to be adequate for organic molecules.3,9 NMR chemical shifts were calculated using the recently introduced hybrid M06 functional17 with the pcS-2 basis set, specifically designed for the calculation of NMR shielding constants.18 13C chemical shifts were calculated as δ = σref − σ, where σref is the shielding constant of TMS calculated at the same level of theory (σref = 176.225 ppm). For spin–spin coupling constants on model systems we used the hybrid B97-2 functional19 with the pcJ-2 basis set, specifically designed for the calculation of NMR scalar couplings,20 for consistency with our previous studies.12 Test calculations to estimate long-range solvent effects on the NMR properties were conducted using the PCM model with methanol as a solvent. All optimizations were run using the software package Gaussian 03,21 while NMR properties were calculated using Gaussian 09.22

We optimized only a single conformation for each one of the eight vannusals, where the hydroxyl groups were arranged so as to form intramolecular hydrogen bonds. It is unlikely that this conformation is highly populated in methanol solution, where hydroxyl groups will be involved in hydrogen bonding with the solvent; nevertheless, it is a consistent way to treat the molecules in the gas phase. It is expected that the OH orientations will only slightly affect 13C resonances.23 On the other hand, it would be impractical to account for the conformational population in methanol of the vannusals by computer simulation, since a force field having the necessary accuracy for such natural substances is not available.

The results have been statistically analyzed by linear regression of calculated shifts (δcalcd) against experimental ones (δexpt). The results were then evaluated in terms of the maximum absolute error (MaxErr) and the corrected mean absolute error (CMAE).4a,9 Both parameters are calculated with respect to the value predicted by the linear fit rather than to the experimental value, so as to avoid the possible bias introduced by a systematic error in the correlation, for example, caused by an inaccurate evaluation of the reference shielding.
Thus, MaxErr = max(|δcalcd − δfit|) and CMAE = (Σi|δcalcd − δfit|/n)/b, where δcalcd is the calculated chemical shift; δfit is the chemical shift obtained from the linear fit: δfit = (δexpt − a)/b; and a and b are the intercept and the slope of the fitting line comprising n data points. We have excluded from the correlations the resonances of olefinic and carbonyl carbons (C1, C2, C11, C12, C27, C31; see Supporting Information for full correlation graphs), which would flatten all statistical parameters by widening the range of chemical shifts up to about 200 ppm; focusing on a narrower range allowed us to highlight the differences in the region of interest for the comparison. We also remark that the vannusals have conformational degrees of freedom in the hydroxyisopropyl, acetate, aldehyde, and olefinic groups attached to the main carbon skeleton. Also important, as already mentioned, is the flexibility of the hydroxyl groups and their interaction with the protic solvent (methanol) in which the NMR spectra have been collected. We have neglected all these additional sources of variance in the calculated chemical shifts but, as we will see, the resulting 13C shifts are not significantly affected by these contributions.

Figure 2. Correlation between calculated and experimental 13C chemical shifts of strychnine. Calculated data with the B3LYP/cc-pVTZ protocol are displaced by 30 ppm along the y axis for clarity. Fitting parameters of δcalcd = a + bδexpt are a = 5.3 ppm, b = 1.0127 (B3LYP/cc-pVTZ); a = 0.7 ppm, b = 1.1117 (M06/pcS-2). The structure of strychnine is shown in the inset.
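The statistical treatment described in the Computational Section can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions, not the authors' code: the function names are hypothetical, the fit is taken in the form δcalcd = a + bδexpt quoted in the Figure 2 caption, and each residual is assumed to be measured between a calculated shift and the point a + bδexpt on that line. Only the TMS reference shielding (176.225 ppm) comes from the text.

```python
from typing import Dict, List, Sequence

# TMS 13C shielding at the M06/pcS-2 level, as quoted in the text (ppm).
SIGMA_REF_TMS = 176.225


def shifts_from_shieldings(sigmas: Sequence[float],
                           sigma_ref: float = SIGMA_REF_TMS) -> List[float]:
    """Convert shielding constants to chemical shifts: delta = sigma_ref - sigma."""
    return [sigma_ref - s for s in sigmas]


def fit_statistics(calcd: Sequence[float],
                   expt: Sequence[float]) -> Dict[str, float]:
    """Least-squares fit calcd = a + b*expt, plus R2, MaxErr and CMAE.

    Assumption: residuals are taken against the fitted line a + b*expt.
    """
    n = len(calcd)
    if n != len(expt) or n < 2:
        raise ValueError("need two equally long series with at least 2 points")
    mean_x = sum(expt) / n
    mean_y = sum(calcd) / n
    sxx = sum((x - mean_x) ** 2 for x in expt)
    syy = sum((y - mean_y) ** 2 for y in calcd)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(expt, calcd))
    if sxx == 0 or syy == 0:
        raise ValueError("shifts must not all be identical")
    b = sxy / sxx                      # slope
    a = mean_y - b * mean_x            # intercept
    residuals = [abs(y - (a + b * x)) for x, y in zip(expt, calcd)]
    return {
        "a": a,
        "b": b,
        "r2": sxy ** 2 / (sxx * syy),
        "max_err": max(residuals),             # MaxErr
        "cmae": (sum(residuals) / n) / b,      # mean absolute error scaled by slope
    }


if __name__ == "__main__":
    # Synthetic numbers for illustration only; not data from the article.
    expt = [20.0, 35.0, 50.0, 65.0]
    calcd = shifts_from_shieldings([155.0, 137.0, 121.5, 104.0])
    print(fit_statistics(calcd, expt))
```

Applying such a function to each candidate structure against the shifts of the natural product would yield one row of statistics per comparison, which is the kind of summary used later to judge candidate structures.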

RESULTS AND DISCUSSION

Calibration of the Computational Protocol. Before discussing in detail the results obtained for vannusals, we present the results of a calibration of the computational protocol used (see Computational Section). To this end we have selected strychnine as a test case since it is a rather rigid molecule, with several functional groups, and for which accurate NMR properties were recently calculated.9 In Figure 2 we compare the results obtained for 13C chemical shifts of strychnine using the previously tested protocol (B3LYP/cc-pVTZ) against the new one used here (M06/pcS-2). Clearly, using the newest functional and basis set increases the quality of the correlation. It is important to note that this excellent agreement is also a result of some advantageous occurrences. First, strychnine is fairly rigid and thus no conformational averaging of chemical shifts takes place. Second, the molecule is relatively nonpolar and the experimental data are collected in chloroform. Thus, external perturbations which affect the chemical shift to some degree are excluded and this allows a better appreciation of the performance of the various protocols. Therefore, we expect the agreement for vannusals (which are more flexible and possess many hydroxyl groups involved in hydrogen bonds with methanol) to be lower.9,23

13C NMR Chemical Shifts of Vannusals. The structures of the previously synthesized vannusals15,16 are shown in Figure 1. The original structure (2-1) proposed by Guella and co-workers14 features a carbon skeleton where an ethylene bridge (C14–C17) is arranged on the top side of the molecule. This arrangement is common to the four structures n-1 displayed on top of Figure 1. The four molecules differ in the stereochemistry of C21 and C25: (S,S), (R,S), (R,R), and (S,R) for 2-1, 3-1, 4-1, and 5-1, respectively. The other four structures at the bottom of Figure 1 (n-2) are epimeric to the corresponding top structures at the carbons indicated. These involve the whole “northeast” region of the molecule. The “southwest” region of the molecule has, instead, the same configuration for all compounds. Thus, the four bottom structures in Figure 1 can be viewed as the “northeast” enantiomers of the four top structures. The reassignment of the structure of vannusal B from the originally proposed architecture 2-1 to the correct structure 5-2 thus followed a two-station path: first it was necessary to realize that the true structure of vannusal B is a “northeast” enantiomer of the originally proposed molecule; then it had to be determined which of the four possible configurations at C21 and C25 was the correct one.

In Figure 3 we show the correlation between calculated 13C NMR chemical shifts of stereoisomer 2-1 (the originally proposed structure) with the experimental values of natural vannusal B (5-2) and the experimental values of the same compound (2-1). This presentation showcases the virtue of the correlation when a given putative structure is compared with the experimental values of the natural substance and underscores the performance of the DFT protocol when the correct experimental values are used. The bottom panel of Figure 3 can then be used as a sort of reference for the predictive power of the computational protocol. The calculated values are in rather good agreement with the experimental values of 2-1, highlighting the reliability of the protocol for this type of molecule even when solvent and conformational effects are neglected.

Figure 3. Calculated 13C chemical shifts of the originally proposed structure 2-1 plotted against (a) experimental values of natural vannusal B, 5-2; (b) experimental values of structure 2-1. The data point for C21 (a major outlier) is labeled.

In Table 1 we report the statistical parameters for all correlations presented. For structure 2-1 the R2 coefficient is close to 1 and both the maximum absolute error (MaxErr) and the corrected mean absolute error (CMAE) are small. If the correlation is done using the experimental values of vannusal B (Figure 3a) all statistical parameters deteriorate substantially; in particular C21 is largely in error and lies off its correlation line by more than 16 ppm. Had DFT-NMR shifts been available when the assignments were done, this observation would have suggested that the proposed structure was questionable.

Table 1. Statistical Parameters for the Correlations of 13C Chemical Shifts

calcd vs expt    a       b        R2       MaxErr   CMAE
2-1 vs 5-2       0.15    1.0983   0.9580   16.2     2.5
2-1 vs 2-1       1.03    1.0934   0.9877    4.5     1.7
2-2 vs 5-2       0.06    1.1104   0.9662   11.9     2.3
2-2 vs 2-2       0.84    1.1087   0.9819    8.0     2.0
3-1 vs 5-2       1.00    1.1331   0.9499   11.2     3.2
3-1 vs 3-1       0.66    1.0933   0.9846    6.0     1.8
3-2 vs 5-2       0.90    1.1066   0.9631   14.4     2.4
3-2 vs 3-2       0.52    1.0722   0.9852    7.9     1.7
4-1 vs 5-2       0.42    1.0895   0.9670   11.7     2.4
4-1 vs 4-1       0.79    1.0845   0.9830   10.0     1.7
4-2 vs 5-2       0.04    1.1012   0.9802    8.9     1.8
4-2 vs 4-2       0.24    1.1007   0.9847    9.3     1.5
5-1 vs 5-2       1.48    1.0737   0.9927    3.7     1.3
5-1 vs 5-1       1.89    1.0644   0.9948    3.3     1.0
5-2 vs 5-2       0.26    1.0898   0.9948    3.0     1.1

a and b are the intercept and slope of the linear fitting line, respectively, and R2 is its correlation coefficient. MaxErr is the maximum absolute error with respect to the linear fit. CMAE is the corrected mean absolute error (see text for the definition of both parameters).

The graphs of the correlations for the other structures are reported in the Supporting Information; it suffices here to analyze the statistical parameters of Table 1. Intercepts and slopes of the fitting lines are very similar to what is generally observed for 13C chemical shift correlations using DFT protocols.4–9 The R2 coefficient is quite instructive: when the correlation is done with respect to the experimental values of that particular structure R2 is well above 0.98, confirming the generally good performance of the DFT protocol. In contrast, when the calculated values are correlated with the experimental values of vannusal B the quality of the correlation is much lower (except, obviously, for 5-2 which is the true structure of vannusal B), indicating that the computational protocol is capable of distinguishing among the different vannusal structures. In fact, R2 rises systematically as we move from the originally proposed structure 2-1 to the correct structure 5-2, following the same path that was walked through during the quest for the true structure of vannusal B, that is, 2-1, 2-2, 3-1, 3-2, 4-1, 4-2, 5-1, and 5-2, gathering new intelligence as a new diastereomeric structure was synthesized and its NMR spectra were compared with those of the natural substance. Similarly, MaxErr and CMAE decrease, confirming that the synthetic route followed was converging toward the correct assignment.

Finally, in Figure 4 we show the correlation between experimental and calculated 13C chemical shifts obtained for the true structure of vannusal B. The statistical parameters, reported in Table 1 (last entry), are very good, particularly MaxErr and CMAE which are the best of all. We note that the “northeast” enantiomer of vannusal B (5-1) also has a very good correlation with the experimental values of the natural substance. Thus, it would have been rather difficult to distinguish between 5-1 and 5-2 based only on 13C chemical shifts.

Figure 4. Correlation between experimental and calculated 13C chemical shifts of vannusal B, 5-2.

As a final test we have estimated the long-range solvent effects on the carbon shielding by repeating the calculations for vannusal B (5-2) but including the self-consistent solvent reaction field of methanol by means of the PCM method.
Calculated shieldings were hardly distinguishable from the results obtained in the gas phase, thus confirming the weak dependence of carbon resonances on dielectric polarization9 (see results in the Supporting Information).

Vicinal Coupling Constants in Models of the “Northeast” Region of Vannusal B. In the experimental revision of the structure of vannusal B, an important role was played by the analysis of 3J(H,H) coupling constants, particularly for the couplings between H6 and H7 and between H21 and H25. For the first pair, the experimental value15a of 3J(H6,H7) (10.0 Hz) was in agreement with a trans arrangement of the two protons. This observation, together with others, confirmed the correct assignment of the “southwest” region of vannusal B. In contrast, the experimental coupling 3J(H21,H25) of vannusal B (2.0 Hz) was somewhat too small for the proposed configuration in 2-1, where both protons are on the same side and the dihedral angle is expected to be close to 0°, corresponding to a maximum in the Karplus curve. However, a simple Karplus approach may be questionable for vicinal couplings in a cyclic system with several substituents.4c Thus, Nicolaou and co-workers embarked on the synthesis of the four model systems of the “northeast” region, displayed in Figure 5. These diastereoisomers correspond to the four possible configurations of C21 and C25: (S,S), (R,S), (R,R), and (S,R). It is straightforward to discard the (R,S) and (R,R) configurations since the experimental values of 3J(H21,H25) (3–10 Hz) are too large compared with that of vannusal B (2.0 Hz).14a The measured couplings (Table 2) suggest that natural vannusal B has the same relative configuration as the (S,R) model system. The vicinal coupling in model (S,S), corresponding to the

Figure 5. Model systems of the “northeast” region of vannusals. TIPS, triisopropylsilyl; Bz, benzyl. For the sake of comparison, numbering is the same as in vannusals. Configuration labels refer to C21 and C25.
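The Karplus-type argument invoked for 3J(H21,H25) can be illustrated with the general three-term relation J(φ) = A cos²φ + B cos φ + C. The sketch below is purely illustrative: the function name is hypothetical and the coefficients are placeholder values assumed for this example, not parameters taken from the article or fitted to the vannusal model systems. As the text cautions, such a simple curve may be unreliable for polysubstituted rings, which is why the couplings were computed by DFT instead.

```python
import math

# Placeholder Karplus coefficients in Hz (assumed for illustration only;
# not taken from the article and not fitted to any vannusal model system).
A_HZ, B_HZ, C_HZ = 7.8, -1.0, 1.4


def karplus_3j(phi_deg: float,
               a: float = A_HZ, b: float = B_HZ, c: float = C_HZ) -> float:
    """Vicinal coupling J(phi) = a*cos^2(phi) + b*cos(phi) + c, phi in degrees."""
    cos_phi = math.cos(math.radians(phi_deg))
    return a * cos_phi ** 2 + b * cos_phi + c


if __name__ == "__main__":
    # Large couplings near 0 and 180 degrees, small ones near 90 degrees.
    for phi in (0, 30, 60, 90, 120, 150, 180):
        print(f"phi = {phi:3d} deg   3J = {karplus_3j(phi):5.2f} Hz")
```

With any physically reasonable coefficients of this form, a dihedral angle near 0° gives a coupling of several hertz, which is the qualitative basis for regarding the observed 2.0 Hz as too small for the syn arrangement in 2-1.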

Table 2. Experimental and Calculated 3J(H21,H25) Values (Hz) in the Model Systems of Figure 5a

model    expt    calcd    jb

S,S