Biochemistry 2004, 43, 14529-14538
14529
Structural Analysis of Actinorhodin Polyketide Ketoreductase: Cofactor Binding and Substrate Specificity‡ Tyler Paz Korman, Jason Anthony Hill, Thanh Nhat Vu, and Shiou-Chuan Tsai* Department of Molecular Biology and Biochemistry and Department of Chemistry, UniVersity of California, IrVine, California 92697 ReceiVed August 30, 2004; ReVised Manuscript ReceiVed October 6, 2004
ABSTRACT: Aromatic polyketides are a class of natural products that include many pharmaceutically important aromatic compounds. Understanding the structure and function of PKS will provide clues to the molecular basis of polyketide biosynthesis specificity. Polyketide chain reduction by ketoreductase (KR) provides regio- and stereochemical diversity. Two cocrystal structures of actinorhodin polyketide ketoreductase (act KR) were solved to 2.3 Å with either the cofactor NADP+ or NADPH bound. The monomer fold is a highly conserved Rossmann fold. Subtle differences between structures of act KR and fatty acid KRs fine-tune the tetramer interface and substrate binding pocket. Comparisons of the NADP+and NADPH-bound structures indicate that the R6-R7 loop region is highly flexible. The intricate protonrelay network in the active site leads to the proposed catalytic mechanism involving four waters, NADPH, and the active site tetrad Asn114-Ser144-Tyr157-Lys161. Acyl carrier protein and substrate docking models shed light on the molecular basis of KR regio- and stereoselectivity, as well as the differences between aromatic polyketide and fatty acid biosyntheses. Sequence comparison indicates that the above features are highly conserved among aromatic polyketide KRs. The structures of act KR provide an important step toward understanding aromatic PKS and will enhance our ability to design novel aromatic polyketide natural products with different reduction patterns.
Nature creates a huge array of natural products that are diverse in their chemical structures and bioactivity. One such example are the polyketides, a large family of natural products that are an extremely rich source of bioactive molecules (1, 2). Representative compounds include cholesterol-lowering drugs (such as lovastatin) (3), antibiotics (such as tetracyclines and actinorhodin), and anticancer agents (such as doxorubicin, Figure 1A) (1, 2). The biosynthesis of these medically important polyketides is achieved by polyketide synthase (PKS),1 which synthesizes polyketides in high quantity and yields. Similar to fatty acid synthase (FAS), the PKSs are multifunctional enzymes that catalyze repeated chain elongations followed by optional chain modifications (4). The variation in chain length, choice of chain-building units, and chain modifications leads to the huge diversity among naturally occurring polyketides. Over the past decade, PKSs have been targets of intensive manipulation and analysis via genetic engineering (2, 5). These studies have given rise to >100 “unnatural” natural products as well as new technologies for manipulating natural product biosynthesis (6, 7). However, this endeavor has been severely hampered by the lack of molecular information ‡ The atomic coordinates have been deposited in the Protein Data Bank (accession codes 1X7G and 1X7H). * Address correspondence to this author. E-mail:
[email protected]. Phone: 949-824-4486. Fax: 949-824-8552. 1 Abbreviations: KR, ketoreductase; FabG, β-ketoacyl (acyl carrier protein) reductase; act, actinorhodin; PKS, polyketide synthase; NADP, nicotinamide adenine dinucleotide diphosphate; NADPH, reduced nicotinamide adenine dinucleotide diphosphate; SDR, short-chain dehydrogenase/reductase; ACP, acyl carrier protein.
about PKS subunits. Structural analyses of PKS subunits will help to answer important questions about polyketide biosynthesis, such as the molecular basis of regio- and stereospecificity of each enzyme, as well as the influence of protein-protein interactions on substrate specificity. There are at least three architecturally different types of PKSs (8-10), and the focus of this paper is on type II or “aromatic” PKSs. The type II PKSs synthesize aromatic polyketides such as actinorhodin and tetracycline. They are comprised of 5-10 distinct enzymes whose active sites are used iteratively in the chain elongation cycle (11, 12). The polyketide chain is covalently linked to acyl carrier protein (ACP). Following iterative chain elongation by the ketosynthase (KS)/chain length factor (CLF) heterodimer, the first ring is formed uncatalyzed either in the active site of KS/ CLF or in the active site of KR, leading to intermediate 1 (Figure 1B). The polyketide chain is then reduced at the C9 position by ketoreductase (KR) to form intermediate 2, followed by subsequent aromatic ring formations catalyzed by aromatase and cyclase (12). Past studies has provided proposals for the function of each type II PKS subunit, as well as the origin of chain length control (11, 12). However, many key events are not well understood, such as the first ring formation and the molecular basis of the polyketide reduction. The first polyketide chain modification reaction by the ketoreductase (KR) (Figure 1B) is chemically identical to the corresponding fatty acid ketoreduction, in which NADPH reduces a ketone to an alcohol. However, the regiospecificity is very different. Whereas fatty acid KR reduces every
10.1021/bi048133a CCC: $27.50 © 2004 American Chemical Society Published on Web 10/30/2004
14530 Biochemistry, Vol. 43, No. 46, 2004
Accelerated Publications
FIGURE 1: (A) Examples of aromatic polyketide compounds that are therapeutically important. (B) Biosynthesis of actinorhodin. (C) Alternative cyclization between C5-C10 leads to the reduction at C7 position and the eventual product RM18b.
carbonyl group on the elongating chain, the aromatic polyketide KR has a high specificity for the C9-carbonyl group, except in special examples where an unusual cyclization pattern can lead to the reduction of other carbonyl groups (Figure 1C) (13, 14). Subsequently, the regiospecificity of the dehydration/cyclization reaction (catalyzed by ARO/ CYC) is closely related to the C9 reduction (12). How KR achieves such accurate regiospecificity is not well understood. Similarly, very little is known about the molecular basis of KR stereospecificity. To expand polyketide biosynthesis beyond the current scope, it is essential to understand the molecular basis of KR regio- and stereospecificity. The aromatic polyketide KRs are highly homologous, with a sequence identity of 39-80% (Figure 2). They belong to the short-chain dehydrogenase/reductase (SDR) family (15),
a large family of proteins that use NADPH or NADH as the cofactor and have an active site tyrosine in the center of the Rossmann fold (16). Among the 33 SDRs whose structures are available, the fatty acid KRs share the highest sequence identity to the polyketide KRs. Currently, four fatty acid KRs (also known as FabG) have been solved from Escherichia coli (17, 18), Thermatoga maritima (unpublished results), Mycobacterium tuberculosis (19), and Brassica napus (20). These fatty acid KRs share 30-43% sequence identity with the aromatic polyketide KRs. Past studies on these KRs have identified many residues that are important for enzyme catalysis. Further, the act KR-NADP+ complex crystallized in a condition similar to many SDR crystallization conditions and native gels indicate that the act KR-NADP+ complex is a tetramer, similar to many other SDRs (21). However,
Accelerated Publications
Biochemistry, Vol. 43, No. 46, 2004 14531
FIGURE 2: Sequence alignment among different SDRs. ActKR, granaticin, frenolicin, doxorubicin, griseusin, kinamycin, monesin, and encD are representative aromatic polyketide KRs. FabG•tuber, FabG•coli•1Q7B, and FabG•plnat•1EDO are the fatty acid KRs of M. tuberculosis, E. coli, and B. napus. TRI and TR2 are tropinone reductase I and II. DEBS•KR1, DEBS•KR6, and EPO•KR2 are type I (modular) polyketide KRs from DEBS module 1, module 6, and the second KR of epothilone synthase, respectively. Key: red star, catalytic tetrad; blue square, ACP docking site; blue triangle, cofactor binding; black circle, interface of monomers A and D; purple circle, interface of monomers A and B; blue circle, interface of monomers A and C; #, conserved hydrophilic residues; ! and $, conserved hydrophobic residues; %, conserved aromatic residues.
one important question remains unanswered: if enzymes in FAS and PKS pathways are similar, why does FAS make aliphatic acids while PKS generates such diverse products? Here, we report the cocrystal structures of the actinorhodin polyketide KR (act KR) bound to NADP+ or NADPH. No polyketide KR structure has been previously reported. These structures allow extensive structural comparisons between act KR and other SDR enzymes. Combined with docking analysis of act KR with either the polyketide substrate or ACP, we discuss possible mechanisms leading to the reduced polyketide product with respect to selective binding motifs and protein-protein interactions.
MATERIALS AND METHODS Expression and Purification. Strain BL21 λ(DE3) is an E. coli B strain lysogenized with λDE3, a prophage that expresses the T7 RNA polymerase from the IPTG-inducible lacUV5 promoter. Recombinant act KR was cloned into the pET28c vector (Novagen) by overnight restriction digestion of pRZ153 (22) with NdeI and EcoRI, followed by overnight ligation using T4 ligase, resulting in the plasmid pYT238. E. coli BL21 transformed with pYT238 was inoculated into 1 L of LB culture at 37 °C for 4 h followed by the induction of IPTG (1 mM) at 18 °C overnight. The cells were harvested
14532 Biochemistry, Vol. 43, No. 46, 2004
Accelerated Publications
Table 1: Crystallization, Data Collection, and Refinement Statistics KR-NADP+ (A) crystallization
(B) crystallographic data space group cell dimension (Å)
KR-NADPH
2 M ammonium 2 M sodium sulfate, 100 mM formate, 100 mM Tris (pH 8.5) sodium phosphate (pH 7.5)
P3221 104.61, 104.61, 124.25, R ) β ) 90, γ ) 120 resolution (Å) 2.3 mosaicity (deg) 0.4 no. of observations 943814 no. of unique reflections 35529 completeness (%) 99.8 (100.0) (last shell) I/σ(I) (last shell) 18.0 (2.9) Rmerge (%) (last shell) 11.5 (54.5) (C) refinement resolution (Å) 2.3 no. of reflections 33832 no. of protein atoms 3680 no. of cofactor atoms 96 no. of waters 164 Rfree (%) 24.2 Rcrys (%) 21.0 (D) geometry RMS bonds (Å) 0.006 RMS angles (deg) 1.31 RMS B main chain 1.35 RMS B side chain 2.49 Ramachandran plot (%) most favored 91 favored 9 generously allowed 0
P3221 103.70, 103.70, 122.68, R ) β ) 90, γ ) 120 2.2 0.4 1361501 39267 100.0 (99.8) 16.0 (1.8) 9.2 (50.6) 2.3 32785 3739 96 194 24.6 21.9 0.006 1.29 1.27 2.27 88.7 10.8 0.5
(5000 rpm × 30 min), resuspended in lysis buffer (50 mM Tris, pH 7.5, 0.5 mM DTT, 5 mM imidazole, 10% glycerol), sonicated (5 × 30 s, 15000 rpm × 1 h), and purified by Ni-NTA (10 mL gel volume, 80 mL wash with lysis buffer, 30 mL elution with 100 mM imidazole) to yield >95% pure protein. The buffer was exchanged to either 50 mM sodium phosphate (pH 7.0) or 50 mM Tris (pH 7.0) by overnight dialysis, and the protein was concentrated with the Centricon YM-10 apparatus to concentrations of 20 and 15 mg/mL, respectively. Crystallization of Act KR plus Cofactors. Cocrystals of act KR and NADP+ or NADPH were grown in sitting drops at room temperature by vapor diffusion. The protein buffer was 50 mM sodium phosphate (pH 7.0) and 10% glycerol for act KR plus NADP+ and 10 mM Tris (pH 7.0) for act KR plus NADPH. Drops were generated by mixing 2 µL of the purified protein solution (10 mg/mL protein, 5 mM NADP+ or NADPH) with 2 µL of well buffer above a well solution of 500 µL. The crystals of KR-NADP+ and KRNADPH grew in 1 week. The crystallization condition of KR-NADP+ is very similar to the one previously reported (21). The crystals of both KR-NADP+ and KR-NADPH yield the same space group and similar cell dimensions (Table 1). Data Collection. X-ray diffraction data of KR-NADP+ and KR-NADPH were collected at the Stanford Synchrotron Radiation Laboratory (SSRL) and Advanced Light Source (ALS) to 2.2-2.3 Å (Table 1). Crystals were frozen in 30% glycerol plus 70% well solution. Diffraction intensities were
integrated and reduced using the program DENZO and scaled using SCALEPACK (23). A summary of the crystallographic data is shown in Table 1. Molecular Replacement and Refinement. Initial phases were determined by molecular replacement using a homology model based on the crystal structures of E. coli FabG (PDB code 1Q7C) that was generated by Swiss Model (17). A cross-rotational search followed by a translational search was performed using the program CNS (24). The noncrystallographically related monomers were treated as rigid bodies and were refined using CNS to give an initial Rcrys of 48%. After the structure was rebuilt using Quanta, further refinement was performed using CNS (24). A preliminary round of refinement, using torsion angle simulated annealing followed by energy minimization and positional and individual B-factor refinement, reduced Rcrys to 35%. Subsequent rounds of model building and refinement were carried out using the maximum likelihood based approach implemented within CNS using all data to the highest resolution. Refinement was continued to an Rcrys of