Sequential Assignments and Identification of Secondary Structure

Mar 4, 1994 - Michael J. Osborne,* 1 Lu-Yun Lian,§ Russell Wallis,11 Ann Reilly, Richard James,* Colin Kleanthous,*and. Geoffrey R. Moore*·1. School...
0 downloads 0 Views 958KB Size
0 Copyright 1994 by the American Chemical Society

Volume 33, Number 41

October 18, 1994

Art ides

Sequential Assignments and Identification of Secondary Structure Elements of the Colicin E9 Immunity Protein in Solution by Homonuclear and Heteronuclear NMR+ Michael J. Osborne,* Lu-Yun Lian,s Russell Wallis,ll Ann Reilly,ll Richard James,ll Colin Kleanthous,ll and Geoffrey R. Moore'.$ Schools of Chemical and Biological Sciences, University of East Anglia, Norwich, NR4 7TJ, U.K., and Biological NMR Centre, Medical Sciences Building, University Road, University of Leicester, Leicester, LE1 9HN, U.K. Received March 4, 1994; Revised Manuscript Received May 25, 1994'

'H-lH, lH-15N, and 'H-'H-15N multidimensional N M R spectroscopic studies of the 86 amino acid protein that provides immunity against the DNase action of colicin E9 are reported. Through a combination of 2D NOESY and TOCSY and 3D TOCSY-HMQC, NOESY-HMQC, and HMQC-NOESYH M Q C experiments, almost complete 'H N M R and backbone I5N N M R assignments have been obtained, and the secondary structure of the protein has been partially elucidated. Approximately 50% of the protein forms three helices. The specificity determining region of the DNase immunity protein, identified from previously reported biochemical studies to include residues 32-40, is helical, indicating that the proteinprotein interaction involves residues from at least one helix.

ABSTRACT:

Colicins are plasmid-encoded bacterial proteins which have antibiotic activity and are secreted as part of the stress response system of the producing organism (Luria & Suit, 1987). E-group colicins recognize the vitamin Blz extracellular receptor encoded by the btuB gene on target cells (DiMasi et al., 1973). Following binding to their receptor, the E-type colicin is translocated across the outer and cytoplasmic membranes, and its cytotoxic activity then leads to cell death. Three cytotoxic classes have been identified in the E-group colicins: pore-forming ionophores such as colicin E l (Cramer et al., 1983; Pattus et al., 1990), RNases such as colicins E3 This work was supported by the Wellcome Trust and by the U.K. Science and Engineering Research Council via their establishment of both the Leicester Biological NMR Centre and the UEA research groups as Molecular Recognition Centres and via providing research assistantships to M.J.O. and R.W. * Author to whom correspondence should be addressed; Fax: 0603259396; e-mail address: [email protected]. t School of Chemical Sciences, University of East Anglia. 5 University of Leicester. 11 School of Biological Sciences, University of East Anglia. e Abstract published in Advance ACS Abstracts, September 1, 1994.

0006-2960/94/0433-12347%04.50/0

andE6 (Bowmanet al., 1971;Senior & Holland, 1971;Akutsu et al., 1989), and DNases such as colicins E2 (Schaller & Nomura, 1976), E7 (Chak et al., 1991), E8 (Toba et al., 1988), and E9 (Eaton & James, 1989). Colicins are produced by host organisms in concert with the production of a small immunity protein that inhibits their cytotoxic action. The immunity proteins are completely specific for their coordinately produced colicin despite their high sequence similarities (James et al., 1992). Wearestudying themodeofactionandinhibition of the DNaseclass of E-type colicins. Concentrating on colicin E9 and its 86 amino acid immunity protein Im9,' we have shown that the specificity determining region of Im9 is located between residues 16 and 43 (Wallis et al., 1992a), and the Abbreviations: Im9, immunity protein of colicin E9; Im3, immunity protein of colicin E3; NMR, nuclear magnetic resonance; NOE, nuclear Overhauser effect; 2D, two dimensional; 3D, three dimensional; HOHAHA, homonuclear Hartman-Hahn; TOCSY, total correlation spectroscopy; NOESY, nuclear Overhauser enhancement spectroscopy; HMQC, heteronuclear multiple-quantum coherence; HSQC, heteronuclear single-quantum coherence;COSY, correlated spectroscopy;DQFCOSY, double-quantum filtered COSY; INEPT, insensitive nuclei enhanced by polarization transfer.

0 1994 American Chemical Society

12348 Biochemistry, Vol. 33, No. 41, 1994 DNase activity of colicin E9 resides in its C-terminal domain composed of 128 amino acids (Curtis & James, 1991; Wallis et al., 1994). Through a combination of gene fusion, chemical modification, and site-directed mutagenesis of the DNase colicins and their immunity proteins, we aim to define the mechanisms of toxicity and immunity. This requires that the structures of the colicins and immunity proteins be determined. We have used NMR to provide structural information on these proteins since attempts to obtain crystals suitable for X-ray structural determination have not been successful. Detailed high-resolutionNMR studies of the DNase colicins and their immunity proteins have not previously been reported, although we have reported a preliminary description of the IH NMR characteristics of Im9 (James et al., 1992). N M R studies of the 84 amino acid immunity protein of the RNase colicin E3 (Yajima et al., 1992) have been reported, but this protein bears no sequence identity to Im9 and, as we show in the present paper, has a markedly different structure from that of Im9. Thus, the data reported herein, which include the almost complete ' H N M R assignments and complete backbone peptide lSN N M R assignments of Im9 together with the identification of its secondary structure elements, contribute the first set of structural information for this class of colicin DNase inhibitors. This structural information, together with biochemical data, leads to a model for the mediation of specificity in DNase colicin-immunity protein complexes, and this model is described briefly in the present paper. A preliminary report of this model has been published elsewhere (Kleanthous et al., 1994). EXPERIMENTAL PROCEDURES

Sample Preparation. Im9 was expressed in Escherichia coli and purified as previously described (Wallis et al., 1992b). All samples prepared for N M R were assayed for biological activity as described by Wallis et al. (1992b) and found to be fully active. Uniformly lSN-labeled Im9 was obtained by growing the E . coli JM105 cells containing the expression system in minimal medium with lSNH4C1 (1 g/L). Concentrations of NMR samples in either 90% 'H20/10% 2H20 or 99.9% 2H20 were typically 1 - 4 mM. The N M R solutions were prepared from Im9 lyophilized from either 'H20 or 2H20 solutions containing sodium phosphate at pH 7.0 or 6.2 to yield NMR samples with a final phosphate concentration of 25-50 mM. N M R Spectroscopy. Preliminary N M R experiments were carried out on a JEOL GX-400 NMR spectrometer as previously described (James et al., 1992), but the majority of the data reported in the present paper were acquired on Bruker AM500 and AMX600 spectrometers with 'H frequencies of 500.1 3 and 600.13 MHz, respectively, and lSNfrequencies of 50.664 and 60.79 MHz, respectively. Spectra measured with Bruker instruments were recorded in the pure phase absorption mode by the time-proportional phase incrementation method (Redfield & Kunz, 1975;Marion & Wuthrich, 1983). Proton chemical shifts were measured from internal dioxane at 3.77 ppm. Nitrogen chemical shifts were referenced using external 1SNH4NO3 (5 M lSNH4N03 in 2 M HNO3) with the ammonium lSNresonance taken as 0 ppm. Typical 90' pulse widths were as follows: AM500 ['H, 8 ps; 'H (homonuclear spin lock) 15 ps; ISN (hard), 18 1 s ; lSN(GARP (Shaka et al., 1985) or WALTZ-16 (Shaka et al., 1983) decoupling), 85 ps]; AMX600 [IH, 8.5 ps; IH (homonuclear spin lock) with MLEV17 (Bax & Davies, 1985) 15 ps and DIPSI-I1 (Shaka etal., 1988) 32ps; lSN (hard), 18ps; lSN(GARPor WALTZ16 decoupling), 100 ps]. Standard pulse sequences and phase cycling were employed for the 2D homonuclear experiments.

Osborne et al. For experiments requiring homonuclear Hartman-Hahn spin-lock sequences [2D HOHAHA (Braunschweiler BE Ernst, 1983; Bax et al., 1985), and 3D lSN-'H TOCSY-HMQC (Zuiderweg & Fesik, 1989; Marion et al., 1989)] totalmixing times of 60 and 80 ms were used. NOESY (Jeener et al., 1979; Macura et al., 1981) data were collected using mixing times of 100 and 150 ms. Typical data set sizes for 2D homonuclear experiments were 512 increments of 2K complex data points. These yielded, after zero-filling, spectra with digital resolution of about 2-3 Hz per point in each dimension. Prior to Fourier transformation, either a sine-bell squared ( ~ / 3phase shift) or a Lorentzian-to-Gaussian window function was applied in both F2 and F1 dimensions. For measurements in 'H20, either a low-power phase-locked presaturation of the water signal or a combination of a soft presaturation and a 1 - i pulse was used, the latter being employed in cases where the sample concentrations were low (-1-2 mM). To implement the latter method in a 2D homonuclear HOHAHA experiment, a soft presaturation pulse was applied during the preparation period, and after the spin-lockperiod, a three-pulse sequence was used that included a flip-back 90° pulse as the read pulse (Lian et al., 1991). Compared with a normal presaturation experiment, thequality of a 2D spectrum obtained in this manner is significantly improved in the fingerprint region of the spectrum with a superior baseline. 2D lH-15N HSQC spectra were acquired by the method of Bax et al. (1990). 15N decoupling was carried out using either the GARP or WALTZ-16 sequence. Solvent suppression for the lH-lSN spectra was achieved by low-power presaturation. The lH-ISN J-coupling evolution and refocussing delays of the INEPT portions of the pulse sequences were set to 2.7 ms. Typical data set sizes were 5 12 increments of 2K or 4K complex points with spectral widths of 14 ppm in the 'H dimension and 36 ppm in the lSN dimension. The 'H carrier was placed on the water signal, and the I5N carrier was placed on 98 ppm. All heteronuclear 3D experiments were performed on the AMX600 spectrometer at 298K on a 4 mM sample at pH 6.2. The heteronuclear 3D TOCSY-HMQC, NOESY-HMQC (Zuiderweg & Fesik, 1989; Marion et al., 1989), and HMQCNOESY-HMQC (Frenkiel et al., 1990) pulse schemes used were similar to those reported. Typical data sizes for the 3D TOCSY-HMQC and 3D NOESY-HMQC experiments were 1024(t3) by 70(t2) by 200(tl). The data set size for the HMQC-NOESY-HMQC experiment was 1024 by 64 by 64. For all experiments, 16 scans per tl increment were acquired. The NOESY mixing time for the 3D experiments was 150 ms. Measurements of the 3 J ~coupling ~ . constants were obtained using the HMQC-J experiment (Kay & Bax, 1990) with a data set size 2212 tl increments of 2048 data points; zero-filling in F1 gave a final digital resolution of 0.75 Hz per point in the Fl dimension. 2D lsN-'H HSQC spectra of uniformly lSN-labeledprotein in 2H20were used to monitor the exchange of backbone amide protons. It was possible to obtain good quality 2D spectra on a 3 mM lSN-labeled sample in 10 min (256 f l increments). The N H solvent exchange experiment commenced after the addition of 99.9% 2H20 at 273K to a sample of lSN-labeled Im9 previously lyophilized from 50 mM potassium phosphate in 'H20 at pH 6.2. The sample was inserted in the magnet at 293K and allowed to come to thermal equilibrium, when the lock stabilized. The first 2D IsN-lH HSQC experiment

Biochemistry, Vol. 33, No. 41, 1994

NMR Studies of the Colicin E9 Immunity Protein

12349

0

T44 G79 G82

G49

T27

0

8

e

Q

e S65

153 .%

T11 0

S6 059

9

G66

T38 *

Y55

e

sv19e K80

T20

e e

F40

K72

V370 K57

H46

e

€4-

026

Y10

= d B

D61

e=

w74

E41

a

D9 Q

E14

C23eE31 0 T70

e

e

ee

073

R75

A13

e-1

V71

~

136

H39

E12

N78

L52

F83

N24

Q

a

s2B

e*

S28

L16

0

e-

M43

e

9

A77

0

K35+A 76

133

Bp

060

e W 4 017

118

e

4, 0

E30 17

Q

Q

DS I I

9.0

I

7.5

I

7.0

FIGURE 1: Selected region of the lSN-lH HMQC-J spectrum of uniformly I5N-labeled Im9 recorded at pH 6.2 and 20 OC in a 3 mM 90% H20/10%D20 solution. Assignments for backbone amide cross peaks are indicated.

was started 7 min after dissolution in 2H20. A total of six spectra were recorded at the following times, measured from the addition of *H20 to the lyophilized protein up to the midpoint of the acquisition time of each data set; 12, 24, 34 (all with two scans per t l increment), 50 (4 scans per tl increment), and 80 min (8 scans per t l increment). 2D spectra were processed on either the Bruker X32 data station using Bruker UXNMR software or on a Silicon Graphics Indigo R4000 workstation using FELIX (Hare research and BIOSYM). The 3D data sets were all processed on the Silicon Graphics Indigo R4000 workstation using FELIX. Typical processed 3D data sizes were 512(F3) by 256(F2) by 512(F1). When it was necessary to improve the digital resolution a 1024(t3) X 512(t2) X 256(t1) matrix was built and Fourier transformed in t 3 and 22. t l fids were then loaded separately and zero-filled to 16K complex points prior to Fourier transformation, thus resulting in a series of 1D spectra. For all spectra, a sine-bell squared ( ~ / shift) 3 window function was applied.

RESULTS Stability of Im9. Im9 was found to be appreciably soluble only above pH 6.0. At lower pH, solutions became turbid as the protein precipitated. At pH values of 6-7, samples of Im9 degraded if held for some hours at temperatures of 40 OC and above, and therefore most of the N M R data were collected at temperatures of 20-37 OC. A complicating factor for assignment was that, at pH values of 7.0, a number of amide

proton resonances were not observed due to exchange with the bulk solvent. Accordingly, the majority of 3D experiments were performed at lower pH values (6.2) and temperatures (20 "C), so as to maximize the number of N H signals. N M R Spectroscopy. Conventional 2D homonuclear IH N M R methods for identifying spin systems from scalar connectivities (COSY and TOCSY) and from dipolar connectivities (NOESY) produced limited information for the following reasons. First, there was severe overlap of peptide N H resonances. For example, 28 amide proton resonances have chemical shifts in the range 8.5-9.0 ppm of the ' H NMR spectrum. Second, 30 of the 86 amino acids of Im9 were expected to exhibit AMX spin systems for their a/@protons. These features, together with the presence of three prolines, considerably handicapped the identification of spin systems by the conventional sequential assignment procedure (Wiithrich, 1986). These problems were overcome by the use of HMQC experiments on uniformly enriched I5N samples. Figure 1 shows an expanded region of the 2D lH-l5N HMQC-J spectrum of such a sample. A total of 78 of the expected 82 backbone amide connectivities are observed; a further two cross peaks occur at lower 'H frequencies. Table 1 summarizes the assignments. Sequential assignments are readily obtained by heteronuclear NMR, making use of the 3D IH-l5N HOHAHAHMQC experiment to identify through-bond connectivities between peptide N H and side-chain protons and the 3D 'H15NNOESY-HMQC experiment to identify through-space

Osborne et al.

12350 Biochemistry, Vol. 33, No. 41, 1994 Table 1: Assignments of Immunity Protein of Colicin E9 (Im9p residue Leu-3 Lys-4 His-5 Ser-6 Ile-7 Ser-8 ASP-9 Tyr- 10 Thr-11 Glu-12 Ala- 13 Glu- 14 Phe- 15 Leu- 16 Gln- 17 Leu- 18 Val- 19 Thr-20 Thr-20 Thr-21 Ile-22 cys-23 Asn-24 Ala-25 Asp26 Thr-27 Ser-28 Ser-29 Glu-30 GIu-3 1 Glu-32 Leu-33 Val-34 Lys-35 Leu-36 Val-37 Thr-38 His-39 Phe-40 Glu-41 Glu-42 Met-43 Thr-44 Glu-45 His-46 Ser-48 Gly-49 Ser-50 Asp-5 1 Leu-52 Ile-53 Tyr-54 Tyr-55 Lys-57 Glu-58 Gly-59 Asp60 ASP-61 Ser-65 Gly-66 Ile-67 Val-68 Asn-69 Thr-70 Val-7 1 Lys-72 Gln-73 Trp-74 Arg-75 Ala-76 Ala-77 Asn-78 Gly-79 LYS-80 Ser-81 Gly-82 Phe-83 Lvs-84

I5N 103.92 101.11 94.76 92.89 106.18 92.89 97.51 99.74 91.20 103.41 98.20 98.77 103.05 99.10 101.71 102.68 97.09 97.47 97.47 98.57 98.87 98.75 94.66 103.92 98.83 87.77 95.12 92.07 104.22 98.75 98.77 99.93 97.56 99.71 100.02 101.13 96.51 100.67 99.52 100.55 99.45 93.58 84.63 90.42 98.85 97.31 87.80 99.55 105.88 93.93 90.48 93.29 96.89 101.95 102.85 93.85 101.00 100.43 89.02 94.08 104.74 98.40 97.96 99.59 101.16 100.03 97.35 101.10 98.17 99.66 99.48 93.02 86.98 97.28 97.06 88.32 95.30 103.1 1

NH 8.52 7.92 8.87 8.03 9.37 7.57 7.94 8.30 8.97 9.34 8.43 8.03 8.79 9.07 7.94 7.52 8.22 8.73 8.73 8.17 7.74 8.86 8.14 8.15 8.42 7.79 8.95 7.63 9.32 8.83 7.93 7.48 8.30 7.75 8.19 8.87 8.70 8.27 9.10 8.74 8.51 8.84 7.87 7.34 7.48 11.07 8.17 10.07 8.77 7.39 6.88 7.36 8.79 8.86 8.70 8.96 8.03 8.69 8.31 8.10 8.82 8.38 8.50 8.87 8.61 9.27 8.79 8.55 9.07 7.76 8.03 7.12 7.64 8.09 8.74 8.40 7.88 8.65

(YH 4.55 4.35 4.74 4.75 4.12 4.74 4.79 5.01 5.08 2.45 4.13 4.16 3.96 3.97 4.1 1 4.07 3.17 4.35 4.35 3.89 3.46 3.83 4.67 3.92 4.93 4.6 1 4.49 4.81 4.14 4.08 4.10 4.12 3.70 4.14 4.21 3.59 4.22 4.62 3.92 3.67 4.05 4.52 3.56 3.90 3.01 4.59 4.15, 3.89 4.13 4.62 3.89 3.54 4.22 5.02 4.31 4.10 4.33,3.75 4.7 1 4.7 1 4.05 3.90 3.90 3.66 4.63 4.07 3.66 3.97 4.25 4.26 3.56 4.19 4.21 4.57 3.88 4.58

PH 1.55,0.90 1.20,0.63 3.43 3.95, 3.72 1.65 3.95 2.83 3.59 4.81 1.86 1.46 2.79 2.91, 2.74 2.08 2.30 1.81, 1.54 1.93 3.56 3.56 4.50 1.76 2.70 3.00,2.74 1.52 2.93, 2.46 4.48 4.03 4.30,4.06 2.15 2.01, 2.16 2.16,2.25 2.20 2.20 1.68 2.16, 1.44 2.39 4.36 3.70 3.84

YH 1.56 1.02

GH 1.96, 1.30

0.90

others 6CH3 0.63.0.55 2.76

t

yCH3 0.49,6CH3 -0.13

1.30 2.05 2.56,2.40 1.65, 1.47 2.50 1.43

6CH3 0.93,0.83 GCH3 1.04,0.68 yCH3 0.88,0.77 yCH3 0.88,0.77 7CH3 0.88,0.77

1.22 1.22 1.13 1.85

yCH3 0.72 GCH3 0.56

0.97 2.41 2.40 2.40 1.75 1.75, 1.43 2.05

2.02

GCH3 1.04 yCHi 1.13, 1.02 c 2.96 6CHi 0.87 yCH; 1.22, 1.18

1.39 3.06,2.71, 2.15

2.11, 2.29

2.68 2.10, 1.47, e 2.26

2.50 2.69,2.18 3.94, 3.86 3.25, 2.88 1.65, 1.37 1.09 2.65, 2.57 3.19,2.59 1.74, 1.55

2.28.2.21

1.52 0.87,0.35

6CH3 0.50,0.35 6CH3 -0.15 yCH, -0.08 1.81

e 3.04 2.34,2.06

2.90,2.64 2.97,2.72 3.90 1.88 2.41 3.09, 3.01 4.55 2.55 1.91

1.73

yCH3 0.86,6CH3,0.45 yCH3 1.30, 1.26

1.23 1.78

2.16

yCH3 1.14,0.77 c 3.13 2.73,2.59, 2.32

1.1 1

1.28, 1.99

e

3.68, 3.56 1.40 1.46 2.34, 1.33 1.66, 1.45 3.98

3.17, 2.87

4.30, 3.59 2.71 4.70 1.70 4.25 1.48 1.93 c 3.00 Chemical shifts are in ppm referenced to external 15NH4N03at 0 ppm 15N and to internal dioxane at 3.74 ppm for IH. Uncertainties are f 0 . 1 ppm, I5N, and *0.02 ppm, IH. Values are quoted for Im9 at 20 OC and pH 6.2.

Biochemistry, Vol. 33, No. 41, 1994

N M R Studies of the Colicin E9 Immunity Protein

e

0" 0

0

e

l

Q 0 K72

I

9.0

I

8.5

I

8.0

I

7.5

I

I

7.0

D1 (PPmJ

FIGURE2: NH (D1axis) aliphatic (D2 axis) region of the TOCSY spectrum (80-ms mixing time) in H20 at pH 7.0 and 37 O C . Spin systems of various valine, leucine, isoleucine, and lysine residues are shown.

connectivities (Zuiderweg & Fesik, 1989; Marion et al., 1989). The 3D experiments overcome problems of overlap by spreading signals according to the frequency of the directly bound 15N chemical shift. However, the HMQC-NOESY experiment does not afford sequential amide proton NOE connectivities if the proton chemical shifts are degenerate. In such cases, the HMQC-NOESY-HMQC experiment was used to alleviate this degeneracy (Frenkiel et al., 1990). Identification of Spin Systems. Identification of spin systems was initiated by analysis of the 600-MHz 2D 'H-lH TOCSY spectrum of Im9 in 90% lH20/10% 2H20 (Figure 2). DQF-COSY spectra of Im9 in 90% 'H20/10% 2H20and in 99.9% 2H20 were used to distinguish between short-range coupled peaks (2-bond and 3-bond) and long-range coupled peaks in the TOCSY spectra. For some spin systems, this analysis was sufficient to identify them fully. For example, the five valine spin systems were readily identified in the TOCSY spectrum (Figure 2). However, for the longer chain aliphatic residues this approach was insufficient. Only one leucine spin system (Leu3) could be easily identified from the 2D TOCSY spectrum (Figure 2). For the remaining leucine residues, TOCSY experiments performed with mixing times up to 80 ms failed to show complete coupling patterns for these residues from the amide resonance. It was therefore necessary to bridge connectivities using the DQF-COSY spectrum, in particular making use of $H and 6CH3 correlations. Using these complementary spectra, it was possible to identify five of the six leucine spin systems and two of the four isoleucine spin systems. Identification of all the glycine, alanine, and threonine spin systems was also not straight-forward using 2D TOCSY and

12351

DQF-COSY spectra. This was partly because of resonance overlap as described previously, but mainly it resulted from complications arising from the degeneracy of resonances. These complications were partially resolved by the use of a 3D 'H15N HMQC-TOCSY spectrum with uniformly l5N-labeled Im9. Four of the eight threonine spin systems were readily identified from this spectrum, and assignment of the a-and @-protonswas afforded by the DQF-COSY spectrum. Six alanine-like spin systems were observed in the 3D spectrum. These arise from the four alanine residues and two of the remaining three threonine residues. There are eight amino acid residues which give rise to AMX spin systems corresponding to connectivities between the a-proton and two @-protons. These are serine, cysteine, aspartate, asparagine, phenylalanine, tyrosine, histidine, and tryptophan. When the @-protonsare degenerate, the amino acid appears as an AX2 spin system. For Im9,30 amino acids were expected to show this spin pattern, and a total of 42 AMX or AX2 spin systems were identified, of which two were assigned to asparagine residues based on NOE connectivities between the side-chain amide and @-protons. The excess number of this type of spin system is a consequence of amino acid residues whose coupling patterns do not behave in a straight-forward manner. These include glycine, glutamate, and glutamine residues. Scalar coupling experiments were used to assign the aromatic ring protons of the three histidine and sole tryptophan residues. NOESY experiments in 2Hz0were used to connect the ring and side-chain spin systems. Assignment of the remaining tyrosine and phenylalanine residues was complicated by the absence of the characteristic coupling patterns of the phenylalanine residues, which gave rise to tyrosine-like spin systems. It was not possible to use the one-dimensional spectra to alleviate this problem due to substantial overlap in the regions of interest. Accordingly, assignment of these residues was postponed to the sequential assignment analysis. Sequential Assignments. Many of the glutamate, lysine, arginine, asparagine, glycine, and aromatic spin systems were identified from the 3D spectra in combination with the 2D TOCSY and DQF-COSY spectra. However, only 39 spin systems were assigned to particular types of amino acids without sequence-dependent NMR data. The sequence-specificassignment of the Im9 spectrum relied heavily on the observation Of d"(i,l+l), daN(i,i+l),and d@N(i,i+l) NOE connectivities in the 3D *H-lSN HMQC-NOESY spectrum of l5N-labeledIm9. This stage of the analysis was made difficult by the large number of AMX and AX2 spin systems not assigned to amino acid type. However, sufficient unique sequences with identified spin systems were present to provide useful starting points for the analysis. Additionally, the sequential assignment procedure afforded identification of the previously unassigned aromatic residues. A large sequential stretch of da~(i,i+l), d"(i,i+l), d@N(i,i+l) connectivities were observed incorporating residues 30-46 (Figure 3). Assignment of this stretch was initiated by observation of NOES between the characteristic spin systems of leucine and valine. This dipeptide fragment occurs three times in the entire polypeptide sequence (Leul8-Vall9, Leu33-Va134, and Leu36-Va137). Assignment to Leu33Val34 was afforded by identification of NOE connectivities between the leucine and a glutamate/glutamine spin system (thus eliminating the Leu36-Val37 pair) and then connectivities from this spin system to a further glutamate-glutamine residue (thereby eliminating the Leu18-Val19 pair). Assignment of Lys35, whose amide proton and l5N chemical

Osborne et al.

12352 Biochemistry, Vol. 33, No. 41. 1994

€30

K35 0

@ CD Q

Q 6

-

44 0.31

QI

35,36

*

Q

30

04.2

w.7

4.6

FIGURE3: IHJH strips taken from the 3D l H 3 N NOESY-HMQC spectrum of uniformly lSN-labeledIm9 at pH 6.2 and 20 OC in H20. Slices were taken at 15N chemical shifts corresponding to the backbone amide resonances of residues E30-H46. Diagonal and sequential amide proton connectivities are shown.

shifts are degenerate with the Ala76 spin system was afforded by the identification of sequential NOEs to this residue from Va134. Throughout this polypeptide region, all but four of the residues exhibited their characteristic spin systems as identified from COSY and TOCSY spectra. Sequential assignments for the sequence of 20 residues, running from Ser65 to Lys84, was obtained from d"(i,i+l) connectivities starting from the unique tryptophan residue at position 74. Confirmation of the sequential assignments was obtained from the observation that nearly all the residues display their characteristic spin systems in J-correlation spectra. The spin systems of three residues (Lys80, Arg75, and Ile67) did not exhibit complete connectivities in the TOCSY and COSY spectra and were assigned based on sequential NOE connectivities to residues on either side of them, which afforded a unique tripeptide fragment in each case. The identity of Am78 was confirmed by observation of NOE connectivities between its side-chain amide and @-protons. The first stage of assignment had tentatively identified three glycine spin systems which show only one NH-CaH connectivity. Two of these spin systems are assigned to Gly66 and Gly79 based on the above sequential assignments. Sequential NOEs observed between the backbone atoms of isoleucine and threonine spin systems afforded assignment of the unique dipeptide Thr2 1-Ile22 and initiated further sequential assignments. Assignment of residues 12-25 was obtained on the basis of strong sequential d"(i,i+l) connectivities and theobservation ofa number OfdaN(i,i+l)anddaN(i,i+l) NOEs within this segment. The presence of a strong da~(i,i+l) NOE between a threonine spin system and the previously assigned Glu12 initiated

sequential assignment in the region Leu3 to Thrl 1. A further strong da~(i,i+l) connectivity between Thrl 1 and an AX2 spin system yielded assignment of Tyr 10-the presence of NOEs between the aromatic protons of Tyr 10 and the side-chain protons of Thrl 1 is further confirmation of this assignment. From Tyr 10 to Ile7, sequential assignments were based initially on the presence of strong d"(j,i+l) NOEs and substantiated by the observation of further sequential NOEs. Between residues Ile7 and Leu3, assignment was afforded by the observation of one or more sequential NOE connectivities. Throughout this region, residues show characteristic spin systems apart from Ile7 for which no connectivities from the a proton could be observed in COSY and TOCSY spectra. The segment of amino acids flanked by two proline residues stretching from 48 to 56 was assigned after the identification of sequential NOEs between an isoleucine and leucine spin system, which forms the unique dipeptide fragment Leu52Ile53. Sequential assignment from Leu52 to Ser48 was relatively straight-forward, with only Ser5O exhibiting an unusual spin system (it was not possible to detect connectivities from the a-to @-protons)in the TOCSY and COSY spectra. Due to chemical shift degeneracy of the amide protons of Asp51 with Tyr55 and Leu52 with Tyr54, unambiguous identification of sequential d"(i,i+l) connectivities for residues 52-55 required the 3D HMQC-NOESY-HMQCexperiment. Residues 57-61 were assigned based on the observation of strong sequential da~(i,i+l) connectivities and weak d"(i,i+l) NOEs. Assignments were initiated by observation of NOEs to and from the remaining lysine spin system that is otherwise unassigned. No NOE connectivities could be observed past Asp6 1.

Biochemistry, Vol. 33, No. 41, 1994

N M R Studies of the Colicin E9 Immunity Protein

12353

identity, location, and extent of regular secondary structure elements to be obtained by inspection of the a-proton chemical mWm.8 . . 1 W H O 00.0 0 0 0 0m.00 0. 'JNH~ shifts. Sequential a-protons with chemical shifts more than CSI 0.10 ppm lower than the random coil value are designated as dNN(i.i+l) daN(iii+l) being part of an a-helix, whereas sequential a-proton chemical dbN(i,i+l) dNN(iii+Z) shifts more than 0.1 ppm greater than the random coil value daN(ii+Z) are designated as @sheet. a-protons exhibiting shifts within daN(iii+3) daN(iiii-9 0.1 ppm of their random coil values are designated as coil. The chemical shift index for Im9 is given in Figure 4 and essentially concurs with NOE, coupling constant, and amide exchange data that shows the structure to be comprised of three helices spanning residues Glul2-Ala25, Glu30-His46, NH e x c h a n g e 'JNHa and Ser65-Ala76. CSI A major problem with the secondary structure analysis was dNN(i,i+l) the relatively poor dispersion of a-proton resonances which daN(i.i+l) dbN(i,i+l) prevented a thorough interpretation of the helical regions. In dNN(iii+Z) W(i.i+Z) particular, it was difficult to unambiguously assign a sufficient daN(i.i+3) number of connectivities which differentiate a-helices and 3lo-helices. We have therefore assigned these regions as helices only. 60 65 70 75 80 85 G D D D S P S G I V N T V K Q W R A A N G K S G F K Q G The sequential NOE connectivities for residues 12-25 00000 0 ~ 0 0 0 0 0 0 0 0 0 0 0 b000000 NHexchange indicate that this segment is helical. Despite the degeneracy m o a W H m m # W H D 8 . m 8 8 0 m O m I m W 'JNHa ! O O + o - - - - - - - - - - - 0 + 0 ! 0 0 CSI of a number of a-proton resonances for this segment, five da~(i,i+3)and daN(i,i+4) NOE connectivities were observed, consistent with a helix. The continuous stretches of strong d"(i,i+I) NOE connectivities for residues 30-46 and for residues 65-76 indicate that these regions are helical, consistent with the relatively small 3JNHa coupling constants and slowly H e l i x III I Tum I exchanging amide protons for residues toward the center of the two helical regions. The largenumber of daN(i,i+l),dm(i.i+l), FIGURE 4: Summary of sequential and medium-range NOE connectivities observed in the Im9 protein. The intensities of NOESare and da~(i,i+3) connectivities are consistent with this. categorized as strong, medium, and weak and are represented by the The conformation of other regions of the protein is not so thickness of the lines. Filled circles below the amino acid sequence clear. The strong sequential interresidue d a connectivities ~ (single letter code) represent slowly exchanging amide protons that plus the large 3 J ~coupling were observed in the 2D 15N-lH HSQC spectrum recorded 80 min ~ a constant value for Thrl 1 and after dissolvingthe protein in D20. Filled semicirclesand open circles TyrlO indicate that these residues have an extended conforindicate intermediateand fast exchangingamide protons, respectively. mation. For residues 3-8, the presence of large 3 J ~ ~ a c ~ u p l i n g Asterisks represent sequentialconnectivitiesthat could not be observed constants also suggests an extended conformation, and the due to the degeneracy of a-proton chemical shifts. Filled and open fast rates of amide exchange is consistent with these residues boxes indicate respectively 'JNH,,coupling constants of less than 6 Hz or larger than 8 Hz. CSI refers to the chemical shift index (see text); being removed from the protein interior. Slow exchange for (+) - and (0) refer to aH chemical shifts which are greater than, residues Asp9 to T h r l l suggests that regular structure less than, and within 0.10 ppm of the random coil value, respectively; commences at this point of the sequence and is substantiated (!) indicates Gly residues whose aH resonances are not degenerate by the observation of an NOE connectivity between Thrl 1 and that exhibited shifts both greater and less than its random coil and Lys84. Residues 26-29 probably form a loop between value. the 12-25 and 30-46 helices, consistent with the relatively Assignment of residues 26-29 was initiated by the presence large coupling constants and variable strength d"(i,i+l) NOES. of a strong daN(i,i+l) connectivity between an AMX spin system N O E data for residues 48-55 suggest that a small helix may and the previously assigned Glu30 spin system. With the be present in this region, although coupling constant and amide exception of Thr27, which exhibits an alanine-like spin system, exchange data (Figure 4) show no obvious patterns. Nevcharacteristic spin systems were shown by these residues. The ertheless, the presence of slowly exchanging amide protons complete set of assignments obtained in this work are given for residues Asp5 1-Tyr55 indicates that these residues may in Table 1. be buried within the protein surface and/or are involved in Identification of Secondary Structure. A description of hydrogen bonding networks. the secondary structure of Im9 together with a summary of There is evidence that residues Ala76-Gly79, which connect the N M R parameters identifying the secondary structure is the third helix and C-terminal segments, form a type I turn. given in Figure 4. Numerous studies (Wiithrich, 1986; This is characterized by the presence of a daN(i,i+*) connectivity Grzesiek et al., 1992; Ikura et al., 1991; Omichinski et al., between residues Ala77 and Gly79, a daN(i,i+3) NOE between 1992;Stockmanet al., 1992; Weiretal., 1993) haveestablished Ala77 and Lys80, and a large 3 J ~coupling ~ a constant that a qualitative interpretation of NOEs, 3 J ~coupling ~ a exhibited by Asn78. constants, and hydrogen exchange rates accurately define the Residues 80-84 show weak d"(i,i+l) connectivities, but location of regular secondary structure elements. We derived strong sequential d a N connectivities, which is characteristic NOEdata from homo- and heteronuclear 2D and 3D NOESY a an extended conformation. The interresidue d a N NOES spectra with mixing times of 150 ms. Approximate 3 J ~ ~ of are stronger than intraresidue d a NOEs, ~ confirming the values were extracted from HMQC-J experiments. Hydrogen extended conformation. Additionally, residues Lys80 and exchange rates were obtained as described in the methods Gly82 exhibit large 3 J ~coupling section. Additionally, Wishart et al. (1992) have devised a ~ a constants (>8 Hz). method known as the chemical shift index that allows the Extended backbone conformation is often associated with a 1

5

10 -.

15 -.

20 -.

25 ~~

M E L K H S I S D Y T E A E F L Q L V T T I C N A D T S S

NH e x c h a n g e

00000000...00.......ooooooooo

12354 Biochemistry, Vol. 33, No. 41, 1994

Osborne et al.

ASN 34

VAL34 THR38

Q

'

VAL 37

ARC 38 LYSIS

'AL 37

ASP 33

LEU 33 PHE 40

HIS39

PHE 40

CLU 31

LEU 36

ASP 34

ASP 34

CLN 33

LEU 33

HIS 39

PHE 40 ASN 36

'

CYS31

PHE40 LEU 36

FIGURE5: Helical wheel representations of residues 32-40 for all

the known nuclease-specific immunity proteins of the E-group colicins: Im9, Im2, Im8,and Im7. Numbering scheme is according to alignment with the Im9 sequence. @-sheet,characterized by typical interstrand NOES, but no such NOES could be observed involving these residues. The amide exchange rate for residues 80-84 might be expected to be fast considering their position toward the end of the C-terminus; however, both amides of Lys80 and Lys84 exhibit relatively slow rates of exchange, indicating they may be involved in hydrogen bonding. DISCUSSION

We have obtained an almost complete set of 'H N M R and peptide 15N N M R assignments for the colicin E9 immunity protein (Table 1) and determined the location of regular secondary structure elements (Figure 4). Overall, Im9 contains three helical segments, which comprise 50% of its structure. This is very different from the structure of the 84 amino acid immunity protein to the RNase colicin E3 (Im3) proposed by Yajima et al. (1992,1993) based on 'H and "N N M R data. The lack of apparent structural similarity is consistent with amino acid sequence comparisons which show no homology between Im3 and Im9 or the cytotoxic domains of colicins E3 and E9,even though other domains of the colicins are highly homologous. The specificity determining region of the DNase immunity proteins, Im8 and Im9, have been identified by gene fusion, mutagenesis, and chemical modification experiments to be located in the region 16-43 (Wallis et al., 1992a). Residues 32-40, which form part of a helical structure in Im9, appear to be particularly important. We assume that all four DNasespecific immunity proteins (Im2, Im7, Im8, and Im9) have similar structures based on the sequence alignment reported by James et al. (1987) and Chak et al. (1991), which shows them to be -60% homologous. Helical representations of residues 32-40 (Figure 5) are thus revealing. Such helices are amphipathic with a strongly polar side. All but Im2 have an extensive nonpolar surface, and depending upon the exact position of the Asp33 side chain of Im2, its helix may also behave with strong characteristic amphipathic properties. These include interactions of the nonpolar surface of the helix with the interior core of the protein and the exposure of the polar part of the helix to the solvent. This implies that the specificity determining groups include residues 34, 35, and 38, which is consistent with mutagenesis studies of Im9. In these Val34 was replaced by Asp34, with the result that some

-

immunity to colicin E8 was produced in the variant which also retained its colicin E9 immunity. Determination of the 3D structure of Im9 and other DNase immunity proteins together with N M R studies of the colicin-immunity protein complexes will allow the model for specificity determination outlined above to be explored. We note with interest that the recently reported X-ray structure of the complex formed by the bacterial ribonuclease barnase and its 89 amino acid inhibitor barstar shows that an a-helix of the inhibitor plays an important part in the proteinprotein interaction (Guillet et al., 1993), a situation similar to the one we propose for the colicin E9 system. In addition, the predominant secondary structure in both barstar and Im9 is the a-helix, although barstar also contains short segments of the@-sheet. This observation is even more intriguing when it is put in the context of another RNase inhibitor, the immunity protein of colicin E3 (Im3) which is predominantly @-sheet (Yajima et al., 1992, 1993). Thus it seems that, at least in terms of secondary structure elements, barstar is more similar to Im9, a DNase inhibitor, than it is to Im3, another RNase inhibitor. REFERENCES

Akutsu, A,, Masaki, H., & Ohta, T. (1989) J . Bacteriol. 171, 6430-6436. Bax, A., & Davies, D. G. (1985) J . Magn. Reson. 65, 355-360. Bax, A,, Davies, D. G., & Sarkar, S. K. (1985) J . Magn. Reson. 63, 230-234. Bax, A., Ikura, M., Kay, L. E., Torchia, D. A., & Tschudin, R. (1990) J . Magn. Reson. 86, 304-318. Bodenhausen, G., & Reuben, D. J. (1980) Chem. Phys. Lett. 69, 185-189. Bowman, C. M., Dahlaberg, J. E., Ikemura, T., Konisky, J., & Nomura, M. (1971) Proc. Natl. Acad. Sci. U.S.A. 68, 964968. Braunschweiler, L., & Emst, R. R. (1983) J . Magn. Reson. 53, 521-528. Chak,K.-F.,Kuo, W.-S., Lu, F.-M., & James,R. (1991) J . Gen. Microbiol. 137, 91-100. Cramer, W. A,, Dankert, J. R., & Uratani, Y . (1983) Biochim. Biophys. Acta 737, 173-179. Curtis, M. D., & James, R. (1991) Mol. Microbiol. 5, 27272733. DiMasi, D. R., White, D. C., Schnaitman, C. A., & Bradbeer, C. (1973) J . Bacteriol. 115, 506-513. Eaton, T., & James, R. (1989) Nucleic Acids Res. 17, 1761. Frenkiel, T., Bauer, C., Carr, M. D., Birdsall, B., & Feeney, J. (1990) J . Magn. Reson. 90, 420-425. Grzesiek, S . , Dobeli, H., Gentz, R., Garotta, G., Labhart, A. M., & Bax, A. (1992) Biochemistry 31, 8180-8190. Guillet, V., Lapthorn, A., Hartley, R. W., & Maugen, Y. (1993) Structure 1 , 165-176. Ikura, M., Spera, S., Barbato, G., Kay, L. E., Krinks, M., & Bax, A. (1991) Biochemistry 30, 9216-9228. James, R., Curtis, M. D., Wallis, R., Osborne, M. J., Kleanthous, C., & Moore, G. R. (1992) in Bacteriocins, microcins & lantibiotics (James, R., Lazdunski, C., & Pattus, F., Eds.) pp 181-201, NATO AS1 Series H, Springer, Heidelberg. James, R., Jarvis, M., & Barker, D. F. (1987) J . Gen. Microbiol. 133, 1553-1562. Jeener, J., Meier, B. H., Bachmann, P., & Ernst, R. R. (1979) J . Chem. Phys. 71, 4546-4553. Kay, L. E., & Bax, A. (1990) J . Magn. Reson. 86, 110-126. Kleanthous, C., Wallis, R., Reilly, A., Osborne, M. J., Lian, L.-Y., Moore, G. R., & James, R. (1994) in Bacterial Protein Toxins (Freer et al., Eds.) Zbl. Bakt. Suppl. 24, pp 230-240, Gustav Fischer, Stuttgart.

NMR Studies of the Colicin E9 Immunity Protein Lian, L. Y., Yang, J. C., Derrick, J. P., Sutcliffe, M. J., Roberts, G. C. K., Murphy, J. P., Goward, C. R., & Atkinson, T. (1991) Biochemistry 30, 5335-5340. Luria, S.E., & Suit, J. L. (1987) in Escherichia coli & Salmonella typhimurium, cellular & molecular biology (Neidhardt, F. C., Ed.) Vol. 2, pp 1615-1624, American Society of Microbiology, Washington, DC. Macura, S., Huang, Y., Suter, D., & Ernst, R. R. (1981) J. Magn. Reson. 43, 259-281. Marion, D., & Wuthrich, K. (1983) Biochem. Biophys. Res. Commun. 113, 967-974. Marion, D., Driscoll, P. C., Kay, L. E., Wingfield, P. T., Bax, A., Gronenborn, A. M., & Clore, G. M. (1989) Biochemistry 28, 6150-6156. Omichinski, J. G., Clore, G. M., Robien, M., Sakaguchi, K., Appella, E., & Gronenborn, A. M. (1992) Biochemistry 31, 3907-391 7. Pattus, F., Massotte, H. U., Wilmsen, J., Lakey, D., Tsernoglou, A., Tucker, A., & Parker, M. W. (1990) Experientia 46,180192. Redfield, A. G., & Kunz, S. D. (1975) J. Magn. Reson. 19, 250-254. Schaller, K., & Nomura, M. (1976) Proc. Natl. Acad. Sci. U.S.A. 73, 3989-3993. Senior, B. W., & Holland, I. B. (1971) Proc. Natl. Acad. Sci. U.S.A. 68, 959-963. Shaka, A. J., Keeler, J., Frenkiel, T., & Freeman, R. J. (1983) J . Magn. Reson. 52, 335-338. Shaka, A. J., Barker, 0. B., & Freeman, R. J. (1985) J. Magn. Reson. 64, 547-552.

Biochemistry, Vol. 33, No. 41, 1994

12355

Shaka, A. J., Lee, C. J., & Pines, A. (1988) J. Magn. Reson. 77, 274-293. Stockman, B. J.,Scahill, T. A., Roy, M., Ulrich, E. L., Strakalaitis, N. A,, Brunner, D. P., Yem, A. W., & Deibel, M. R., Jr. (1992) Biochemistry 31, 5237-5245. Toba, M., Masaki, H., & Ohta, T. (1988) J. Bucteriol. 170, 3237-3242. Wallis, R., Moore, G. R., Kleanthous, C., & James, R. (1992a) Eur. J. Biochem. 210, 923-930. Wallis, R., Reilly, A., Rowe, A., Moore, G. R., James, R., & Kleanthous, C. (1992b) Eur. J. Biochem. 207, 687-695. Wallis, R., Reilly, A., Barnes, K., Abell, C., Campbell, D. G., Moore, G. R.,James, R., & Kleanthous, C. (1994) Eur. J. Biochem. 220, 447-454. Weir, H. M., Kraulis, P. J., Hill, C. S., Raine, A. R. C., Laue, E. D., & Thomas, J. 0. (1993) EMBO 12, 1311-1319. Wishart, D. S., Sykes, B. D., & Richards, F. M. (1992) Biochemistry 31, 1647-1651. Wuthrich, K. (1986) in NMR of Proteins and Nucleic Acids, Wiley, New York. Yajima, S., Muto, Y., Yokoyama, S.,Masaki, H., & Uozumi, T. (1992) Biochemistry 32, 5578-5586. Yajima, S., Muto, Y., Morikawa, S., Nakamura, H., Yokoyama, S.,Masaki, H., & Uozumi, T. (1993) FEBS Lett. 333, 257260. Zuiderweg, E. R. P., & Fesik, S.(1989) Biochemistry 28,23872391.