Ion Mobility Mass Spectrometry Coupled with Rapid Protein Threading

Ion Mobility Mass Spectrometry Coupled with Rapid Protein Threading Predictor Structure Prediction and Collision-Induced Dissociation for Probing Chem...
0 downloads 0 Views 2MB Size
Article pubs.acs.org/ac

Ion Mobility Mass Spectrometry Coupled with Rapid Protein Threading Predictor Structure Prediction and Collision-Induced Dissociation for Probing Chemokine Conformation and Stability Milady R. Niñonuevo and Julie A. Leary* Department of Molecular and Cellular Biology, University of California, Davis, California 95616, United States S Supporting Information *

ABSTRACT: Unique to ion mobility mass spectrometry (IMMS) is the ability to provide collision cross section (CCS) data and the capacity to delineate any dissociation and/or unfolding of protein complexes. The strong correlation of the experimentally determined CCS with theory is indicative of the retention of native structure in the gas phase, which in turn, qualifies as a means in evaluating the IM-MS data. The assessment of IM-MS data, however, is currently impeded due to the lack of appropriate structural coordinates to use as input in the in silico calculation of theory. To address this issue, this study involves the use of rapid protein threading predictor (RAPTOR) to generate tertiary structures of closely related monomeric chemokines (MCP-1, MCP3, MCP-4, and eotaxin) and, subsequently, utilize these models to estimate the theoretical values. Experimental CCS of both the model proteins and chemokines correlate well with theory generated by RAPTOR. All conformations for z = 5+ of chemokines fall within theoretical limits. Of the four chemokines, MCP-4 with z = 6+ appears to adopt an extended conformation, while eotaxin gradually unfolds, and the extended structures of MCP-1 and MCP-3 increase in abundance upon activation. Combining RAPTOR with IM-MS and collision-induced dissociation (CID) enables us to interrogate the conformations of homologous proteins with very similar tertiary structures.

M

the theoretical calculation, which are mainly dependent on either NMR or X-ray crystallography methods. Although there has been a considerable increase in the protein structures deposited in the database (∼77 000 to date), a significant number of structures have yet to be unambiguously elucidated. Limitations of the current techniques include (1) the need for a substantial amount of highly purified sample for NMR, (2) financial cost, (3) time requirement (months to years), and (4) difficulty in obtaining good quality crystals for X-ray crystallography. Oftentimes, PDB files from the database are composed of incomplete coordinates, and in some cases, the structural data from X-ray crystallography and NMR are not in good agreement. Consequently, the assessment of IM-MS data is hampered by the lack of well-suited models for theoretical calculations. A recent review emphasizes a need for more accurate structures that can be used to determine theoretical cross sections.8 Clearly, obtaining accurate models is paramount to the improvement of proper assessment and data interpretation and, consequently, in the advancement of the applications of IM-MS for studying protein structures and dynamics. As IMMS is still gaining momentum, this laboratory and others are developing strategies to establish new methodologies that can

ass spectrometry (MS) has evolved as a useful tool in structural biology, and it can be used to analyze intact protein assemblies that, when prepared and electrosprayed properly, can retain their native structures upon transfer from solution into the gas phase.1−5 Stoichiometry, topology, dynamics, stability, and composition can therefore be determined using these novel “native” mass spectrometry techniques. Over the past several years, MS has been demonstrated to complement, or even surpass at times, some of the more traditional biophysical methods, such as cryoelectron microscopy, X-ray crystallography, and nuclear magnetic resonance (NMR). Recently, MS has been coupled to ion mobility permitting simultaneous examination of a protein’s mass, conformation, and composition. Separation of mass, charge, and shape in ion mobility mass spectrometry (IM-MS) using a stacked ring ion guide cell geometry is based on the ion’s capacity to traverse a gas-filled cell under the influence of repeating pulsed voltages.6 Traveling wave ion mobility mass spectrometry (TWIM-MS) has the ability to provide collision cross section (CCS) data, accomplished by calibrating the T-wave drift times using protein calibrants with known CCS.7 Correlation between the experimental and theoretical CCS is an indication that the folded structure of a protein is retained in the gas phase and is, therefore, critical in the data evaluation. As a general practice, the atomic coordinates in a database (RCSB Protein Databank (PDB)) are widely used as input in © 2012 American Chemical Society

Received: December 2, 2011 Accepted: February 20, 2012 Published: February 21, 2012 3208

dx.doi.org/10.1021/ac2030249 | Anal. Chem. 2012, 84, 3208−3214

Analytical Chemistry

Article

(Thermo Fisher Scientific, Wilmington, DE, U.S.A.). Nativelike solutions (8−15 pmol/μL) were kept on ice until analysis. Nano-ESI-TWIM/MS Analysis and Collisional Activation. All nano-ESI-TWIM/MS analyses were performed using a T-wave ion mobility mass spectrometry system, Synapt G2 HDMS (Waters Corp., Manchester, U.K.).21−23 The system was equipped with a nanolockspray source, a borosilicate glass capillary sprayer, a T-wave ion guide, a quadrupole, and a tri Twave region (trap, IM, and transfer). The tri T-wave region was embedded between the quadrupole and the orthogonal acceleration reflectron time-of-flight (oa-TOF) mass analyzer. For each analysis, a 3−6 μL aliquot of the protein solution was loaded onto custom-made borosilicate glass nanotips (1.0 mm o.d., 0.78 mm i.d.), pulled, and gold-coated as described previously.24 Nano-ESI was accomplished by application of 0.6−0.8 kV capillary voltages via a conductive elastomer. The acquisition parameters were adjusted to maintain the native state of ions without compromising the transmission efficiency and mass accuracy: cone voltages, 5−7 V; extractor voltages, 1− 2 V; trap direct current (dc) bias, 40 V; trap collision energy (CE), 3 V; transfer CE, 0 V. IM separation was carried-out using a wave velocity of 700 m/s, a wave height of 40 V and nitrogen as the buffer gas (flow rate, 90 mL/min;, IM cell pressure, ∼3 mbar). Mass spectra were externally calibrated using 2 μg/μL cesium iodide in 50% (v/v) aqueous acetonitrile. Acquisitions were performed in the positive ion mode for 3 min from m/z 500−5000 at 1 scan/s. For collisional activation, the mass-to-charge selected precursor ions at the quadrupole region were activated within the trap cell region (argon gas flow rate, 3 mL/min) by gradually increasing the trap CE. The activated ions were then pulsed and collisionally cooled by helium prior to injection into the IM cell. Acquisition parameters were the same as those specified above.

be used as a dependable tool in structural biology. With the advances in structure prediction,9 IM-MS is becoming a promising tool in reliably generating three-dimensional (3D) protein structures. For example, homology modeling combined with IM-MS was used to produce atomic-resolution models of protein complexes.10 More recently, a method integrating computational methods, IM-MS and incomplete X-ray structures was implemented to generate models of multimeric protein complexes.11 In addition to providing CCS data, IM-MS can furnish additional information about the protein’s conformational stability and detailed structural features through collisioninduced dissociation (CID). It is now recognized that the CID mechanism of noncovalently bound assemblies generally entails the gradual unfolding of a single subunit, followed by its ejection from the complex.12−15 Proton transfer occurs concomitantly as the subunit unfolds, resulting in highly charged subunits proportional to surface area yet disproportionate to mass. Dissociation behaviors and specific noncovalent features of protein ensembles have been successfully studied using IM-MS coupled with CID.16−18 In this report, we demonstrate a strategy employing rapid protein threading predictor (RAPTOR) structure prediction to obtain structural models that strongly correlate with experimental CCS. RAPTOR is capable of providing 3D structures based on homology modeling and protein threading. 19 Homology modeling relies on sequence alignment only, whereas protein threading is based upon structure−sequence homology. It has three algorithms: no core, nonpairwise (NP) core, and integer programming (IP). RAPTOR’s approach essentially involves the following: given the amino acid sequence of a protein, it evaluates the sequence to identify conserved and probable secondary structures. It then scans its built-in PDB-based template library, aligns the query sequence against the library, and finally, ranks and identifies the best template using rigid statistical measures. Part of the scoring mechanism of RAPTOR incorporates sequence homology, secondary structure, the exposed/buried state, and the interaction between two residues in 3D space. RAPTOR structure prediction in combination with IM-MS and CID has permitted us to interrogate the conformations of closely related monomeric chemokines (MCP-1, MCP-3, MCP-4, and eotaxin). We show that RAPTOR provides accurate coordinates from which to calculate theoretical CCS, which in turn match well with experimentally measured values. Comparison with other prediction algorithms favors RAPTOR, whereas CID allowed us to assess stability of this important class of cytokines. Utilization of IM-MS, CID, and RAPTOR as an ensemble of techniques will allow investigators to make assessments on how these chemokines present themselves to glycosaminoglycans (GAG), thus enabling signaling through the corresponding receptors.



RESULTS AND DISCUSSION Collision Cross Section. Ion mobilities and collision cross sections (Ω) are obtained in drift tube mobility separators using the Mason−Schamp law,25 expressed below: 1/2 3q ⎡ 2π ⎤ 1 Ω= ⎢ ⎥ 16N ⎣ μk bT ⎦ K

(1)

where K is the ion mobility, q is the ion charge, N is the gas number density, μ is the reduced mass of the ion (M) and buffer gas (m) given by (mM/(m + M)), kb is the Boltzmann’s constant, and T is the absolute temperature. Unlike drift tube mobility systems where drift time correlates linearly to mobility, drift time in traveling wave IM scales quadratically (as the inverse square of mobility).26 Although the traveling wave IM is more complicated than traditional mobility separator systems because of its configuration and the nonuniform field, TWIM-MS has been demonstrated in the literature to provide CCS values that are in excellent agreement with those derived from drift tube mobility system and/or theoretical models after calibration of the T-wave device.27−32 In this study, the calibration was accomplished using denatured myoglobin with known CCS values as calibrant.7,33 The drift times corresponding to each charge state were corrected to consider only the time spent in the cell (reduced mobility, td”). The CCS of calibrant ions were corrected for both the charge state and reduced mass (corrected CCS). The corrected CCS values were plotted against td” and fitted to a



EXPERIMENTAL SECTION Chemokines Preparation. MCP-1, MCP-3, MCP-4, and eotaxin were expressed, isolated, and purified as described previously.20 Protein stock solutions (∼60−100 pmol/μL) were buffer-exchanged twice in 100 mM ammonium acetate (NH4OAc), pH 6.8 using Micro Bio-Spin columns prepacked with Bio-Gel P-6 (BioRad, Hercules, CA, U.S.A.) prior to nanoelectrospray ionization (nano-ESI) TWIM/MS analyses. Protein concentrations were estimated using a Nanodrop UV spectrophotometer, absorbance measurement at 280 nm 3209

dx.doi.org/10.1021/ac2030249 | Anal. Chem. 2012, 84, 3208−3214

Analytical Chemistry

Article

Table 1. Collision Cross Section Determinations of Standard Proteins theoretical CCS (Å2)a standard protein

PDB ID

bovine carbonic anhydrase

1V9l (X-ray)

chicken lysozyme

1GXX (NMR)

equine cytochrome c

1HRC (X-ray)

bovine ubiquitin

1V81 (NMR)

experimental CCS (Å2)

method

RCSB PDB

RAPTOR

unpublished data from this labb

literature

PA EHSS PA EHSS PA EHSS PA EHSS

1893 2420 1171 1450 1056 1318 907 1127

1880 2412 1161 1448 1005 1249 763 925

2060 (9+)

2004 (9+)c

1299 (6+)

1333 (6+)d

1226 (5+)

1238 (5+)d 1217 (5+)e 791 (4+)d

857 (4+)

a

CCS are estimated using projection approximation (PA) and exact hard-sphere scattering (EHSS) methods. The PDB files used as input are from the RCSB PDB (pdb files are indicated next to the protein) and from RAPTOR structure prediction software. bCCS are determined using nano-ESITWIM/MS. Denatured equine myoglobin was used as calibrant. cRef 32. dRef 31. eRef 38.

PDB web portal as a java application. FATCAT method takes into account the conformational flexibility, providing a more accurate comparison. Generally, the algorithm first identifies compatible aligned fragment pairs (AFP), which are based on the similarities in local geometry prior to all backbone (Cα) alignment. The overall similarity between the two structures is given a root-mean-square deviation (rmsd) value. Structural assessment was also performed using protein structure analysis (ProSA)37 to recognize any errors in the model. The program provides a z-score value to describe the overall quality. Assessment and Validation of Theoretical CCS. CCS data of standard proteins were also examined to validate the RAPTOR-derived theoretical values. The theoretical CCS listed in Table 1 show good correlation with experimental (from this laboratory and from literature31,32,38) and/or theoretical values calculated using the PDB files from the database. However, as mentioned earlier, it is important to note that a few of these PDB files from the database have either incomplete atomic coordinates or have additional residues arising from cloning artifacts (e.g., in bovine carbonic anhydrase, BCA). Additionally, theoretical values from RAPTOR-derived models were compared to other structure prediction programs such as Phyre39 and I-TASSER.40 Among the three prediction programs, only the CCS values calculated from RAPTOR for chemokines, MCP-1, and eotaxin correlated the best with the experimental CCS as shown in Table S2 (Supporting Information). For BCA, both RAPTOR- and I-TASSERderived structures were in good agreement with experimental CCS. Tertiary Structures of Chemokines and Model Evaluations. Chemokines are a family of small (8−12 kDa), secreted chemotaxis-inducing proteins involved in many homeostatic, pathological, and inflammatory processes.41,42 These proteins (∼50 in humans) are generally classified into four subfamilies according to the position of the N-terminal cysteine residues: C, CC, CXC, and CX3C, where X is any amino acid.43 The proteins’ sequence and the predominant receptors are listed in Table S1 (Supporting Information). These chemokines share a significant sequence similarity (>50%); hence, the secondary and tertiary structures are expected to be similar. All RAPTOR-predicted structures adopt the typical fold of monomeric CC chemokines: a highly flexible N-terminal domain followed by an extended N-loop, three antiparallel β pleated sheets, two intramolecular disulfide bonds, and a C-terminal α helix.42 To evaluate the structures, comparisons were made between the RAPTOR structures and the structures in the RCSB PDB database. The structure

power series curve. The drift times for the target charge state were also normalized followed by the calculation of experimental CCS using the power series equation. Identical parameters were applied for both the calibrant and chemokines. CCS calibration curves covered nearly the entire drift times and mass-to-charge of chemokine ions with correlation coefficients >0.998 (Figure S1 in the Supporting Information). All measurements were made in triplicate with