Two-Dimensional Linear Dichroism Spectroscopy for Identifying

Feb 15, 2017 - Noelle M. do Nascimento , Augusto Juste-Dolz , Paulo R. Bueno , Isidro Monzó , Roberto Tejero , José L. Lopez-Paz , Angel Maquieira ,...
1 downloads 0 Views 2MB Size
Subscriber access provided by University of Newcastle, Australia

Letter

Two Dimensional Linear Dichroism Spectroscopy for Identifying Protein Orientation and Secondary Structure Composition Guozhen Zhang, Jun Li, Peng Cui, Tao Wang, Jun Jiang, and Oleg V. Prezhdo J. Phys. Chem. Lett., Just Accepted Manuscript • DOI: 10.1021/acs.jpclett.7b00311 • Publication Date (Web): 15 Feb 2017 Downloaded from http://pubs.acs.org on February 16, 2017

Just Accepted “Just Accepted” manuscripts have been peer-reviewed and accepted for publication. They are posted online prior to technical editing, formatting for publication and author proofing. The American Chemical Society provides “Just Accepted” as a free service to the research community to expedite the dissemination of scientific material as soon as possible after acceptance. “Just Accepted” manuscripts appear in full in PDF format accompanied by an HTML abstract. “Just Accepted” manuscripts have been fully peer reviewed, but should not be considered the official version of record. They are accessible to all readers and citable by the Digital Object Identifier (DOI®). “Just Accepted” is an optional service offered to authors. Therefore, the “Just Accepted” Web site may not include all articles that will be published in the journal. After a manuscript is technically edited and formatted, it will be removed from the “Just Accepted” Web site and published as an ASAP article. Note that technical editing may introduce minor changes to the manuscript text and/or graphics which could affect content, and all legal disclaimers and ethical guidelines that apply to the journal pertain. ACS cannot be held responsible for errors or consequences arising from the use of information contained in these “Just Accepted” manuscripts.

The Journal of Physical Chemistry Letters is published by the American Chemical Society. 1155 Sixteenth Street N.W., Washington, DC 20036 Published by American Chemical Society. Copyright © American Chemical Society. However, no copyright claim is made to original U.S. Government works, or works produced by employees of any Commonwealth realm Crown government in the course of their duties.

Page 1 of 7

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

The Journal of Physical Chemistry Letters

Two Dimensional Linear Dichroism Spectroscopy for Identifying Protein Orientation and Secondary Structure Composition Guozhen Zhang,†,1 Jun Li,†,1 Peng Cui,1 Tao Wang,1 Jun Jiang,*,1 Oleg V. Prezhdo2 1

Hefei National Laboratory for Physical Sciences at the Microscale, Collaborative Innovation Center of

Chemistry for Energy Materials, School of Chemistry and Materials Science, University of Science and Technology of China, Hefei, Anhui 230026, P. R. China. 2

Departments of Chemistry and Physics and Astronomy, University of Southern California, Los Angeles, California 90089, United States.

Abstract: Quantitative measurements of protein orientation and secondary structure composition are of great importance for protein biotechnology applications and disease treatments, and yet they are technically challenging for a spectroscopic study. Based on quantum mechanics/molecular mechanics simulations, we demonstrate that two dimensional (2D) linear dichroism spectroscopy is capable of probing the direction of α-helix motifs in proteins. Compared to the conventional linear dichroism (LD) spectra, 2D spectra double the measurable range of orientation of secondary structures. In addition, by calculating the ratio of transverse ππ* signals to longitudinal ππ* signals in 2D spectra, we can achieve quantitative measurement of the fraction of α-helix content in a protein.

Protein structure determination is crucial for understanding the physiological activities of proteins and their biotechnology applications due to close ties between function and structure.1 Especially, knowledge of dynamic protein orientation and secondary structure composition plays a key role in studying fibrillationinduced diseases, ion channel proteins, membrane protein topology, and bioelectronics.2-6 Determining amyloid and amyloid-like fibril structures and their direction of growth is indispensable for disclosing the disease etiology and casting insight into treatment and prevention of many serious diseases, such as Alzheimer’s disease, Huntington’s disease, and genomic CAG repeats.2-3 Deep understanding of how an ion channel works requires investigating the orientation changes of the ion channel protein in response to changes in its environment. 4 Studying biological behavior of membrane proteins relies on knowledge of their topology, including their orientation relative to the membrane.5 For bioelectronics, manipulation of charge transport within a protein transistor requires precise control of protein orientation in the molecular junction.6 In recent years, several groups have developed new techniques that can be used for studying protein orientations. Stupp et al. have achieved sophisticated fibril orientation control through X-ray triggered self-assembling, making it possible to guide amyloid fibril growth to examine the dependence of fibril function on orientation.7 Ye et al. have reported a spectroscopic study of ion channel gating action, in which they tuned membrane potential and pH value to alter the orientation of a model ion channel.4 Richards et al. have developed a simple method to preserve orientation of membrane proteins during protein transfer from cell to supported bilayers using cell blebs as an intermediate.8 Chen et al. realized orientation control of a protein using chemical binding between the residue of the protein and the monolayer on a metal electrode surface.6 Proteins involved in the above mentioned studies typically are dynamic, non-crystallized, or insoluble (for fibrils). These

conditions pose a great challenge for making quantitative measurements or for control of protein real-time orientation and structural composition using conventional experimental tools, such as atomic force microscopy (AFM), X-ray diffraction (XRD) or nuclear magnetic resonance (NMR).1, 9 Great efforts have been made to develop alternative spectroscopic approaches (e.g. Fourier transform IR, sum frequency generation, and polarized Raman) that are able to detect protein orientation by measuring the angular dependence of a specific spectral signal on the basis of a reference coordinate frame that is defined in advance.4, 10-17 Albeit substantial progress has been made in the measurement of protein orientation using spectroscopic techniques, due to limitations peculiar to each method, no single method is capable of tackling all tasks required to measure protein orientation. Therefore, there is enough space for new methods to complement these extant techniques for exploring protein conformation and orientation. Polarized ultraviolet (UV) and visible Raman spectroscopies are useful in determining protein sidechain orientation by probing the transition moment of a specific amino acid,18-19 but the difficulty of assigning every chromophore hinders the identification of the global structure of a protein. Linear dichroism (LD) spectroscopy is a powerful approach for determining the global orientation of proteins. Indeed, femtosecond LD spectroscopy has been successfully applied to biomolecules and inorganic surfaces.20-22 Recent flow LD studies have shown exceptional results for measuring the orientation of protein secondary structural motifs, the structure and orientation of membrane proteins interacting with guest molecules and of amyloid fibrils, and for determining how agents dock with enzymes or fibers.23 However, local information is often missing due to sensitivity to interactions with adjacent chromophore transitions. To study local and global structures simultaneously, it is necessary to go beyond the conventional one-dimensional spectroscopies. Twodimensional (2D) resonant laser spectroscopy is capable of achieving

ACS Paragon Plus Environment

1

The Journal of Physical Chemistry Letters

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

the goal by monitoring vibrational and electronic excitations of molecules and their complexes.24 2D spectroscopies provide more information about structure and dynamics than 1D spectroscopies due to high sensitivity to electronic correlations and ultrafast electronic dynamics in proteins and other nano systems.25 Recent advances in intense UV femtosecond lasers with high repetition rates have led to successful capture of protein dynamic pictures by the 2D UV spectroscopy (2DUV).26 Because both LD and 2D spectroscopic techniques do not require crystallization of samples and are highly sensitive to the structural changes of proteins, one can use them to probe protein conformation and orientation as a complement to the existing techniques. In this work, by combining the advantages of the LD and 2D UV spectroscopies, we propose a new technique termed two-dimensional linear dichroism spectroscopy (2DLD). Similarly to LD, the signals of 2DLD are defined as the difference between the 2DUV signals produced by two perpendicular polarization configurations.25 We also define 2DRD as the ratio between the transverse and longitudinal 2D response signals of α-helix protein motifs. 2DLD and 2DRD respectively aim at the quantitative measurement of orientations of the secondary structures of proteins and the proportional occurrence of different secondary structural motifs under specific (e.g. dynamic, non-crystallized, or insoluble) conditions, expanding the power of 2D techniques in the area of protein structural determination. Two peptide bond transitions of n→π*(denoted as nπ* below) at 45500 cm-1 (~220 nm) and π→π* (denoted as ππ* below) at 53000 cm-1 (~190 nm), as depicted by Figure 1A, are the primary components of the ultraviolet adsorption spectra of the protein backbone.27 Polarization of the backbone transitions due to inter- and intra-molecular interactions causes spectral signals to be dependent on backbone orientation.28 Thus, the net polarization of transitions in proteins is strongly affected by the secondary structures that determine the allowed combinations of coupled local transitions. Other than random coil motifs, the most common ordered secondary structures are α-helices and β-sheets. Here, we ignore nπ* transitions which normally induce much weaker signals than ππ* transitions. It is known that amide ππ* transitions in α-helices and β-sheets have different orientational dependencies. Therefore, we can choose the ππ* transition as the probe in this study. Due to the Davydov splitting28 effect in α-helices, ππ* transitions are split into transitions polarized parallel to the helical axis (longitudinal transition) at the lower energy 48000 cm-1 (~210 nm), and transitions polarized perpendicular to the helical axis (transverse transition) at the higher energy 53000 cm-1 (~190 nm). Figure 1B displays a typical α-helical protein structure in the xyz coordinate space. The two ππ* transition components polarized parallel and perpendicular to the α-helical axis are defined as µL and µT, respectively. Meanwhile, the ππ* transition in β-sheets is characterized by µT alone, because of the lack of the parallel component of this transition. Figure 1. (A) UV transitions of a peptide bond on a protein backbone. (B) Schematic representation of typical α-helix and β-sheet proteins in the xyz coordinate space, in which the z axis represents the orientation axis.

LD is the difference between the absorbance of light (LA) polarized respectively parallel and perpendicular to a specific orientation. Taking the z axis in Figure 1B as the orientation axis,

Page 2 of 7

absorption of light polarized in the z direction is the parallel LA (A||), while the average of the absorbances in the x and y directions accounts for the perpendicular LA (A⊥), equation (1) below. The orientations of the α-helix and β-sheet proteins with respect to the z axis are described by the angle θ, as shown in Figure 1B. The LDs of both motifs depend on the longitudinal (L) and transverse (T) transition dipole moments µL and µT, and consequently, have a 2θ dependence (See equation (2) and supporting information for details). A 2DUV signal is measured using four coherent short laser pulses, labeled by their wavevectors k1, k2, k3, k4. In the direction of k4 = -k1 + k2 + k3, the 2DUV signal is detected as a function of three delay times: t1 between pulses k1 and k2, t2 between pulses k2 and k3, and t3 between pulses k3 and k4. The 2D echo signals are determined by the two-dimensional Fourier transforms t1→Ω1 and t3→Ω3 with t2 set to zero. Calculations were performed using the protocol given in our previous work29-32 for the two polarization configurations zzzz and xxxx (i.e. four laser pulses with all polarizations along the z or x axis), and provide non-chiral 2DUV zzzz and xxxx spectra. More details of calculating the 2D photon echo signals are described in the supporting information. We then define the 2D linear dichroism (2DLD) spectrum as the difference between the 2DUV signals with zzzz and xxxx polarizations:

1 LD = A|| − A⊥ = Az − ( Ax + Ay ) 2 LD(µT ) ∝ µT2 (sin 2 θ − cos 2 θ ) ∝ − µT2 cos 2θ

(2)

LD(µ L ) ∝ − µ L2 (sin 2 θ − cos 2 θ ) ∝ µ L2 cos 2θ We find that the 2DLD signals are functions of cos2θ (see equation (3) and supporting information), which can be translated into protein orientation. Moreover, the sidebands resulting from coupling of different signals are dependent on orientation as well. Most proteins contain both α-helices and β-sheets. To characterize a protein with content fraction γ of α-helix structure, we calculate 2DRD, the ratio between the transverse ππ* signals at 53000 cm-1 and the longitudinal ππ* signals at 48000 cm-1 in the 2DUV zzzz spectra (shorted as T/L ratio):

2DRD =

2DUVzzzz ,T 2DUVzzzz , L

=

µT4 sin 4 θ + η1 γ µ L4 cos 4 θ + η2

(4)

4

where η1 and η2 represent the background signals that might be induced by the resonant tail of protein backbone and aromatic side-

2DLD = 2DUVzzzz − 2DUVxxxx 2DLD(µT ) ∝ µT4 (sin 4 θ − cos 4 θ ) ∝ − µT4 cos 2θ 4 L

4

4

(3)

4 L

2 DLD( µ L ) ∝ − µ (sin θ − cos θ ) ∝ µ cos 2θ chain transitions. We have studied the UV spectra of the following proteins: two αhelix-only proteins – tropomyosin (PDB code: 2d3e) and hemoglobin (PDB code: 1hda), two β-sheet only proteins – crystals of amyloid-like fibrils (PDB code: 1yjp) and lentil lectin (PDB code: 1les), two proteins containing both α-helices and β-sheets – FtsZ (PDB code: 1fsz) and pyruvate kinase (PDB code: 1mol), and the 32residue β-amyloid (Aβ9-40) fibril. The X-ray crystal structures were taken from the RSCB protein data bank and used as the starting geometry for each protein. To reduce the computation cost, we extracted essential fragments of these proteins for molecular simulations. For example, we trimmed a fragment of two helix chains (TROP1: SER215-LYS264 of chain A and B in 2d3e.pdb) taken from tropomyosin (2d3e), and a fragment of the single helix chain from hemoglobin (1hda), respectively, to investigate the

ACS Paragon Plus Environment

2

Page 3 of 7

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

The Journal of Physical Chemistry Letters

angular dependence of the 2DLD spectra on the orientation of the pure α-helix peptides. We have used a computational protocol based on an efficient quantum mechanics/molecular mechanics (QM/MM) algorithm called "exciton Hamiltonian with electrostatic fluctuations" (EHEF).29 In this theoretical framework for the 2DUV signal, the ππ* transitions of the protein backbone are described by the Frenkel exciton model (equation 5) derived from the Heitler-London and adiabatic approximations.33

where ma is the ath electronic transition on the mth peptide unit, (in our case, a = 2 for ππ*). B̂ma† is the creation operator that promotes the mth peptide unit into the ath excited state, and B̂ma is the conjugated annihilation operator. This couple of operators comply with the Pauli commutation rules [B̂ma, B̂nb†] = δmn(1 − 2B̂mb†B̂ma). The ground-state is represented by |0>, and its energy is given by = 0. In the case of single exciton, the mth singly-excited state energy is given as = εma, and the resonant coupling between two different excited states m and n is given by = Jma, nb. (See more description in the supporting information.) EHEF provides an interface for reading molecular dynamics (MD) simulation trajectories and generating QM charge distributions. MD simulations of proteins in aqueous solution were performed using the NAMD 2.7 software package34 with the CHARMM force field35. The protein of interest was set in the center of a rectangular simulation box under periodic boundary condition, and was surrounded by 10 Å thick layers of TIP3P36 water on all sides in three dimensions. Long-range electrostatic interactions were treated with the particle-mesh Ewald (PME) method.37 The real space cutoff for non-bonded interactions was set to 12 Å. The MD evolution time step was 1 fs. All MD simulations started with a 600 ps heating from 0 to 310 K that followed a 5000 step energy minimization. After a 2 ns equilibration run in the NPT ensemble at T = 300 K and P = 1 atm, we generated a 16 ns NPT trajectory, and recorded structures every 400 fs. The resulting NPT ensemble of MD geometric snapshots was used to compute the effective UV Hamiltonian and spectra. Parameters of the transition energies of the resonant couplings, and the electric and magnetic dipole moments for an isolated protein backbone amide units were extracted from the DichroCalc package,38 which itself is based on the excited states of Nmethylacetamide (NMA) computed by CASSCF/SCRF39 (the complete-active space self-consistent-field method implemented within a self-consistent reaction field) in MOLCAS40. The electronic charge distributions of amino side chains in proteins and surrounding water molecules were extracted from the EHEF program, which itself is based on density functional theory (DFT) calculations at the B3LYP/6-311++G** level with the GAUSSIAN03 package.41 Using the matrix method in the DichroCalc core we constructed effective fluctuating QM Hamiltonians. The SPECTRON code42 developed by the Mukamel group was used to compute the LA and LD spectra based on 2000 MD snapshots; computations of the 2DUV and 2DLD spectra were based on 2000 MD snapshots as well. The accuracy and predictive power of our simulation protocol have been validated previously for various proteins.29-32 There, we compared the computed LA and LD spectra with available experimental data, and achieved good agreements (See supporting information for details). Therefore, we employ the same simulations here. The definition of the orientation is as depicted in Figure 1. The

Z-axis is taken as the direction along the α-helix or β-sheet in the initial state. Using this as a reference point, we define an angle θ to represent the orientation relative to the Z-axis. Thus, θ=0° is for the initial orientation. What matters most is how the 2DLD signal changes in response to changes in θ. If the changes in the signal follow the same pattern for the two species, then the species likely experience similar changes in the orientation. The two pure α-helix protein fragments under investigation are (1) a fragment of two chains (TROP1: SER215-LYS264 of chain A and B in 2d3e.pdb) taken from tropomyosin (PDB code: 2d3e), and (2) a fragment of a single chain taken from hemoglobin (HEMO1, PDB code: 1hda). The 2DUV spectra of TROP1 in different orientations (Figure 2A) have been computed. The first panel in Figure 2B shows the 2DUV zzzz spectra of TROP1 in five orientations (θ=0°, 22.5°, 45°, 67.5°, 90°). For TROP1 along the z axis, the diagonal peak is centered at 48000:48000 cm-1 (i.e. Ω1=Ω3=-48000 cm-1), accompanied by two relatively weak positive cross-peaks (green to red). As the orientation of TROP1 changes from θ=0° to θ=90°, these three main peaks shift from lower to higher energy, with the diagonal peak becoming centered at the 53000:53000 cm-1 point for θ=90°.

Figure 2. (A): The structures of TROP1 as a function of θ, where θ is the angle between the TROP1 central axis and the z axis. (B): The 2DUV zzzz and 2DLD spectra of TROP1 (upper panel) in different orientations, from left to right: θ= 0°, 22.5°, 45°, 67.5°, 90°; and (bottom panel) the T/L ratio of the 2D zzzz signal, as a function of θ. For (C): TROP1, (D): HEMO1, 2D spectra obtained by averaging over 2000 MD snapshots.

The 2DLD signals calculated for TROP1 (Figure 2B) show richer spectral features and are distinctly different from the 2DUV zzzz signals. At θ=0°, there is a strong negative (blue) diagonal peak accompanied by two weak positive (red) sidebands, and a strong positive (red) diagonal peak with two weak negative (blue) sidebands at 48000 cm-1 and 53000 cm-1, respectively. As θ increases from 0° to 90°, the diagonal peaks change sign, while the sidebands experience opposite yet milder sign changes, as compared to the diagonal peaks. The behavior of the 2DLD positive and negative signal peaks is in agreement with that of the LD signals on the x and z axes. Figures 2C and 2D show the variation of the T/L ratio as a function of the orientation angle. There is a good trigonometric functional relation between the T/L ratio and the orientation angle for both TROP1 and HEMO1, showing a tan4θ dependence. In addition, because TROP1 and HEMO1 are both pure α-helix structures, the value of γ in equation (4) is 1, from which it follows that the value of µ T4 / µ L4 is between 0.10~0.11.

ACS Paragon Plus Environment

3

The Journal of Physical Chemistry Letters

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

The 2DUV zzzz and 2DLD spectra of the β-sheet protein protofibril were calculated in five different orientations (θ=0°, 22.5°, 45°, 67.5°, 90°) (Figures 3A and 3B). As the angle increases, the signal peak at 48000 cm-1 in the 2DUV zzzz spectra weakens, and the signal peak at 53000 cm-1 strengthens. This trend is not as distinct as for the 2DUV zzzz spectra of TROP1. Moreover, as θ increases from 0° to 90°, the side peaks of the protofibril show prominent positive signals. The 2DLD spectra exhibit different features than the 2DUV zzzz spectra. The 2DLD spectra show an asymmetrical distribution along the diagonal. At θ=0°, the 2DLD signal has a negative peak on the diagonal near ~48000 cm-1 accompanied by a weak positive side peak, while it has a positive peak on the diagonal near ~53000 cm-1

Figure 3. (A): The structures of protofibril 1yjp as a function of θ, where θ is the angle between the protofibril central axis and z axis. (B): The 2D signals (upper panel) of the protofibril in different orientations, from left to right: θ= 0°, 22.5°, 45°, 67.5°, 90°; and (bottom panel) the T/L ratio of the 2D zzzz signal as a function of θ. For (C): protofibril (1yjp), (D): protofibril (1les-b), 2D spectra obtained by averaging over 2000 MD snapshots

with a weak negative side peak nearby. As θ increases from 0 to 90°, all signals of the 2DLD spectra switch their signs: the negative signal peak on the diagonal near ~48000 cm-1 (due to the π→π* backbone transition) changes from negative to positive, while the positive signal peak near 53000 cm-1 changes from positive to negative. These changes are in line with those observed in the LD. The T/L ratios follow a sin4θ dependence in the two β-sheets as well. Figure 3 shows a clear sin4θ dependence of the T/L ratio in both lyjp and 1les-b. As the helical structure content is 0 for both 1yjp and 1les-b β-sheet proteins, γ is 0 in equation (6). It follows that the µT4 / η 2 value is between 3.05~3.44. This is a consequence of the fact that the Davydov splitting of the amide ππ* transition at ~190nm in β-sheets is only polarized perpendicular to the sheet. Through the study of the pure α-helix and β-sheet structures, we found that the 2DUV zzzz and 2DLD spectra show remarkable dependence on the orientation of the secondary structural motifs. Subsequently, we applied the analysis to two proteins containing both motifs: FtsZ (PDB code: 1fsz) and monellin (PDB code: 1mol). Figures 4A and 4B display the FtsZ structure in five orientations and the corresponding 2D signal spectra, respectively. The 2DUV zzzz

Page 4 of 7

spectral signal has broad negative bands along the diagonal, and its intensity along the diagonal at ~48000 cm-1 decreases with increasing θ. On the contrary, the intensity on the diagonal at ~53000 cm-1 increases with increasing θ. However, the relationship between 2DUV zzzz and θ for FtsZ is not as distinct as it is for either α-helix or β-sheet. The dependence of the 2DLD spectra on orientation (characterized by θ) is readily seen: at θ=0°, a prominent negative signal appears near 48000 cm-1 coupled with a pair of weak positive sidebands, and a prominent positive signal appears near 53000 cm-1, coupled with a pair of weak negative sidebands. From 0° to 90°, negative signals gradually turn to positive, and vice versa. The trends

apparent in Figure 4B follow the same pattern as those in Figure 2B and Figure 3B. This suggests that both α-helixes and β-sheets in FtsZ likely exhibit similar orientation changes, as described in Figures 1-3.

Figure 4. (A): The structures of FtsZ as a function of θ, where θ is the angle between the FtsZ central axis and the z axis. (B): The 2DUV zzzz and 2DLD signals for FtsZ (upper panel) in different orientations, from left to right: θ= 0°, 22.5°, 45°, 67.5°, 90°; and (bottom panel) the T/L ratio of the 2D zzzz signal as a function of θ. For (C): FtsZ (1fsz), and (D): monellin (1mol), 2D spectra obtained by averaging over 2000 MD snapshots.

Next, we studied the variation of the T/L ratio with respect to the orientation angle in two real systems: FtsZ (PDB code: 1fsz) and monellin (PDB code: 1mol). Figures 4C and 4D show a clear tan4θ dependence, the same as that found for the α-helix case. Equation (4) gives the content fraction γ equal to 0.50 in FtsZ and 0.27 in monellin. Comparing these values with the content fractions computed using the DSSP program,43-44 which produces 0.44 in FtsZ, and 0.22 in monellin, we conclude that the 2D signals can predict the helix content with high accuracy. Finally, we investigate amyloid fibrils, which are associated with more than 20 neurodegenerative diseases. The mechanism of amyloid fibril formation involves many intermediate states.30 Here, we choose two model structures of amyloid fibrils: one is composed of six Aβ9-40 molecules (Figure 5A) (denoted as A6 below) and the other contains two Aβ9-40 molecules (denoted as A2 below).

ACS Paragon Plus Environment

4

Page 5 of 7

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

The Journal of Physical Chemistry Letters

For the 2D signals (Figure 5B), both the 2DUV zzzz and 2DLD spectra of A6 show distinct changes in response to the change of orientation. At θ=0°, the 2DUV zzzz spectra are dominated by the negative diagonal ~48000 cm-1 peak accompanied by two very weak positive sidebands. As the angle increases, the signal weakens gradually at 48000 cm-1, while the negative signal intensity on the diagonal at ~53000 cm-1 is strengthened accordingly. At θ=90°, the signal peak intensity and positive sidebands at diagonal 53000 cm-1 all reach a maximum. The 2D zzzz signal at 53000 cm-1 is strengthened with the increase of the angle between the helical axis and the z axis, while the signal at 48000 cm-1 is weakened. At θ=0°, the 2DLD signal has a negative diagonal peak accompanied by two weak positive sidebands at 48000 cm-1 and a positive diagonal peak accompanied by two weak negative sidebands at 53000 cm-1. As the angle between the helical axis and the z axis increases, the feature signal at 48000 cm-1 turns positive, and that at 53000 cm-1 experiences the opposite change, which agrees with the inversion of the LD spectral signal peaks.

associated with orientation of the structures, suggesting a path towards a higher resolution spectroscopy. The reported analysis of the 2D spectroscopic signals as a function of orientation also allows us to obtain the fractional helical content of protein structures. The above results demonstrate the capability of the 2DLD spectroscopy to establish a quantitative relationship between the secondary structure composition (α-helix/β-sheet ratio) and protein orientation. This capability will be the very useful for future studies by allowing one to extract one type of information from other types and making protein secondary structure predictable from spectral information. The demonstrated methodology can both complement the existing spectroscopic techniques for protein structure determination, and open up new windows for probing amyloid fibrils and other structures, their dynamical fluctuations, and associated photochemical and photophysical processes, which are crucial for the understanding and manipulation of protein functionality. We hope that the presented theoretical results will encourage the corresponding 2DLD experimental studies on protein orientation in the near future.

ASSOCIATED CONTENT AUTHOR INFORMATION Corresponding Author *Email: [email protected]

ORCID Jun Jiang: 0000-0002-6116-5605

Notes The authors declare no competing financial interest. Author Contributions

† These authors contributed equally. ACKNOWLEDGMENTS Figure 5. (A): The structures of A6 as a function of θ, where θ is the angle between the FtsZ central axis and the z axis. (B): The 2D signals for A6 (upper panel) in different orientations, from left to right: θ= 0°, 22.5°, 45°, 67.5°, 90°; and (bottom panel) the T/L ratio of the 2D zzzz signal as a function of θ. For (C): A6 Fibril, (D): A2 Fibril, 2D spectra obtained by averaging over 2000 MD snapshots.

Changes in the T/L ratio with the orientation angle, as seen in Figures 5C and 5D, exhibit a tan4θ dependence for both A6 and A2. Insignificant differences are shown in the background signal (A6: η1=1.63, η2=0.11 and A2: η1=1.34, η2=0.06). Combining the results shown in Figures 5C and 5D and equation (4) we obtain γ=0.45. Since the amyloid fibril structure contains two parallel β sheets facing each other, our results suggest that an amyloid fibril can be treated as a semi-half-helix structure. Quantitative determination of protein orientation and secondary structure composition in real-time under non-crystallized or insoluble conditions is an important yet challenging capability for protein structure analysis. In this work, we showed that the 2DLD spectroscopy provides a significant advance towards this goal, because the 2DLD signals can be used to monitor the changes in orientation of helical, sheet, and amyloid fibril protein structures in solution. Different orientation dependencies for different structural motifs have been established simultaneously. Compared to onedimensional spectra, 2D spectra provide us with more information

This work was financially supported by the 973 Program (No. 2014CB848900), NSFC (No. 21633006), CAS Strategic Priority Research Program B (No. XDB01020000), Hefei Science Center CAS (2015HSC-UP011). OVP acknowledges support of the US National Science Foundation, grant no. CHE-1565704.

Supporting Information Available Linear absorption (LA) and linear dichroism (LD), exciton model for the ππ* transitions of the protein backbone, theory and simulation of the 2DUV signal, theory of 2DLD and 2DRD, and LA and LD spectra of the α-helix proteins, β-sheet proteins, mixed α-helix and βsheet proteins, and amyloid fibril protein. This material, is available free of charge via the Internet at http://pubs.acs.org

REFERENCES 1. Whitford, D. Proteins : structure and function. J. Wiley & Sons: Hoboken, NJ, 2005; p xiv, 528 p. 2. Buchanan, L. E.; Carr, J. K.; Fluitt, A. M.; Hoganson, A. J.; Moran, S. D.; de Pablo, J. J.; Skinner, J. L.; Zanni, M. T. Structural motif of polyglutamine amyloid fibrils discerned with mixed-isotope infrared spectroscopy. Proc. Natl. Acad. Sci. U. S. A. 2014, 111 (16), 5796-5801. 3. Sawaya, M. R.; Sambashivan, S.; Nelson, R.; Ivanova, M. I.; Sievers, S. A.; Apostol, M. I.; Thompson, M. J.; Balbirnie, M.; Wiltzius, J. J. W.; McFarlane, H. T.; et al. Atomic structures of amyloid cross-beta spines reveal varied steric zippers. Nature 2007, 447 (7143), 453-457.

ACS Paragon Plus Environment

5

The Journal of Physical Chemistry Letters

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

4. Ye, S. J.; Li, H. C.; Wei, F.; Jasensky, J.; Boughton, A. P.; Yang, P.; Chen, Z. Observing a Model Ion Channel Gating Action in Model Cell Membranes in Real Time in Situ: Membrane Potential Change Induced Alamethicin Orientation Change. J. Am. Chem. Soc. 2012, 134 (14), 6237-6243. 5. von Heijne, G. Membrane-protein topology. Nat. Rev. Mol. Cell Biol. 2006, 7 (12), 909-918. 6. Chen, Y. S.; Hong, M. Y.; Huang, G. S. A protein transistor made of an antibody molecule and two gold nanoparticles. Nat. Nanotechnol. 2012, 7 (3), 197-203. 7. Cui, H.; Pashuck, E. T.; Velichko, Y. S.; Weigand, S. J.; Cheetham, A. G.; Newcomb, C. J.; Stupp, S. I. Spontaneous and xray-triggered crystallization at long range in self-assembling filament networks. Science 2010, 327 (5965), 555-9. 8. Richards, M. J.; Hsia, C. Y.; Singh, R. R.; Haider, H.; Kumpf, J.; Kawate, T.; Daniel, S. Membrane Protein Mobility and Orientation Preserved in Supported Bilayers Created Directly from Cell Plasma Membrane Blebs. Langmuir 2016, 32 (12), 2963-2974. 9. Shibata, M.; Yamashita, H.; Uchihashi, T.; Kandori, H.; Ando, T. High-speed atomic force microscopy shows dynamic molecular processes in photoactivated bacteriorhodopsin. Nat. Nanotechnol. 2010, 5 (3), 208-212. 10. Thielges, M. C.; Fayer, M. D. Protein Dynamics Studied with Ultrafast Two-Dimensional Infrared Vibrational Echo Spectroscopy. Acc. Chem. Res. 2012, 45 (11), 1866-1874. 11. Oladepo, S. A.; Xiong, K.; Hong, Z.; Asher, S. A.; Handen, J.; Lednev, I. K. UV Resonance Raman Investigations of Peptide and Protein Structure and Dynamics. Chem. Rev. 2012, 112 (5), 26042628. 12. Christov, C. Z. Biomolecular spectroscopy : advances from integrating experiments and theory. First edition. ed.; Elsevier: Oxford, 2013; p 1 online resource (349 pages). 13. Roy, S.; Covert, P. A.; FitzGerald, W. R.; Hore, D. K. Biomolecular Structure at Solid–Liquid Interfaces As Revealed by Nonlinear Optical Spectroscopy. Chem. Rev. 2014, 114 (17), 83888415. 14. Yang, H.; Yang, S.; Kong, J.; Dong, A.; Yu, S. Obtaining information about protein secondary structures in aqueous solution using Fourier transform IR spectroscopy. Nat. Protocols 2015, 10 (3), 382-396. 15. Sereda, V.; Sawaya, M. R.; Lednev, I. K. Structural Organization of Insulin Fibrils Based on Polarized Raman Spectroscopy: Evaluation of Existing Models. J. Am. Chem. Soc. 2015, 137 (35), 11312-11320. 16. Ding, B.; Panahi, A.; Ho, J. J.; Laaser, J. E.; Brooks, C. L.; Zanni, M. T.; Chen, Z. Probing Site-Specific Structural Information of Peptides at Model Membrane Interface In Situ. J. Am. Chem. Soc. 2015, 137 (32), 10190-10198. 17. Donovan, M. A.; Yimer, Y. Y.; Pfaendtner, J.; Backus, E. H. G.; Bonn, M.; Weidner, T. Ultrafast Reorientational Dynamics of Leucine at the Air-Water Interface. J. Am. Chem. Soc. 2016, 138 (16), 5226-5229. 18. Asakawa, K.; Masuda, S.; Takeuchi, H. Indole ring orientations of Trp189 in the ground and M intermediate states of bacteriorhodopsin as studied by polarized UV resonance Raman spectroscopy. J. Raman Spectrosc. 2006, 37 (1-3), 255-262. 19. Pajcini, V.; Asher, S. A. Preresonance Raman single-crystal measurements of electronic transition moment orientations in Nacetylglycinamide. J. Am. Chem. Soc. 1999, 121 (47), 10942-10954. 20. Onfelt, B.; Lincoln, P.; Norden, B.; Baskin, J. S.; Zewail, A. H. Femtosecond linear dichroism of DNA-intercalating chromophores: solvation and charge separation dynamics of [Ru(phen)2dppz]2+ systems. Proc. Natl. Acad. Sci. U. S. A. 2000, 97 (11), 5708-5713. 21. Rehault, J.; Zanirato, V.; Olivucci, M.; Helbing, J. Linear dichroism amplification: adapting a long-known technique for ultrasensitive femtosecond IR spectroscopy. J. Chem. Phys. 2011, 134 (12), 124516. 22. Chen, K.; Li, H.; Ma, L.-P.; Ren, W.; Zhou, J.-Y.; Cheng, H.M.; Lai, T. Ultrafast linear dichroism-like absorption dynamics in

Page 6 of 7

graphene grown by chemical vapor deposition. J. Appl. Phys. 2014, 115 (20), 203701. 23. Bulheller, B. M.; Rodger, A.; Hicks, M. R.; Dafforn, T. R.; Serpell, L. C.; Marshall, K. E.; Bromley, E. H. C.; King, P. J. S.; Channon, K. J.; Woolfson, D. N.; et al. Flow Linear Dichroism of Some Prototypical Proteins. J. Am. Chem. Soc. 2009, 131 (37), 13305-13314. 24. Mukamel, S.; Tanimura, Y.; Hamm, P. Coherent Multidimensional Optical Spectroscopy. Acc. Chem. Res. 2009, 42 (9), 1207-1209. 25. Jiang, J.; Mukamel, S. Two-Dimensional Ultraviolet (2DUV) Spectroscopic Tools for Identifying Fibrillation Propensity of Protein Residue Sequences. Angew. Chem. Int. Ed. 2010, 49 (50), 96669669. 26. Consani, C.; Aubock, G.; van Mourik, F.; Chergui, M. Ultrafast Tryptophan-to-Heme Electron Transfer in Myoglobins Revealed by UV 2D Spectroscopy. Science 2013, 339 (6127), 15861589. 27. Jiang, J.; Golchert, K. J.; Kingsley, C. N.; Brubaker, W. D.; Martin, R. W.; Mukamel, S. Exploring the Aggregation Propensity of gamma S-Crystallin Protein Variants Using Two-Dimensional Spectroscopic Tools. J. Phys. Chem. B 2013, 117 (46), 14294-14301. 28. Moffitt, W. Optical Rotatory Dispersion of Helical Polymers. J. Chem. Phys. 1956, 25 (3), 467-478. 29. Jiang, J.; Abramavicius, D.; Bulheller, B. M.; Hirst, J. D.; Mukamel, S. Ultraviolet Spectroscopy of Protein Backbone Transitions in Aqueous Solution: Combined QM and MM Simulations. J. Phys. Chem. B 2010, 114 (24), 8270-8277. 30. Jiang, J.; Abramavicius, D.; Falvo, C.; Bulheller, B. M.; Hirst, J. D.; Mukamel, S. Simulation of Two-Dimensional Ultraviolet Spectroscopy of Amyloid Fibrils. J. Phys. Chem. B 2010, 114 (37), 12150-12156. 31. Jiang, J.; Lai, Z.; Wang, J.; Mukamel, S. Signatures of the Protein Folding Pathway in Two-Dimensional Ultraviolet Spectroscopy. J. Phys. Chem. Lett. 2014, 5 (8), 1341-1346. 32. Zhang, I. Y.; Jiang, J.; Gao, B.; Xu, X.; Luo, Y. RRS-PBC: a molecular approach for periodic systems. Science China Chemistry 2014, 57 (10), 1399-1404. 33. Abramavicius, D.; Palmieri, B.; Mukamel, S. Extracting single and two-exciton couplings in photosynthetic complexes by coherent two-dimensional electronic spectra. Chem. Phys. 2009, 357 (1-3), 79-84. 34. Phillips, J. C.; Braun, R.; Wang, W.; Gumbart, J.; Tajkhorshid, E.; Villa, E.; Chipot, C.; Skeel, R. D.; Kale, L.; Schulten, K. Scalable molecular dynamics with NAMD. J. Comput. Chem. 2005, 26 (16), 1781-1802. 35. MacKerell, A. D.; Bashford, D.; Bellott, M.; Dunbrack, R. L.; Evanseck, J. D.; Field, M. J.; Fischer, S.; Gao, J.; Guo, H.; Ha, S.; et al. All-atom empirical potential for molecular modeling and dynamics studies of proteins. J. Phys. Chem. B 1998, 102 (18), 35863616. 36. Jorgensen, W. L.; Chandrasekhar, J.; Madura, J. D.; Impey, R. W.; Klein, M. L. Comparison of Simple Potential Functions for Simulating Liquid Water. J. Chem. Phys. 1983, 79 (2), 926-935. 37. Essmann, U.; Perera, L.; Berkowitz, M. L.; Darden, T.; Lee, H.; Pedersen, L. G. A Smooth Particle Mesh Ewald Method. J. Chem. Phys. 1995, 103 (19), 8577-8593. 38. Bulheller, B. M.; Hirst, J. D. DichroCalc-circular and linear dichroism online. Bioinformatics 2009, 25 (4), 539-540. 39. Yamamoto, N.; Vreven, T.; Robb, M. A.; Frisch, M. J.; Schlegel, H. B. A direct derivative MC-SCF procedure. Chem. Phys. Lett. 1996, 250 (3-4), 373-378. 40. Aquilante, F.; De Vico, L.; Ferre, N.; Ghigo, G.; Malmqvist, P. A.; Neogrady, P.; Pedersen, T. B.; Pitonak, M.; Reiher, M.; Roos, B. O.; et al. Software News and Update MOLCAS 7: The Next Generation. J. Comput. Chem. 2010, 31 (1), 224-247. 41. Frisch, M. J.; Trucks, G. W.; Schlegel, H. B.; Scuseria, G. E.; Robb, M. A.; Cheeseman, J. R.; Montgomery, J. A.; Vreven, T.;

ACS Paragon Plus Environment

6

Page 7 of 7

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

The Journal of Physical Chemistry Letters

Kudin, K. N.; Burant, J. C.; et al. Gaussian 03, revision C.02; Gaussian, Inc.: Wallingford, CT, 2004. 42. Abramavicius, D.; Palmieri, B.; Voronine, D. V.; Šanda, F.; Mukamel, S. Coherent Multidimensional Optical Spectroscopy of Excitons in Molecular Aggregates; Quasiparticle versus Supermolecule Perspectives. Chem. Rev. 2009, 109 (6), 2350-2408. 43. Kabsch, W.; Sander, C. Dictionary of Protein Secondary Structure - Pattern-Recognition of Hydrogen-Bonded and Geometrical Features. Biopolymers 1983, 22 (12), 2577-2637.

44. Touw, W. G.; Baakman, C.; Black, J.; te Beek, T. A. H.; Krieger, E.; Joosten, R. P.; Vriend, G. A series of PDB-related databanks for everyday needs. Nucleic Acids Res. 2015, 43 (D1), D364-D368.

ACS Paragon Plus Environment

7