Metamorphic Proteins: Emergence of Dual Protein ... - ACS Publications

Jun 1, 2017 - Muralikrishna Lella* and Radhakrishnan Mahalakshmi*. Molecular Biophysics Laboratory, Department of Biological Sciences, Indian Institut...
0 downloads 0 Views 4MB Size
Subscriber access provided by CORNELL UNIVERSITY LIBRARY

Perspective

Metamorphic Proteins: Emergence of dual protein folds from one primary sequence Muralikrishna Lella, and Radhakrishnan Mahalakshmi Biochemistry, Just Accepted Manuscript • Publication Date (Web): 01 Jun 2017 Downloaded from http://pubs.acs.org on June 5, 2017

Just Accepted “Just Accepted” manuscripts have been peer-reviewed and accepted for publication. They are posted online prior to technical editing, formatting for publication and author proofing. The American Chemical Society provides “Just Accepted” as a free service to the research community to expedite the dissemination of scientific material as soon as possible after acceptance. “Just Accepted” manuscripts appear in full in PDF format accompanied by an HTML abstract. “Just Accepted” manuscripts have been fully peer reviewed, but should not be considered the official version of record. They are accessible to all readers and citable by the Digital Object Identifier (DOI®). “Just Accepted” is an optional service offered to authors. Therefore, the “Just Accepted” Web site may not include all articles that will be published in the journal. After a manuscript is technically edited and formatted, it will be removed from the “Just Accepted” Web site and published as an ASAP article. Note that technical editing may introduce minor changes to the manuscript text and/or graphics which could affect content, and all legal disclaimers and ethical guidelines that apply to the journal pertain. ACS cannot be held responsible for errors or consequences arising from the use of information contained in these “Just Accepted” manuscripts.

Biochemistry is published by the American Chemical Society. 1155 Sixteenth Street N.W., Washington, DC 20036 Published by American Chemical Society. Copyright © American Chemical Society. However, no copyright claim is made to original U.S. Government works, or works produced by employees of any Commonwealth realm Crown government in the course of their duties.

Page 1 of 63

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

Metamorphic Proteins: Emergence of dual protein folds from one primary sequence Muralikrishna Lella1,* and Radhakrishnan Mahalakshmi1,* 1

Molecular Biophysics Laboratory, Department of Biological Sciences, Indian Institute of

Science Education and Research, Bhopal – 462066. India. *E-mail: [email protected]; [email protected].

KEYWORDS. Metamorphic proteins, fold switching, α-helix, β-sheet, chameleonic sequences.

ACS Paragon Plus Environment

1

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 2 of 63

ABSTRACT

Every amino acid exhibits a different propensity towards distinct structural conformations. Hence, decoding how the primary amino acid sequence transitions to a defined secondary structure and its final three-dimensional fold is presently considered predictable with reasonable certainty. However, protein sequences have recently been discovered that defy the first principles of secondary structure prediction – they attain two different folds. Such proteins, aptly named metamorphic proteins, decrease the conformational constraint by increasing flexibility in the secondary structure, and thereby result in efficient functionality. In this review, we discuss the major factors driving conformational switch, relating both to protein sequence and structure using illustrative examples. We discuss the concept of evolutionary transition in sequence and structure, the functional impact of the tertiary fold, and the pressure of intrinsic and external factors that give rise to metamorphic proteins. We mainly focus on the major components of protein architecture, namely, the α-helix and β-sheet segments, which are involved in conformational switching within the same or highly similar sequences. These chameleonic sequences are widespread in both cytosolic and membrane proteins, and these folds are equally important for protein structure and function. We discuss the implications of metamorphic proteins and chameleonic peptide sequences in de novo peptide design.

ACS Paragon Plus Environment

2

Page 3 of 63

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

INTRODUCTION Predicting the structure of newly identified or designed protein sequences based on homology modeling using computational algorithms has seen considerable advances in the past four decades. The accuracy of prediction has improved to 80%;1-3 however, the complexity in protein folds severely hampers complete de novo structure prediction. Several recent examples of proteins and de novo designed peptide sequences have been reported, which defy Anfinsen’s ‘one sequence, one fold’ paradigm to varying degree.4 The observation of such “chameleonic sequences” that exhibit structural plasticity and toggle between dual folds renders structure prediction a challenging task. The available protein structures in the PDB (Protein Data Bank) are increasing with time, emphasizing the need to revisit protein structures that exhibit structural plasticity. Here, we discuss recently identified examples of protein and peptide sequences adopting multiple structures, and we discuss the molecular basis of such changes. The major focus on chameleonic sequences started in 1980s when work from Kabsch and Sander (1984),5 and the Brenner group (1985)6 reported that short sequences from different proteins could attain two distinct folds. Analysis of the structures of 62 proteins revealed distinct structural variability even in short sequences that were five residues long.5 Subsequently, Cohen’s group extended the analysis to identical hexapeptide sequences from a larger dataset of 316 proteins.7 They showed that identical sequences can exhibit vastly different structures, and can exist in unrelated classes of protein folds. For example, Cohen’s group identified eight classes of sequences that exhibited α-helical structure in one class and formed β-strands in another class of proteins.7 In 1996, Minor and Kim coined the term ‘chameleon sequences’ for such sequences, when they demonstrated the context-dependent structural conversion of a 11-

ACS Paragon Plus Environment

3

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 4 of 63

residue segment of a protein.8 Subsequently, Mihaly Mezei used a larger PDB structural dataset to show that the longest naturally occurring chameleonic sequences are seven residues in length.9 Several other examples of chameleonic sequences (heptapeptide and octapeptide) have since been observed.10, 11 More recently, attempts to improve protein structure prediction efficiency, and identification of evolutionary relationship between secondary structures with the same amino acid sequence, have led to the identification of conformational flexibility and helix-strand transitions in short peptides from unrelated proteins.12, 13 Subsequently, several studies have identified molecular and environmental factors that drive conformational conversion in chameleonic sequences.14-16 Studies have deduced the mechanism of structural interconversion experimentally and through molecular dynamics simulations (MDS) in both chameleonic sequences in general, as well as amyloidogenic proteins in particular.17-21 The well-established phenomenon of amyloid formation involves the structural transition from α-helix to β-sheet; the latter then assembles into ordered β-aggregates. This conformational interconversion observed during amyloidogenesis has been investigated and discussed extensively in the literature. Hence, this review does not include examples of β-aggregates. This review mainly focuses on conformational transitions observed for non-amyloidogenic sequences in different circumstances.

ENERGY BARRIER AND SEQUENCE SPACE The protein universe is largely made up of 20 amino acids in its primary sequence. Permutations in the secondary structure elements of α-helix, β-sheet, and random coil bring additional diversity in protein structure and facilitate its function. The diversity we find in protein sequences and structures raises interesting questions such as: (i) What drives the primary

ACS Paragon Plus Environment

4

Page 5 of 63

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

amino acid sequence to reach a defined secondary structure? (ii) How much freedom of fold space is available to the protein in its environment? (iii) How do the fold and sequence dictate protein function? It is pertinent to now address these questions in the context of structural plasticity exhibited in chameleonic sequences. Evidently, factors controlling protein sequencestructure-space serve as parameters that are together responsible for the ultimate fold that the protein adopts.22, 23 Flexibility in local interactions mediated through sequence-space readjustment could allow the folded protein to escape a unique energy minimum and populate a second favorable energy in the folding pathway.20 The emergence of chameleonic sequences therefore relies on an evolutionarily optimized energetic equilibrium between at least two unrelated structures.24-26 It is well known that, folded proteins exist as an ensemble of closely related equilibrium structures in their energetically stable state. However, the conundrum in metamorphic proteins is that more than one distinct structural form is energetically favored. An evolutionary optimization of a single sequence between two equally stable structures indicates that structural (scaffold) plasticity is thermodynamically allowed.27 Further, oligomerization of one of the two metamorphic proteins is believed to energetically stabilize the fold, and facilitate the emergence of alternate structural scaffolds.28 Changes in environmental factors such as buffers or lipidic conditions can also energetically favor the bidirectional equilibrium of a single sequence towards one scaffold or structural assembly.29 This is achieved by changes to the energy landscape such that one structural form is more stabilized than another. An accompanying high energy barrier for the reverse process “locks” the protein in one metamorphic state.29 In proteins wherein oligomerization of at least one of the metamorphs is not mandated by the free energy landscape, the primary sequence is believed to possess stabilizing and destabilizing

ACS Paragon Plus Environment

5

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 6 of 63

segments. Structural rearrangement usually requires a near-complete unfolding of one of the forms and re-establishment of new non-covalent interactions in the alternate metamorphic form.30 An exhaustive study of human protein sequences by the Hilser group points to the existence of a thermodynamic environment31 that balances structure and sequence compatibility. The authors demonstrate that negative selection pressure leads to the retention of one (or a subset) of folds in a system where energetic compatibility of one sequence for more than one fold can exist. This negative selection is imposed by incompatible sequences that function as gatekeeper sequences in deciding folds that are disallowed. Hence, evolution seems to function in contrast to the positive design (or positive compatibility index) that is usually employed in de novo design and protein engineering,32 wherein favorable interactions are preferably retained. The benefits of negative design have also been demonstrated in other studies.33 It can be envisioned that metamorphic proteins have retained some gatekeeper residues31 in sequence and structure segments that determine conformational specificity. Yet, it is likely that they also possess residues of low negative compatibility (high positive compatibility index) that promote alternative favorable interactions and thermodynamically support the formation of more than one stable fold. The following sections now discuss how intrinsic and extrinsic factors allow the folding intermediate to choose between two distinct folded protein states, with illustrations drawn from structural and functional proteins.

ACS Paragon Plus Environment

6

Page 7 of 63

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

HISTORY OF THE PARACELSUS CHALLENGE In 1994, in an attempt to understand protein folding, Creamer and Rose challenged the scientific community to achieve two different folds from one protein by changing no more than 50% of the protein sequence.34, 35 They termed this “The Paracelsus Challenge” for the protein world. Several groups solved this problem to varying degree, but the results from three different groups – Paracelsin-43 from the Thoronton group in 1996, Crotein-G from Yuan and Clarke in 1998, and Janus from the Regan group in 1997 – merit discussion (Figure 1). All three amended proteins involved the redesign of the parent protein cores.36

Figure 1: Reply to the Paracelsus challenge. In each panel, the parent protein is shown on the left, and the target protein is on the right. The residues retained from the parent and target proteins to attain the engineered proteins are highlighted in red and cyan color, respectively. The engineered proteins are Paracelsin-G from BDS-1 and B-domain of Protein A (panel A), CroteinG from 434 Cro repressor and B-domain of protein G (panel B), and Janus from B-domain of protein G and Rop protein (panel C). All the Protein Data Bank (PDB) entries are provided above each structure. The figures are reproduced and modified with permission from Dalal et al., 1997.36 Copyright © 1997 Elsevier Inc.

ACS Paragon Plus Environment

7

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 8 of 63

a) Paracelsin-43: The Thornton group, in collaboration with the Sadler group, used the inverse protein folding method37, 38 to address the Paracelsus challenge. They chose two 90%.61-64 The domains GA98 and GB98 differed only at the 45th position, and possessed Leu and Tyr, respectively. The substitutions L45Y or Y45L in the GA98/GB98 system led to a change in the fold from 3α ↔ 4β+α.64, 65 The substitution at the 25th position from Thr to Ile in GB98 could also change the 4β+α fold to the 3α fold. This GB98-T25I (3α) structure could be rescued back to its 4β+α by substituting Leu to Ala at the 20th position (GB98-T25I, L20A).56, 66 The findings using protein G suggests that the conformational trigger in proteins can be confined to one or two amino acids.67 Such residues play a critical role in stabilizing tertiary interactions and do not allow any other structural rearrangement to occur. Highly homologous heteromorphic proteins such as protein G can pose substantial hurdles for computational biologists in predicting the secondary and three dimensional structure of a protein from the primary sequence. Yet, the identification of protein domains such as that seen for protein G can also provide interesting insight on fold switching and protein plasticity.

ACS Paragon Plus Environment

19

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 20 of 63

CONFORMATIONAL CHANGE UPON PROTEIN DIMERIZATION a) Lymphotactin (Ltn10-Ltn40) Lymphotactin (Ltn, XCL1) is a protein from the C family of chemokines, and is involved in inducing chemotaxis in humans.68 Under physiological conditions (37 °C and 150 mM NaCl) Ltn exists as a dynamic equilibrium between two different native folds (monomer and dimer).69 Monomeric Ltn (Ltn10) presents a 3β+α form (Figure 6A), and binds specifically to its receptor XCR1 (GPCR family) present on leukocytes. Dimeric form of Ltn (Ltn40) is present in an all-β form (Figure 6B), and can bind to cell surface glycosaminoglycans,70 particularly heparin. Ltn40 does not activate XCR1; similarly, Ltn10 fails to recognize heparin. Hence, both structural states have specific affinity to their respective binding partners, and are therefore functionally isolated. The shift in population from Ltn10 to Ltn40 can be controlled under physiological conditions by a change in the salt concentration – Ltn10 is seen at high salt and low temperatures, while at higher temperatures of up to 40 °C and low ionic strength, Ltn40 is predominant.69 Ltn dimerization leads to a change in the protein hydrophobic core and long range interactions, which is achieved by altering the C-terminal α-helix to a β-sheet. Further, hydrogen bonds are reestablished with different amino acids.71

ACS Paragon Plus Environment

20

Page 21 of 63

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

Figure 6: Dynamic equilibrium between monomer and dimer with two different folds in Ltn/XCL1. (A) Ltn10 present in monomeric form adopts a 3β+α in high salt concentration and low temperature. (B) Ltn40 is present in the dimeric form with all-β conformation in low salt and higher temperatures. CC3 (C) and CC5 (D) are engineered Ltn proteins, designed by introducing cysteines (yellow) to lock the protein conformation either in monomer (C) or dimer (D) form through disulfide bonds. The region that changes from α-helix in Ltn10 to random coil region (in Ltn40) is shown as orange; the region that changes from random coil to β-sheet has been shown in blue. The color code for the proteins is retained in the multiple sequence alignment, except for cysteine which is shown as yellow/black. The PDB IDs are shown above each structure. Panel (A) and (B) are redrawn with inspiration from Tuinstra et al., 2008.71 Copyright 2008 National Academy of Sciences.

ACS Paragon Plus Environment

21

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 22 of 63

In addition to identifying and characterizing the structural switch in Ltn, the Volkman group was able to show that Ltn could be conformationally locked in one of the two structures through disulfide bonds72, 73 and disturbing the hydrophobic core (W55→D).71 Disulfide bonds formed in the Ltn CC1 (T10→C and “AC” introduced between G32 and S33) and Ltn CC3 (V21→C, V59→C) variants locks the protein conformation in a monomeric form similar to Ltn10.72 The CC3 mutant shows characteristics similar to Ltn10, and can activate the XCL1 receptor (Figure 6C). In the Ltn CC5 (A36→C, A49→C) variant with a disulfide bond (Figure 6D), or Ltn W55→D with an altered hydrophobic core, the proteins are more stable in the dimeric form with high binding affinity to heparin.73 The stable dimers can also act as a potential HIV inhibitor.73 The Ltn10-Ltn40 equilibrium shows that a single protein can indeed perform multiple functions in the cell through structural transitions, and forms an outstanding example of a perfect metamorphic protein. Another example that shows a similar behavior is the conformational dimer of Mad2 with O-Mad2 and C-Mad2;74, 75 these systems have been reviewed elsewhere in greater detail.76, 77 b) Chloride Intracellular Channel Proteins (CLICs) The chloride intracellular ion channel proteins (CLICs)78, 79 are a family of intracellular anion channels that exhibit multiple metamorphic behavior – (i) they are present as soluble forms or transmembrane ion channels; (ii) one CLIC domain also undergo a conformational interconversion between β-sheet and α-helical structure, which decides whether the protein stays monomeric or forms a dimer.80 The N-terminal domain adopts an α3+β4 structure in its reduced form, which retains the protein as a monomer (Figure 7A). Oxidation modifies the cysteine residues, and leads to the formation of a disulfide bond between two conserved cysteines at the 24th and 59th positions.81 This oxidation causes the N-terminal domain to undergo a structural

ACS Paragon Plus Environment

22

Page 23 of 63

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

transition to an all-α state, which now dimerizes (Figure 7B). The intermediate dimeric state of CLIC inserts into the intracellular membrane and oligomerizes to form an anion channel. By studying the channel properties in the presence of reducing agents, and by using point mutations of the human CLIC protein CLIC1 (C24A), it was established that CLIC1 forms a redoxregulated functional channel.81

Figure 7: Redox-controlled dimerization of CLIC showing the conformational switch in the Nterminal domain. (A) The monomeric state of N-terminal CLIC has a 3α+4β form. Upon oxidation, cysteines present at the 24th and 59th positions (yellow) form a disulfide linkage, leading to protein dimerization (B). The dimeric interface can insert into the membrane. In both the structure and the schematic below, the change from a β-sheet structure to α-helix and loop regions has been shown in blue, and the change from unstructured region to α-helix has been shown in orange. The PDB IDs are shown above each structure.

ACS Paragon Plus Environment

23

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 24 of 63

CONFORMATIONAL CHANGE UPON PROTEIN OLIGOMERIZATION One of the best examples of protein-protein interactions is oligomerization, wherein, homo- or hetero-oligomerization can occur. A stable and energy minimized oligomeric state can be achieved through several changes like domain repositioning, structural rearrangements, formation of new binding partners, and establishing inter- or intra-domain interactions. It is argued that oligomerization can promote conformational changes, by establishing new stabilizing interactions that are not seen in the monomeric protein.27 Both cytosolic and membrane proteins are known to form oligomerization-mediated metamorphic structures. Here, we discuss a few striking examples.

a) Cytosolic Proteins: Metallopeptidase A class of metallopeptidases – the selective and specific caseinolytic metallopeptidase (selecase) from Methanocaldococcus jannaschii – attracted recent attention for its highly metamorphic nature.82 Selecase is a highly soluble protein enzyme that exhibits a concentration– dependent oligomeric state, ranging from monomer (slc1) to octamer (slc8). The functional fold of the protein is in the monomeric state (Figure 8A). Selecase is inactive and self-inhibiting in the oligomeric states (dimer, tetramer, and octamer). The functionally active monomer contains two domains, namely the N-terminal subdomain (NTS, α/β sandwich) and C-terminal subdomain (CTS, all-α). A central hydrophobic core stabilizes the metal binding cleft; the amino acids responsible for catalysis are present between NTS α2 and the connecting loop to CTS α3. This region constitutes the NTS-CTS interface. Upon oligomerization, the stable tetrameric (slc4) form loses enzymatic activity by an auto-inhibitory structural rearrangement that occurs at the

ACS Paragon Plus Environment

24

Page 25 of 63

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

metal binding cleft (Figure 8B). In slc4, a connecting loop between helix α2-α3 and helix α3 protrudes away from the core of the protein and forms canonical anti-parallel β-strands β4 and β5. This conformational change displaces the metal binding ligand H80 by ~16Å and α4 by ~30Å in slc4, rendering selecase inactive. It has been observed that this conformational interconversion is completely concentration dependent, and repacking of monomeric or oligomeric states occurs spontaneously.82

Figure 8: Concentration-dependent oligomerization and structural transition of selecase. (A) The topology of helix α3 and connecting loop of α2-α3 (orange) in the monomeric form of selecase is highlighted. This region is involved in a conformation switch when the protein concentration increases. (B) Tetrameric state of selecase showing that the highlighted region adopts a β-hairpin structure that protrudes away from protein core. The PDB IDs are shown above each structure.

ACS Paragon Plus Environment

25

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 26 of 63

b) Membrane Proteins: α- and β-Pore Forming Toxins Transmembrane proteins show limited variation in the structural motifs that also support their stability and function – they are predominantly α-helical in nature, while some outer membrane proteins of bacteria, mitochondria and chloroplast are β-sheet rich. The most interesting structural metamorphosis is exhibited by pore forming proteins.83 The pore forming proteins are a class of toxins produced from pathogens to attack the host cell by forming holes in the membrane. Pore forming toxins (PFT) are classified into two major classes, α-PFT and β-PFT, based on the final structural fold that is presented in the membrane spanning region.84, 85 PFTs exhibit high coordination between the protein segments as they undergo structural transitions from the monomeric soluble state to the oligomeric membrane integrated pore state. Cytolysin A (ClyA), also called hemolysin E (HlyE) from Staphylococcus aureus is categorized under α-PFT, and it forms α-helical channels in the membrane.86, 87 The structure of ClyA is a four α-helical bundle with one hydrophobic β-tongue (β-hairpin) that is stabilized by hydrophobic aromatic clusters present between the loop regions of two short α-helices αD and αE.86 The β-tongue undergoes a major conformational rearrangement upon ClyA assembly into the membrane, and converts into an α-helix. ClyA now forms three helical bundles with a membrane embedded helix αA1. The helical bundle associates in the membrane, forming a dodecameric transmembrane pore.86 The β-PFT pore formation mechanism has been studied extensively in both prokaryotes and eukaryotes. From the monomeric soluble form, the β-sheet or α-helical domain of the protein converts to an extended β-sheet and forms an oligomeric β-barrel pore in its membrane embedded form. Cholesterol-dependent cytolysin88 and the membrane attack complex/ perforin

ACS Paragon Plus Environment

26

Page 27 of 63

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

family89 show such kind of conformational transitions during the transmembrane pore formation. Other well-studied examples include the bacterial toxin pneumolysin,90 pleurotolysin complex (PlyA-PlyB, Figure 9),91 hemolytic lectin CEL-III,92 and lymphocyte perforin.93

Figure 9: Pore-forming toxins undergo structural transitions upon inserting into the membrane. Illustrated here is the example of the β-PFT class of proteins. PlyB of the PlyA-PlyB complex is involved in a conformational switch from α-helix to β-sheet (A-D, yellow color). This facilitates the formation of a pore in the membrane with a diameter of 80Å. The cross-section (E) and surface view (F) of the membrane pore are also provided. Figures are reproduced here with permission from Lukoyanova et al., 2015.91

ACS Paragon Plus Environment

27

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 28 of 63

LIGHT-MEDIATED CONFORMATIONAL SIGNAL IN PHYTOCHROMES Light-mediated growth and development of cyanobacteria and plants require photoreceptors to harness light, and for efficient photosynthesis.94 One protein class involved in photomorphogenesis is the phytochrome (Phy) photoreceptor. Phytochrome shows photointerconversion between dark adopted red-light absorbing Pr state and the light activated far-red light absorbing Pfr state.95 Phy protein is divided into multiple domains.96, 97 The N-terminal photosensory module (PSM) bears the light absorbing segment. The output module, which stabilizes the head-to-head dimerization along its helical spine, is linked to the PSM. In addition, phytochromes contain the cGMP phosphodiesterase/ adenylyl cyclase/ FhlA (GAF) domain and the Phy-specific (PHY) domain. The GAF domain binds bilin while the PHY domain interacts with the photoreceptor bilin present in GAF domain and stabilizes the photo-activated state of the protein. PHY domain achieves this stabilization through a unique hairpin or tongue loop that extends outside the domain. The extended tongue loop contains two antiparallel β-sheets that interact with the core of bilin/GAF domain through a salt bridge between two polar residues and aromatic hydrophobic interactions. This β-sheet structure is seen in the Pr state of phytochromes (Figure 10, left). Upon photo-activation by bilin receptors, the β-sheet present in the Pr state changes to α-helix in the Pfr state (Figure 10, right). This, in turn, leads to movement in the helical spine at the dimer interface to stabilize the photo-activated state. Conserved motifs like PR-X-S-F (X is any residue), F/W-X-E/Q, W-A/G-G, present in the tongue region, play a crucial role in stabilizing the interactions.98, 99 Mutating these conserved amino acids hampers the photoconversion.99, 100 The proposed toggle mechanism for light-driven conformational change is observed in bacterial,98, 100-102 cyanobacterial,103 and plant phytochromes.99 The phytochromes in

ACS Paragon Plus Environment

28

Page 29 of 63

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

these organisms share common homology in the sequence, structure, and function at Pr state and Pfr states (Figure 10).

Figure 10: Phytochrome photo-interconversion between dark-adopted red-light-absorbing Pr state (left) and the light activated far-red light absorbing Pfr state (right). The β-hairpin is present in the Pr state and it interconverts to the α-helix that populates the Pfr state. Figure reproduced with permission from Burgie et al., 2014.99 Copyright 2014 National Academy of Sciences.

OTHER EXAMPLES OF PROTEIN METAMORPHS Conformational transitions have been observed in many other proteins. Although they have not been discussed elaborately in this review, they are briefly mentioned here. In the case of some nuclear receptors such as the retinoic acid receptor, a structural change elicited upon agonist binding is used as a mechanism for controlling gene expression.104 The enzyme cysteine-aspartic protease-6 (caspase-6) shows helix-sheet conversion upon protein substrate binding.105 Cyanobacterial circadian rhythms are controlled by three different proteins KaiA, KaiB and KaiC. Here, KaiB displays a fold switch to activate KaiC and KaiA, so as to initiate the circadian cycle.106

ACS Paragon Plus Environment

29

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 30 of 63

METHODS TO STUDY CONFORMATIONAL SWITCH IN METAMORPHIC PROTEINS Many proteins and designed peptides show conformational preferences to a particular secondary structure depending on external factors or parameters like concentration of peptide or protein,82 pH,107-109 temperature,110, 111 time,110, 112 ionic strength,69 solvent,113, 114 metal,115, 116 and lipid or micelle concentration.29, 117-124 These external factors can exert pressure on the peptide and allow it to form a stable fold at that condition, by re-establishing local interactions with different partners. By far, structural studies using X-ray crystallography and NMR (nuclear magnetic resonance) have served as the most popular spectroscopic tools to characterize the conformational interconversion seen in metamorphic proteins. In conjunction with protein variants generated through mutagenesis, crystallography and NMR have provided atomistic information on secondary structure transitions and enabled identification of the primary elements responsible for conformational switch. Other spectroscopic methods used to validate structural transitions in metamorphic proteins include electron paramagnetic resonance (EPR),125 circular dichroism (CD),41, 126 Fourier transform infrared (FTIR) spectroscopy,127 and Raman spectroscopy.128, 129 Förster resonance energy transfer (FRET) can also establish the conformational changes; here, fluorescent probes can be strategically introduced such that distances between the fluorophore(s) are different in the two different folds.130 In most metamorphic proteins, the pathway for the conversion from one protein fold to the other protein fold is believed to be direct. If it is not a direct switch, it has to go through intermediates, which have not yet been observed in the vast majority of known metamorphic proteins. If the structural transition follows complete unfolding or an intermediate with lowered

ACS Paragon Plus Environment

30

Page 31 of 63

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

secondary structure content, these unfolded or intermediate states have not yet been characterized, because the change in fold occurs over very short timescales. Hence, the end state structures are well characterized, and experimental limitations in understanding the intermediates are overcome using MDS. Therefore, we can speculate the pathway of fold transition using MDS. Recent studies on metamorphic proteins like RfaH/RfaH-CTD and human islet amylin polypeptide (hIAPP) using MDS have proved that direct switching between the two folds follows the appearance of intermediates.19, 131, 132 Limited Proteolysis-Selected Reaction Monitoring (LiP-SRM) mass spectrometry can be applied to observe structural changes in biomolecules ranging from small peptides to complex biological samples.133 The LiP-SRM workflow includes proteolysis, which depends on structural features of the protein under native conditions. Samples that are treated to limited proteolysis are separated using gradient chromatography, detected using tandem mass spectrometry, and analyzed through shotgun proteomics. This method has been successfully tested to detect subtle or pronounced conformational changes seen in samples as diverse as amyloid proteins to enzymes involved in carbon metabolism.133

ACS Paragon Plus Environment

31

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 32 of 63

METAMORPHIC PROTEINS AND PEPTIDE DESIGN USING FIRST PRINCIPLES Peptides play an important role in a variety of biological processes. They serve as attractive tools to obtain a basic understanding of the conformational switch observed in metamorphic proteins.8 Designed peptides are made to preferentially adopt a particular conformation of either α-helix or β-sheet using conformationally constrained amino acids.134-136 However, extrinsic factors can force peptide sequences to form a different fold by forming new intermolecular interactions. This conformational interconversion is prevalently observed in the aggregation phenomenon of amyloid fibrils. Amyloidogenic peptides serve as a good example to understand the importance of hydrophobic and charged residues in the sequence for irreversible conformational conversion.18, 137 Indeed, a 16-residue peptide that was shown to undergo a temperature-driven reversible conformational switch from α-helix to β-sheet110 served as an excellent model system to understand the initial process of amyloid formation. Our focus here, however, will be on soluble peptides that show chameleonic behavior. Overall, while de novo designed sequences with conformationally constrained residues are unlikely to exhibit chameleonic behavior, the possible occurrence of multiple structural states in nature-inspired peptide sequences cannot be ruled out. We discuss a few select examples in this section. One of the earliest chameleonic peptides was a 16-residue amphiphilic sequence from Manfred Mutter and coworkers in 1991, which exhibited a pH-dependent switch from an α-helical oligomer to an antiparallel β-sheet structure.107 Other designed peptide sequences that exhibited a pH, solvent, salt concentration, or temperature dependent conformational switch from β-sheet to α-helical structures include the 17- residue sequences108 from the Kuntz laboratory, and the Aib-

ACS Paragon Plus Environment

32

Page 33 of 63

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

Gly nucleated octapeptide from the Balaram laboratory.138 The effect of exogenous factors on peptide sequences derived from proteins has also been reported in the case of the 15-residue chameleonic peptide from the CD4-binding domain of HIV gp120113 and peptides from chymotrypsin and liver alcohol dehydrogenase.14 Metal- and redox-mediated conformational switch between α-helix and β-sheet has been reported in designed Zn-binding,109 Cu-binding,116 Zn/Cu-binding115 peptides, and in an oxidation-reduction process involving the Met side chain.126 Chameleonic behavior is uncommon in synthetic membrane-associated peptides derived from natural sequences. Yet, two outstanding and unexpected examples that have been reported so far include the first transmembrane domain of Mycobacteriophage D29 holin peptide (Figure 11A) and the 14-residue peptide of pneumococcus autolysin (Figure 11B). These examples are discussed below. Lytic bacteriophages affect the release of mature progeny phages by the disruption of the bacterial cell wall and membrane. The first step to the physical process of cell lysis involves the formation of holes in the host inner membrane by ‘holin’ proteins produced by the phage.139 Holins can have 1-3 transmembrane domains that adopt a helical structure in the membrane. Holins are quiescently accumulated in the bacterial inner membrane and are maintained in this inactive state by ‘antiholins’ that are co-synthesized in a 2:1 holin: anti-holin ratio.140 Anti-holins are two resides longer than holin proteins. The assembly of mature phage particles triggers the removal of the extra residues from antiholins, converting them to holins.139 Holins form a hole in the host membrane that is sufficiently large to effect progeny phage release.139 In mycobacteriophages, which target various species of mycobacteria, holins are known to occur.139, 141 However, an antiholin has not been identified, which raises the question of how the uncontrolled and premature association of holins is prevented. Our study of the first

ACS Paragon Plus Environment

33

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 34 of 63

transmembrane domain (TM1) of the Mycobacteriophage D29 holin revealed a surprising conformational switch of TM1 that was lipid dependent.120 At high lipid concentrations, TM1 exists in a helical form, similar to other canonical holins. When more holin proteins are synthesized in the host cytosol and relocated to the inner membrane, the protein: lipid ratio increases in favor of protein. Under these conditions, TM1 undergoes a conformational conversion to a β-hairpin structure, which allows the peptide to aggregate in the membrane and cause membrane destabilization.123 A centrally located Pro-Gly segment that is conserved across mycobacteriophage holins facilitates the reversible conversion of TM1 between the helical and β-hairpin structure, based on the protein: lipid ratio (Figure 11A). A similar lipid-dependent structural conversion between a polyproline conformation (in lipid vesicles) and α-helical conformation (in organic solvents) has been reported for the second transmembrane domain of mycobacteriophage D29 holin.142

ACS Paragon Plus Environment

34

Page 35 of 63

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

Figure 11: Micelle- triggered conformation change in TM1 (A) and LytA239-252 (B). (A) The centrally located Pro-Gly present in TM1 is important for β-hairpin to α-helix interconversion in low micelle and high micelle concentrations, respectively. This structural interconversion is abolished upon introducing β-hairpin favoring residues (DPro-Gly) or helix favoring residues (Ala-Ala or Aib-Gly) in the central turn region. Panel A is edited and reproduced with permission from Lella et al., 2013120 with permission from The Royal Society of Chemistry. (B) LytA239-252 is present in a β-hairpin form in aqueous conditions (top) and converts to α-helix

ACS Paragon Plus Environment

35

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 36 of 63

(bottom) when a lipidic (DPC) or detergent (SDS) micelle environment is provided. Panel B is reproduced with permission from Zamora‐Carreras et al., 2015.122 Copyright © 2015 John Wiley and Sons. An interesting case of structural interconversion was reported in an isolated 14-residue sequence of an autolysin from the pathogen Streptococcus pneumoniae. Autolysins possess choline-binding modules (CBMs) that allow these proteins to attach to choline moieties of teichoic and lipoteichoic acids found on the host cell surface. CBMs possess choline-binding repeats (CBRs) that adopt a ββ-solenoid motif. Each strand in this ββ-solenoid motif is constituted by β-hairpins that are about 14 residues in length, and are enriched with aromatic and charged residues.143 Work from the Jiménez group on a CBR of the autolysin LytA (LytA239-252) led to the identification of a structural switch in this 14-residue peptide.122 NMR analysis of LytA239-252 revealed that the peptide adopted a β-hairpin structure in aqueous solutions (Figure 11B). Turn nucleation was achieved at A245-D246 and the non-hydrogen bonding position of the β-hairpin was stabilized by a Lys-Trp and Trp-Tyr interaction.122, 144 Surprisingly, when the peptide was presented with a lipidic environment, it underwent a highly cooperative transition from a β-hairpin to α-helix as the concentration of phosphocholine molecules was increased (Figure 11B). Further characterization of LytA239-252 using NMR and CD showed that the conformational change was facilitated by the interaction of the peptide with phosphocholine micelles and not with the individual phosphocholine molecules. The studies also established that the process of structural conversion in LytA239-252 was reversible. The Jiménez group was also able to demonstrate that the β-hairpin ↔ α-helix conversion could also be elicited in choline and non-choline lipid vesicles, and when non-lipidic micellar system (such as SDS micelles) was provided to the peptide (Figure 11B). As the mechanism of LytA translocation across the

ACS Paragon Plus Environment

36

Page 37 of 63

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

membrane is not known, the identification of a structural switch allowed the authors to propose that the structural switch in the CBRs of LytA could facilitate this translocation. Overall, the amphipathic and metamorphic nature of the CBR region might assist the LytA molecule in its function of host cell adhesion and pathogenesis.

ACS Paragon Plus Environment

37

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 38 of 63

CONCLUSIONS Conformational plasticity exhibited by metamorphic proteins appears to be an interesting deviation from Anfinsen’s hypothesis, and poses a challenge to protein structure prediction and de novo design of novel sequences. However, this dual nature of proteins carries an immense applicability in the design of nanoswitches. An increasing number of proteins that exhibit dramatic conformational switch are being identified. These conformational changes can be triggered by various factors including simple external cues such as pH, temperature, and salt concentrations (Figure 12). While metamorphic proteins can be perceived as a conundrum in solving the protein folding problem, we believe that the identification of more naturally occurring protein metamorphs will facilitate the use of these molecules in bionanotechnology as molecular switches.

Figure 12: Summary of intrinsic and extrinsic factors responsible for metamorphic behavior of proteins.

ACS Paragon Plus Environment

38

Page 39 of 63

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

AUTHOR INFORMATION Corresponding Author *Muralikrishna Lella, E-mail: [email protected]; *Dr. Radhakrishnan Mahalakshmi, E-mail: [email protected]. Author Contributions The manuscript was written through contributions of all authors. All authors have given approval to the final version of the manuscript. *These authors contributed equally. Funding Sources This work was supported by the Science and Engineering Research Board project SERB/WEA13/2016 to R.M. Notes The authors have no conflict of interest to disclose. ACKNOWLEDGMENT M.L. thanks IISER Bhopal for research fellowship. R.M. is a recipient of the Wellcome Trust/ DBT India Alliance Intermediate Fellowship.

ACS Paragon Plus Environment

39

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 40 of 63

REFERENCES [1]. [2]. [3].

[4]. [5]. [6].

[7]. [8]. [9]. [10]. [11]. [12]. [13]. [14]. [15]. [16]. [17]. [18].

Frishman, D., and Argos, P. (1997) The future of protein secondary structure prediction accuracy, Fold. Des. 2, 159-162. Pirovano, W., and Heringa, J. (2010) Protein secondary structure prediction, Methods Mol. Biol. 609, 327-348. Faraggi, E., Zhang, T., Yang, Y., Kurgan, L., and Zhou, Y. (2012) Spine X: improving protein secondary structure prediction by multistep learning coupled with prediction of solvent accessible surface area and backbone torsion angles, J. Comput. Chem. 33, 259267. Anfinsen, C. B. (1973) Principles that govern the folding of protein chains, Science 181, 223-230. Kabsch, W., and Sander, C. (1984) On the use of sequence homologies to predict protein structure: identical pentapeptides can have completely different conformations, Proc. Natl. Acad. Sci. U. S. A. 81, 1075-1078. Wilson, I. A., Haft, D. H., Getzoff, E. D., Tainer, J. A., Lerner, R. A., and Brenner, S. (1985) Identical short peptide sequences in unrelated proteins can have different conformations: a testing ground for theories of immune recognition, Proc. Natl. Acad. Sci. U. S. A. 82, 5255-5259. Cohen, B. I., Presnell, S. R., and Cohen, F. E. (1993) Origins of structural diversity within sequentially identical hexapeptides, Protein Sci. 2, 2134-2145. Minor, D. L., Jr., and Kim, P. S. (1996) Context-dependent secondary structure formation of a designed protein sequence, Nature 380, 730-734. Mezei, M. (1998) Chameleon sequences in the PDB, Protein Eng. 11, 411-414. Guo, J. T., Jaromczyk, J. W., and Xu, Y. (2007) Analysis of chameleon sequences and their implications in biological processes, Proteins 67, 548-558. Krishna, N., and Guruprasad, K. (2011) Certain heptapeptide and large sequences representing an entire helix, strand or coil conformation in proteins are associated as chameleon sequences, Int. J. Biol. Macromolec. 49, 218-222. Zhou, X., Alber, F., Folkers, G., Gonnet, G. H., and Chelvanayagam, G. (2000) An analysis of the helix-to-strand transition between peptides with identical sequence, Proteins 41, 248-256. Jacoboni, I., Martelli, P. L., Fariselli, P., Compiani, M., and Casadio, R. (2000) Predictions of protein segments with the same aminoacid sequence and different secondary structure: a benchmark for predictive methods, Proteins 41, 535-544. Zhong, L., and Johnson, W. C., Jr. (1992) Environment affects amino acid preference for secondary structure, Proc. Natl. Acad. Sci. U. S. A. 89, 4462-4465. Grishin, N. V. (2001) Fold change in evolution of protein structures, J. Struct. Biol. 134, 167-185. Macdonald, J. R., and Johnson, W. C., Jr. (2001) Environmental features are important in determining protein secondary structure, Protein Sci. 10, 1172-1177. Ding, F., Borreguero, J. M., Buldyrey, S. V., Stanley, H. E., and Dokholyan, N. V. (2003) Mechanism for the alpha-helix to beta-hairpin transition, Proteins 53, 220-228. Pagel, K., Vagt, T., and Koksch, B. (2005) Directing the secondary structure of polypeptides at will: from helices to amyloids and back again?, Org. Biomol. Chem. 3, 3843-3850.

ACS Paragon Plus Environment

40

Page 41 of 63

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

[19].

[20]. [21]. [22]. [23]. [24]. [25]. [26]. [27]. [28]. [29]. [30]. [31]. [32]. [33]. [34]. [35].

Li, S., Xiong, B., Xu, Y., Lu, T., Luo, X., Luo, C., Shen, J., Chen, K., Zheng, M., and Jiang, H. (2014) Mechanism of the all-alpha to all-beta conformational transition of RfaH-CTD: molecular dynamics simulation and Markov state model, J. Chem. Theory Comput. 10, 2255-2264. Lin, M. M., Shorokhov, D., and Zewail, A. H. (2014) Dominance of misfolded intermediates in the dynamics of alpha-helix folding, Proc. Natl. Acad. Sci. U. S. A. 111, 14424-14429. Kim, B., Do, T. D., Hayden, E. Y., Teplow, D. B., Bowers, M. T., and Shea, J. E. (2016) Aggregation of chameleon peptides: implications of alpha-helicity in fibril formation, J. Phys. Chem. B 120, 5874-5883. Meyerguz, L., Kleinberg, J., and Elber, R. (2007) The network of sequence flow between protein structures, Proc. Natl. Acad. Sci. U. S. A. 104, 11627-11632. Cao, B., and Elber, R. (2010) Computational exploration of the network of sequence flow between protein structures, Proteins 78, 985-1003. James, L. C., and Tawfik, D. S. (2003) Conformational diversity and protein evolution--a 60-year-old hypothesis revisited, Trends Biochem. Sci. 28, 361-368. Tokuriki, N., and Tawfik, D. S. (2009) Protein dynamism and evolvability, Science 324, 203-207. Cordes, M. H., and Stewart, K. L. (2012) The porous borders of the protein world, Structure 20, 199-200. Yadid, I., Kirshenbaum, N., Sharon, M., Dym, O., and Tawfik, D. S. (2010) Metamorphic proteins mediate evolutionary transitions of structure, Proc. Natl. Acad. Sci. U. S. A. 107, 7287-7292. Alva, V., Koretke, K. K., Coles, M., and Lupas, A. N. (2008) Cradle-loop barrels and the concept of metafolds in protein classification by natural descent, Curr. Opin. Struct. Biol. 18, 358-365. Tanaka, K., Caaveiro, J. M., and Tsumoto, K. (2015) Bidirectional transformation of a metamorphic protein between the water-soluble and transmembrane native states, Biochemistry 54, 6863-6866. Tyler, R. C., Murray, N. J., Peterson, F. C., and Volkman, B. F. (2011) Native-state interconversion of a metamorphic protein requires global unfolding, Biochemistry 50, 7077-7079. Hoffmann, J., Wrabl, J. O., and Hilser, V. J. (2016) The role of negative selection in protein evolution revealed through the energetics of the native state ensemble, Proteins 84, 435-447. Kuhlman, B., Dantas, G., Ireton, G. C., Varani, G., Stoddard, B. L., and Baker, D. (2003) Design of a novel globular protein fold with atomic-level accuracy, Science 302, 13641368. Bolon, D. N., Grant, R. A., Baker, T. A., and Sauer, R. T. (2005) Specificity versus stability in computational protein design, Proc. Natl. Acad. Sci. U. S. A. 102, 1272412729. Rose, G. D., and Creamer, T. P. (1994) Protein folding: predicting predicting, Proteins 19, 1-3. Rose, G. D. (1997) Protein folding and the Paracelsus challenge, Nat. Struct. Biol. 4, 512514.

ACS Paragon Plus Environment

41

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

[36]. [37]. [38]. [39].

[40]. [41]. [42]. [43]. [44]. [45].

[46].

[47]. [48]. [49]. [50]. [51]. [52].

Page 42 of 63

Dalal, S., Balasubramanian, S., and Regan, L. (1997) Transmuting alpha helices and beta sheets, Fold. Des. 2, R71-79. Bowie, J. U., Luthy, R., and Eisenberg, D. (1991) A method to identify protein sequences that fold into a known three-dimensional structure, Science 253, 164-170. Yue, K., and Dill, K. A. (1992) Inverse protein folding problem: designing polymer sequences, Proc. Natl. Acad. Sci. U. S. A. 89, 4163-4167. Jones, D. T., Moody, C. M., Uppenbrink, J., Viles, J. H., Doyle, P. M., Harris, C. J., Pearl, L. H., Sadler, P. J., and Thornton, J. M. (1996) Towards meeting the Paracelsus challenge: the design, synthesis, and characterization of paracelsin-43, an alpha-helical protein with over 50% sequence identity to an all-beta protein, Proteins 24, 502-513. Yuan, S. M., and Clarke, N. D. (1998) A hybrid sequence approach to the Paracelsus challenge, Proteins 30, 136-143. Dalal, S., Balasubramanian, S., and Regan, L. (1997) Protein alchemy: changing betasheet into alpha-helix, Nat. Struct. Biol. 4, 548-552. Kinch, L. N., and Grishin, N. V. (2002) Evolution of protein structures and functions, Curr. Opin. Struct. Biol. 12, 400-408. Newlove, T., Konieczka, J. H., and Cordes, M. H. (2004) Secondary structure switching in Cro protein evolution, Structure 12, 569-581. Van Dorn, L. O., Newlove, T., Chang, S., Ingram, W. M., and Cordes, M. H. (2006) Relationship between sequence determinants of stability for two natural homologous proteins with different folds, Biochemistry 45, 10542-10553. Roessler, C. G., Hall, B. M., Anderson, W. J., Ingram, W. M., Roberts, S. A., Montfort, W. R., and Cordes, M. H. (2008) Transitive homology-guided structural studies lead to discovery of Cro proteins with 40% sequence identity but different folds, Proc. Natl. Acad. Sci. U. S. A. 105, 2343-2348. Burmann, B. M., Knauer, S. H., Sevostyanova, A., Schweimer, K., Mooney, R. A., Landick, R., Artsimovitch, I., and Rosch, P. (2012) An alpha helix to beta barrel domain switch transforms the transcription factor RfaH into a translation factor, Cell 150, 291303. Belogurov, G. A., Vassylyeva, M. N., Svetlov, V., Klyuyev, S., Grishin, N. V., Vassylyev, D. G., and Artsimovitch, I. (2007) Structural basis for converting a general transcription factor into an operon-specific virulence regulator, Mol. Cell 26, 117-129. Tomar, S. K., Knauer, S. H., Nandymazumdar, M., Rosch, P., and Artsimovitch, I. (2013) Interdomain contacts control folding of transcription factor RfaH, Nucleic Acids Res. 41, 10077-10085. Burmann, B. M., Schweimer, K., Luo, X., Wahl, M. C., Stitt, B. L., Gottesman, M. E., and Rosch, P. (2010) A NusE:NusG complex links transcription and translation, Science 328, 501-504. Mooney, R. A., Schweimer, K., Rosch, P., Gottesman, M., and Landick, R. (2009) Two structurally independent domains of E. coli NusG create regulatory plasticity via distinct interactions with RNA polymerase and regulators, J. Mol. Biol. 391, 341-358. Cordes, M. H., Walsh, N. P., McKnight, C. J., and Sauer, R. T. (1999) Evolution of a protein fold in vitro, Science 284, 325-328. Cordes, M. H., Burton, R. E., Walsh, N. P., McKnight, C. J., and Sauer, R. T. (2000) An evolutionary bridge to a new protein fold, Nat. Struct. Biol. 7, 1129-1132.

ACS Paragon Plus Environment

42

Page 43 of 63

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

[53]. [54]. [55]. [56]. [57]. [58]. [59]. [60]. [61]. [62]. [63]. [64]. [65]. [66]. [67]. [68].

Cordes, M. H., Walsh, N. P., McKnight, C. J., and Sauer, R. T. (2003) Solution structure of switch Arc, a mutant with 3(10) helices replacing a wild-type beta-ribbon, J. Mol. Biol. 326, 899-909. Anderson, T. A., Cordes, M. H., and Sauer, R. T. (2005) Sequence determinants of a conformational switch in a protein structure, Proc. Natl. Acad. Sci. U. S. A. 102, 1834418349. Allison, J. R., Bergeler, M., Hansen, N., and van Gunsteren, W. F. (2011) Current computer modeling cannot explain why two highly similar sequences fold into different structures, Biochemistry 50, 10965-10973. Bryan, P. N., and Orban, J. (2013) Implications of protein fold switching, Curr. Opin. Struct. Biol. 23, 314-316. Sjobring, U., Bjorck, L., and Kastern, W. (1991) Streptococcal protein G. Gene structure and protein binding properties, J. Biol. Chem. 266, 399-405. He, Y., Rozak, D. A., Sari, N., Chen, Y., Bryan, P., and Orban, J. (2006) Structure, dynamics, and stability variation in bacterial albumin binding modules: implications for species specificity, Biochemistry 45, 10102-10109. Gallagher, T., Alexander, P., Bryan, P., and Gilliland, G. L. (1994) Two crystal structures of the B1 immunoglobulin-binding domain of streptococcal protein G and comparison with NMR, Biochemistry 33, 4721-4729. Alexander, P. A., Rozak, D. A., Orban, J., and Bryan, P. N. (2005) Directed evolution of highly homologous proteins with different folds by phage display: implications for the protein folding code, Biochemistry 44, 14045-14054. Alexander, P. A., He, Y., Chen, Y., Orban, J., and Bryan, P. N. (2007) The design and characterization of two proteins with 88% sequence identity but different structure and function, Proc. Natl. Acad. Sci. U. S. A. 104, 11963-11968. Giri, R., Morrone, A., Travaglini-Allocatelli, C., Jemth, P., Brunori, M., and Gianni, S. (2012) Folding pathways of proteins with increasing degree of sequence identities but different structure and function, Proc. Natl. Acad. Sci. U. S. A. 109, 17772-17776. He, Y., Chen, Y., Alexander, P., Bryan, P. N., and Orban, J. (2008) NMR structures of two designed proteins with high sequence identity but different fold and function, Proc. Natl. Acad. Sci. U. S. A. 105, 14412-14417. Alexander, P. A., He, Y., Chen, Y., Orban, J., and Bryan, P. N. (2009) A minimal sequence code for switching protein structure and function, Proc. Natl. Acad. Sci. U. S. A. 106, 21149-21154. Shortle, D. (2009) One sequence plus one mutation equals two folds, Proc. Natl. Acad. Sci. U. S. A. 106, 21011-21012. He, Y., Chen, Y., Alexander, P. A., Bryan, P. N., and Orban, J. (2012) Mutational tipping points for switching protein folds and functions, Structure 20, 283-291. Roy, A., Perez, A., Dill, K. A., and Maccallum, J. L. (2014) Computing the relative stabilities and the per-residue components in protein conformational changes, Structure 22, 168-175. Kelner, G. S., Kennedy, J., Bacon, K. B., Kleyensteuber, S., Largaespada, D. A., Jenkins, N. A., Copeland, N. G., Bazan, J. F., Moore, K. W., Schall, T. J., and et al. (1994) Lymphotactin: a cytokine that represents a new class of chemokine, Science 266, 13951399.

ACS Paragon Plus Environment

43

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

[69]. [70]. [71]. [72]. [73]. [74]. [75]. [76]. [77]. [78].

[79].

[80].

[81]. [82].

[83]. [84].

Page 44 of 63

Kuloglu, E. S., McCaslin, D. R., Markley, J. L., and Volkman, B. F. (2002) Structural rearrangement of human lymphotactin, a C chemokine, under physiological solution conditions, J. Biol. Chem. 277, 17863-17870. Fox, J. C., Tyler, R. C., Peterson, F. C., Dyer, D. P., Zhang, F., Linhardt, R. J., Handel, T. M., and Volkman, B. F. (2016) Examination of glycosaminoglycan binding sites on the XCL1 dimer, Biochemistry 55, 1214-1225. Tuinstra, R. L., Peterson, F. C., Kutlesa, S., Elgin, E. S., Kron, M. A., and Volkman, B. F. (2008) Interconversion between two unrelated protein folds in the lymphotactin native state, Proc. Natl. Acad. Sci. U. S. A. 105, 5057-5062. Tuinstra, R. L., Peterson, F. C., Elgin, E. S., Pelzek, A. J., and Volkman, B. F. (2007) An engineered second disulfide bond restricts lymphotactin/XCL1 to a chemokine-like conformation with XCR1 agonist activity, Biochemistry 46, 2564-2573. Fox, J. C., Tyler, R. C., Guzzo, C., Tuinstra, R. L., Peterson, F. C., Lusso, P., and Volkman, B. F. (2015) Engineering metamorphic chemokine lymphotactin/ XCL1 into the GAG-binding, HIV-inhibitory dimer conformation, ACS Chem. Biol. 10, 2580-2588. Luo, X., Tang, Z., Xia, G., Wassmann, K., Matsumoto, T., Rizo, J., and Yu, H. (2004) The Mad2 spindle checkpoint protein has two distinct natively folded states, Nat. Struct. Mol. Biol. 11, 338-345. Mapelli, M., Massimiliano, L., Santaguida, S., and Musacchio, A. (2007) The Mad2 conformational dimer: Structure and implications for the spindle assembly checkpoint, Cell 131, 730-743. Murzin, A. G. (2008) Metamorphic proteins, Science 320, 1725-1726. Goodchild, S. C., Curmi, P. M. G., and Brown, L. J. (2011) Structural gymnastics of multifunctional metamorphic proteins, Biophys. Rev. 3, 143– 153. Littler, D. R., Harrop, S. J., Goodchild, S. C., Phang, J. M., Mynott, A. V., Jiang, L., Valenzuela, S. M., Mazzanti, M., Brown, L. J., Breit, S. N., and Curmi, P. M. (2010) The enigma of the CLIC proteins: ion channels, redox proteins, enzymes, scaffolding proteins?, FEBS Lett. 584, 2093-2101. Al Khamici, H., Brown, L. J., Hossain, K. R., Hudson, A. L., Sinclair-Burton, A. A., Ng, J. P., Daniel, E. L., Hare, J. E., Cornell, B. A., Curmi, P. M., Davey, M. W., and Valenzuela, S. M. (2015) Members of the chloride intracellular ion channel protein family demonstrate glutaredoxin-like enzymatic activity, PLoS One 10, e115699. Littler, D. R., Harrop, S. J., Fairlie, W. D., Brown, L. J., Pankhurst, G. J., Pankhurst, S., DeMaere, M. Z., Campbell, T. J., Bauskin, A. R., Tonini, R., Mazzanti, M., Breit, S. N., and Curmi, P. M. (2004) The intracellular chloride ion channel protein CLIC1 undergoes a redox-controlled structural transition, J. Biol. Chem. 279, 9298-9305. Singh, H., and Ashley, R. H. (2006) Redox regulation of CLIC1 by cysteine residues associated with the putative channel pore, Biophys. J. 90, 1628-1638. Lopez-Pelegrin, M., Cerda-Costa, N., Cintas-Pedrola, A., Herranz-Trillo, F., Bernado, P., Peinado, J. R., Arolas, J. L., and Gomis-Ruth, F. X. (2014) Multiple stable conformations account for reversible concentration-dependent oligomerization and autoinhibition of a metamorphic metallopeptidase, Angew. Chem. Int. Ed. 53, 10624-10630. Iacovache, I., Bischofberger, M., and van der Goot, F. G. (2010) Structure and assembly of pore-forming proteins, Curr. Opin. Struct. Biol. 20, 241-246. Tilley, S. J., and Saibil, H. R. (2006) The mechanism of pore formation by bacterial toxins, Curr. Opin. Struct. Biol. 16, 230-236.

ACS Paragon Plus Environment

44

Page 45 of 63

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

[85].

Dal Peraro, M., and van der Goot, F. G. (2016) Pore-forming toxins: ancient, but never really out of fashion, Nat. Rev. Microbiol. 14, 77-92. [86]. Mueller, M., Grauschopf, U., Maier, T., Glockshuber, R., and Ban, N. (2009) The structure of a cytolytic alpha-helical toxin pore reveals its assembly mechanism, Nature 459, 726-730. [87]. Wallace, A. J., Stillman, T. J., Atkins, A., Jamieson, S. J., Bullough, P. A., Green, J., and Artymiuk, P. J. (2000) E. coli hemolysin E (HlyE, ClyA, SheA): X-ray crystal structure of the toxin and observation of membrane pores by electron microscopy, Cell 100, 265276. [88]. Hotze, E. M., and Tweten, R. K. (2012) Membrane assembly of the cholesteroldependent cytolysin pore complex, Biochim. Biophys. Acta 1818, 1028-1038. [89]. Reboul, C. F., Whisstock, J. C., and Dunstone, M. A. (2016) Giant MACPF/CDC pore forming toxins: a class of their own, Biochim. Biophys. Acta 1858, 475-486. [90]. Tilley, S. J., Orlova, E. V., Gilbert, R. J., Andrew, P. W., and Saibil, H. R. (2005) Structural basis of pore formation by the bacterial toxin pneumolysin, Cell 121, 247-256. [91]. Lukoyanova, N., Kondos, S. C., Farabella, I., Law, R. H., Reboul, C. F., Caradoc-Davies, T. T., Spicer, B. A., Kleifeld, O., Traore, D. A., Ekkel, S. M., Voskoboinik, I., Trapani, J. A., Hatfaludi, T., Oliver, K., Hotze, E. M., Tweten, R. K., Whisstock, J. C., Topf, M., Saibil, H. R., and Dunstone, M. A. (2015) Conformational changes during pore formation by the perforin-related protein pleurotolysin, PLoS Biol. 13, e1002049. [92]. Unno, H., Goda, S., and Hatakeyama, T. (2014) Hemolytic lectin CEL-III heptamerizes via a large structural transition from alpha-helices to a beta-barrel during the transmembrane pore formation process, J. Biol. Chem. 289, 12805-12812. [93]. Law, R. H., Lukoyanova, N., Voskoboinik, I., Caradoc-Davies, T. T., Baran, K., Dunstone, M. A., D'Angelo, M. E., Orlova, E. V., Coulibaly, F., Verschoor, S., Browne, K. A., Ciccone, A., Kuiper, M. J., Bird, P. I., Trapani, J. A., Saibil, H. R., and Whisstock, J. C. (2010) The structural basis for membrane binding and pore formation by lymphocyte perforin, Nature 468, 447-451. [94]. Moglich, A., Yang, X., Ayers, R. A., and Moffat, K. (2010) Structure and function of plant photoreceptors, Annu. Rev. Plant. Biol. 61, 21-47. [95]. Rockwell, N. C., Su, Y. S., and Lagarias, J. C. (2006) Phytochrome structure and signaling mechanisms, Annu. Rev. Plant. Biol. 57, 837-858. [96]. Sharrock, R. A. (2008) The phytochrome red/far-red photoreceptor superfamily, Genome Biol. 9, 230. [97]. Burgie, E. S., and Vierstra, R. D. (2014) Phytochromes: an atomic perspective on photoactivation and signaling, Plant Cell 26, 4568-4583. [98]. Takala, H., Bjorling, A., Berntsson, O., Lehtivuori, H., Niebling, S., Hoernke, M., Kosheleva, I., Henning, R., Menzel, A., Ihalainen, J. A., and Westenhoff, S. (2014) Signal amplification and transduction in phytochrome photosensors, Nature 509, 245248. [99]. Burgie, E. S., Bussell, A. N., Walker, J. M., Dubiel, K., and Vierstra, R. D. (2014) Crystal structure of the photosensing module from a red/far-red light-absorbing plant phytochrome, Proc. Natl. Acad. Sci. U. S. A. 111, 10179-10184. [100]. Burgie, E. S., Zhang, J., and Vierstra, R. D. (2016) Crystal structure of deinococcus phytochrome in the photoactivated state reveals a cascade of structural rearrangements during photoconversion, Structure 24, 448-457.

ACS Paragon Plus Environment

45

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 46 of 63

[101]. Yang, X., Kuk, J., and Moffat, K. (2009) Conformational differences between the Pfr and Pr states in Pseudomonas aeruginosa bacteriophytochrome, Proc. Natl. Acad. Sci. U. S. A. 106, 15639-15644. [102]. Otero, L. H., Klinke, S., Rinaldi, J., Velazquez-Escobar, F., Mroginski, M. A., Fernandez Lopez, M., Malamud, F., Vojnov, A. A., Hildebrandt, P., Goldbaum, F. A., and Bonomi, H. R. (2016) Structure of the full-length bacteriophytochrome from the plant pathogen Xanthomonas campestris provides clues to its long-range signaling mechanism, J. Mol. Biol. 428, 3702-3720. [103]. Anders, K., Daminelli-Widany, G., Mroginski, M. A., von Stetten, D., and Essen, L. O. (2013) Structure of the cyanobacterial phytochrome 2 photosensor implies a tryptophan switch for phytochrome signaling, J. Biol. Chem. 288, 35714-35725. [104]. le Maire, A., Teyssier, C., Erb, C., Grimaldi, M., Alvarez, S., de Lera, A. R., Balaguer, P., Gronemeyer, H., Royer, C. A., Germain, P., and Bourguet, W. (2010) A unique secondary-structure switch controls constitutive gene repression by retinoic acid receptor, Nat. Struct. Mol. Biol. 17, 801-807. [105]. Dagbay, K. B., Bolik-Coulon, N., Savinov, S. N., and Hardy, J. A. (2017) Caspase-6 undergoes a distinct helix-strand interconversion upon substrate binding, J. Biol. Chem. 292, 4885-4897. [106]. Chang, Y. G., Cohen, S. E., Phong, C., Myers, W. K., Kim, Y. I., Tseng, R., Lin, J., Zhang, L., Boyd, J. S., Lee, Y., Kang, S., Lee, D., Li, S., Britt, R. D., Rust, M. J., Golden, S. S., and LiWang, A. (2015) Circadian rhythms. a protein fold switch joins the circadian oscillator to clock output in Cyanobacteria, Science 349, 324-328. [107]. Mutter, M., Gassmann, R., Buttkus, U., and Altmann, K. H. (1991) Switch peptides - pHinduced alpha-helix to beta-sheet transitions of bis-amphiphilic oligopeptides, Angew. Chem. Int. Ed. 30, 1514-1516. [108]. Cerpa, R., Cohen, F. E., and Kuntz, I. D. (1996) Conformational switching in designed peptides: the helix/sheet transition, Fold. Des. 1, 91-101. [109]. Ambroggio, X. I., and Kuhlman, B. (2006) Computational design of a single amino acid sequence that can switch between two distinct protein folds, J. Am. Chem. Soc. 128, 1154-1161. [110]. Zhang, S., and Rich, A. (1997) Direct conversion of an oligopeptide from a beta-sheet to an alpha-helix: a model for amyloid formation, Proc. Natl. Acad. Sci. U. S. A. 94, 23-28. [111]. Kammerer, R. A., and Steinmetz, M. O. (2006) De novo design of a two-stranded coiledcoil switch peptide, J. Struct. Biol. 155, 146-153. [112]. Pagel, K., Wagner, S. C., Samedov, K., von Berlepsch, H., Bottcher, C., and Koksch, B. (2006) Random coils, beta-sheet ribbons, and alpha-helical fibers: one peptide adopting three different secondary structures at will, J. Am. Chem. Soc. 128, 2196-2197. [113]. Reed, J., and Kinzel, V. (1991) A conformational switch is associated with receptor affinity in peptides derived from the CD4-binding domain of gp120 from HIV I, Biochemistry 30, 4521-4528. [114]. Robinson, A. C., Majumdar, A., Schlessman, J. L., and Garcia-Moreno, E. B. (2017) Charges in hydrophobic environments: a strategy for identifying alternative states in proteins, Biochemistry 56, 212-218. [115]. Pagel, K., Vagt, T., Kohajda, T., and Koksch, B. (2005) From alpha-helix to beta-sheet--a reversible metal ion induced peptide secondary structure switch, Org. Biomol. Chem. 3, 2500-2502.

ACS Paragon Plus Environment

46

Page 47 of 63

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

[116]. Wang, X., Bergenfeld, I., Arora, P. S., and Canary, J. W. (2012) Reversible redox reconfiguration of secondary structures in a designed peptide, Angew. Chem. Int. Ed. 51, 12099-12101. [117]. Zhang, X., and Keiderling, T. A. (2006) Lipid-induced conformational transitions of beta-lactoglobulin, Biochemistry 45, 8444-8452. [118]. Sallach, R. E., Wei, M., Biswas, N., Conticello, V. P., Lecommandoux, S., Dluhy, R. A., and Chaikof, E. L. (2006) Micelle density regulated by a reversible switch of protein secondary structure, J. Am. Chem. Soc. 128, 12014-12019. [119]. Nadeau, V. G., and Deber, C. M. (2013) Loop sequence dictates the secondary structure of a human membrane protein hairpin, Biochemistry 52, 2419-2426. [120]. Lella, M., and Mahalakshmi, R. (2013) Pro-Gly mediated conformational switch of mycobacteriophage D29 holin transmembrane domain I is lipid concentration driven, Chem. Commun. 49, 9594-9596. [121]. Inda, M. E., Vandenbranden, M., Fernandez, A., de Mendoza, D., Ruysschaert, J. M., and Cybulski, L. E. (2014) A lipid-mediated conformational switch modulates the thermosensing activity of DesK, Proc. Natl. Acad. Sci. U. S. A. 111, 3579-3584. [122]. Zamora-Carreras, H., Maestro, B., Strandberg, E., Ulrich, A. S., Sanz, J. M., and Jimenez, M. A. (2015) Micelle-triggered beta-hairpin to alpha-helix transition in a 14residue peptide from a choline-binding repeat of the pneumococcal autolysin LytA, Chem. Eur. J. 21, 8076-8089. [123]. Lella, M., Kamilla, S., Jain, V., and Mahalakshmi, R. (2016) Molecular mechanism of holin transmembrane domain I in pore formation and bacterial cell death, ACS Chem. Biol. 11, 910-920. [124]. Lella, M., and Mahalakshmi, R. (2016) Engineering a transmembrane nanopore ion channel from a membrane breaker peptide, J. Phys. Chem. Lett. 7, 2298-2303. [125]. Zhou, A., Abu-Baker, S., Sahu, I. D., Liu, L., McCarrick, R. M., Dabney-Smith, C., and Lorigan, G. A. (2012) Determining alpha-helical and beta-sheet secondary structures via pulsed electron spin resonance spectroscopy, Biochemistry 51, 7417-7419. [126]. Dado, G. P., and Gellman, S. H. (1993) Redox control of secondary structure in a designed peptide, J. Am. Chem. Soc. 115, 12609-12610. [127]. Yassine, W., Taib, N., Federman, S., Milochau, A., Castano, S., Sbi, W., Manigand, C., Laguerre, M., Desbat, B., Oda, R., and Lang, J. (2009) Reversible transition between alpha-helix and beta-sheet conformation of a transmembrane domain, Biochim. Biophys. Acta 1788, 1722-1730. [128]. Fu, L., Liu, J., and Yan, E. C. (2011) Chiral sum frequency generation spectroscopy for characterizing protein secondary structures at interfaces, J. Am. Chem. Soc. 133, 80948097. [129]. Maiti, N. C., Apetri, M. M., Zagorski, M. G., Carey, P. R., and Anderson, V. E. (2004) Raman spectroscopic characterization of secondary structure in natively unfolded proteins: alpha-synuclein, J. Am. Chem. Soc. 126, 2399-2408. [130]. Heyduk, T. (2002) Measuring protein conformational changes by FRET/LRET, Curr. Opin. Biotechnol. 13, 292-296. [131]. Singh, S., Chiu, C. C., Reddy, A. S., and de Pablo, J. J. (2013) Alpha-helix to betahairpin transition of human amylin monomer, J. Chem. Phys. 138, 155101.

ACS Paragon Plus Environment

47

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 48 of 63

[132]. Gc, J. B., Gerstman, B. S., and Chapagain, P. P. (2015) The role of the interdomain interactions on RfaH dynamics and conformational transformation, J. Phys. Chem. B 119, 12750-12759. [133]. Feng, Y., De Franceschi, G., Kahraman, A., Soste, M., Melnik, A., Boersema, P. J., de Laureto, P. P., Nikolaev, Y., Oliveira, A. P., and Picotti, P. (2014) Global analysis of protein structural changes in complex proteomes, Nat. Biotechnol. 32, 1036-1044. [134]. Kaul, R., and Balaram, P. (1999) Stereochemical control of peptide folding, Bioorg. Med. Chem. 7, 105-117. [135]. Venkatraman, J., Shankaramma, S. C., and Balaram, P. (2001) Design of folded peptides, Chem. Rev. 101, 3131-3152. [136]. Bhardwaj, G., Mulligan, V. K., Bahl, C. D., Gilmore, J. M., Harvey, P. J., Cheneval, O., Buchko, G. W., Pulavarti, S. V., Kaas, Q., Eletsky, A., Huang, P. S., Johnsen, W. A., Greisen, P. J., Rocklin, G. J., Song, Y., Linsky, T. W., Watkins, A., Rettie, S. A., Xu, X., Carter, L. P., Bonneau, R., Olson, J. M., Coutsias, E., Correnti, C. E., Szyperski, T., Craik, D. J., and Baker, D. (2016) Accurate de novo design of hyperstable constrained peptides, Nature 538, 329-335. [137]. Kammerer, R. A., Kostrewa, D., Zurdo, J., Detken, A., Garcia-Echeverria, C., Green, J. D., Muller, S. A., Meier, B. H., Winkler, F. K., Dobson, C. M., and Steinmetz, M. O. (2004) Exploring amyloid formation by a de novo design, Proc. Natl. Acad. Sci. U. S. A. 101, 4435-4440. [138]. Awasthi, S. K., Shankaramma, S. C., Raghothama, S., and Balaram, P. (2001) Solventinduced beta-hairpin to helix conformational transition in a designed peptide, Biopolymers 58, 465-476. [139]. Young, R. (2013) Phage lysis: do we have the hole story yet?, Curr. Opin. Microbiol. 16, 790-797. [140]. Wang, I. N., Smith, D. L., and Young, R. (2000) Holins: the protein clocks of bacteriophage infections, Annu. Rev. Microbiol. 54, 799-825. [141]. Kamilla, S., and Jain, V. (2016) Mycobacteriophage D29 holin C-terminal region functionally assists in holin aggregation and bacterial cell death, FEBS J. 283, 173-190. [142]. Lella, M., and Mahalakshmi, R. (2017) Solvation driven conformational transitions in the second transmembrane domain of mycobacteriophage holin, Biopolymers 108, In Press DOI: 10.1002/bip.22894. [143]. Fernandez-Tornero, C., Lopez, R., Garcia, E., Gimenez-Gallego, G., and Romero, A. (2001) A novel solenoid fold in the cell wall anchoring domain of the pneumococcal virulence factor LytA, Nat. Struct. Biol. 8, 1020-1024. [144]. Maestro, B., Santiveri, C. M., Jimenez, M. A., and Sanz, J. M. (2011) Structural autonomy of a beta-hairpin peptide derived from the pneumococcal choline-binding protein LytA, Proteing Eng. Des. Sel. 24, 113-122.

ACS Paragon Plus Environment

48

Page 49 of 63

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

For Table of Contents Use Only

Metamorphic Proteins: Emergence of dual protein folds from one primary sequence Muralikrishna Lella1,* and Radhakrishnan Mahalakshmi1,*

ACS Paragon Plus Environment

49

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Figure 1: Reply to the Paracelsus challenge.

ACS Paragon Plus Environment

Page 50 of 63

Page 51 of 63

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

Figure 2: Evolutionary fold change in the Cro family of transcription factors.

ACS Paragon Plus Environment

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Figure 3: Conformational flips observed in the transcription factor RfaH.

ACS Paragon Plus Environment

Page 52 of 63

Page 53 of 63

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

Figure 4: Fold change with point mutation.

ACS Paragon Plus Environment

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Figure 5: Sequential mutagenesis of GA and GB domains.

ACS Paragon Plus Environment

Page 54 of 63

Page 55 of 63

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

Figure 6: Dynamic equilibrium between monomer and dimer with two different folds in Ltn/XCL1.

ACS Paragon Plus Environment

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Figure 7: Redox-controlled dimerization of CLIC showing the conformational switch in the N-terminal domain.

ACS Paragon Plus Environment

Page 56 of 63

Page 57 of 63

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

Figure 8: Concentration-dependent oligomerization and structural transition of selecase.

ACS Paragon Plus Environment

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Figure 9: Pore-forming toxins undergo structural transitions upon inserting into the membrane.

ACS Paragon Plus Environment

Page 58 of 63

Page 59 of 63

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

Figure 10: Phytochrome photo-interconversion between dark-adopted red-light-absorbing Pr state (left) and the light activated far-red light absorbing Pfr state (right).

ACS Paragon Plus Environment

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Figure 11: Micelle- triggered conformation change in TM1 (A) and LytA239-252 (B).

ACS Paragon Plus Environment

Page 60 of 63

Page 61 of 63

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

Figure 12: Summary of intrinsic and extrinsic factors responsible for metamorphic behavior of proteins.

ACS Paragon Plus Environment

Biochemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

TOC Graphic 30x26mm (300 x 300 DPI)

ACS Paragon Plus Environment

Page 62 of 63

Page 63 of 63

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biochemistry

Suggested graphical image for the Cover Page

ACS Paragon Plus Environment