Navigating the structural landscape of de novo α-helical bundles

31 mins ago - ACS Editors' Choice: Catalysis Enabled by Plasma Activation of Strong Chemical Bonds. This week: Catalysis enabled by plasma activation ...
0 downloads 0 Views 1MB Size
Subscriber access provided by UNIV OF LOUISIANA

Article

Navigating the structural landscape of de novo #-helical bundles Guto G. Rhys, Christopher W Wood, Joseph L. Beesley, Nathan R. Zaccai, Antony Burton, R Leo Brady, Andrew R. Thomson, and Derek N. Woolfson J. Am. Chem. Soc., Just Accepted Manuscript • DOI: 10.1021/jacs.8b13354 • Publication Date (Web): 08 May 2019 Downloaded from http://pubs.acs.org on May 8, 2019

Just Accepted “Just Accepted” manuscripts have been peer-reviewed and accepted for publication. They are posted online prior to technical editing, formatting for publication and author proofing. The American Chemical Society provides “Just Accepted” as a service to the research community to expedite the dissemination of scientific material as soon as possible after acceptance. “Just Accepted” manuscripts appear in full in PDF format accompanied by an HTML abstract. “Just Accepted” manuscripts have been fully peer reviewed, but should not be considered the official version of record. They are citable by the Digital Object Identifier (DOI®). “Just Accepted” is an optional service offered to authors. Therefore, the “Just Accepted” Web site may not include all articles that will be published in the journal. After a manuscript is technically edited and formatted, it will be removed from the “Just Accepted” Web site and published as an ASAP article. Note that technical editing may introduce minor changes to the manuscript text and/or graphics which could affect content, and all legal disclaimers and ethical guidelines that apply to the journal pertain. ACS cannot be held responsible for errors or consequences arising from the use of information contained in these “Just Accepted” manuscripts.

is published by the American Chemical Society. 1155 Sixteenth Street N.W., Washington, DC 20036 Published by American Chemical Society. Copyright © American Chemical Society. However, no copyright claim is made to original U.S. Government works, or works produced by employees of any Commonwealth realm Crown government in the course of their duties.

Page 1 of 15 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Journal of the American Chemical Society

Navigating the structural landscape of de novo helical bundles Guto G. Rhys,1, ‡ Christopher W. Wood,1, ‡ Joseph L. Beesley,1 Nathan R. Zaccai,2 Antony J. Burton,1,4 R. Leo Brady,2 Andrew R. Thomson,1,3 Derek N. Woolfson,1,2,5* 1School

of Chemistry, University of Bristol, Cantock’s Close, Bristol BS8 1TS, United Kingdom of Biochemistry, University of Bristol, Medical Sciences Building, University Walk, Bristol BS8 1TD, United Kingdom 3School of Chemistry, University of Glasgow, Glasgow G12 8QQ, United Kingdom 2School

4Frick

Chemistry Laboratory, Princeton University, Princeton, NJ 08544, USA

5BrisSynBio,

University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, United Kingdom

ABSTRACT: The association of amphipathic  helices in water leads to -helical-bundle protein structures. However, the driving force for this—the hydrophobic effect—is not specific and does not define the number or the orientation of helices in the associated state. Rather, this is achieved through deeper sequence-to-structure relationships, which are increasingly being discerned. For example, for one structurally extreme but nevertheless ubiquitous class of bundle—the -helical coiled coils—relationships have been established that discriminate between all-parallel dimers, trimers and tetramers. Association states above this are known, as are antiparallel and mixed arrangements of the helices. However, these alternative states are less-well understood. Here, we describe a synthetic-peptide system that switches between parallel hexamers and various updown-up-down tetramers in response to single-amino-acid changes and solution conditions. The main accessible states of each peptide variant are characterized fully in solution and, in most cases, to high resolution with X-ray crystal structures. Analysis and inspection of these structures helps rationalize the different states formed. This navigation of the structural landscape of -helical coiled coils above the dimers and trimers that dominate in nature has allowed us to design rationally a well-defined and hyperstable antiparallel coiled-coil tetramer (apCC-Tet). This robust de novo protein provides another scaffold for further structural and functional designs in protein engineering and synthetic biology.

INTRODUCTION In nature, bundles of four  helices are common in protein structures and assemblies. These four-helix bundles perform a wide variety of functions including: acting as protein hormones and cytokines;1,2 providing scaffolds for metal- and co-factor-binding to facilitate storage, redox and enzymatic functions;3–5 cementing protein-protein interactions that direct DNA binding, membrane fusion and other processes;6–8 serving as building blocks for viral capsids;9 and spanning membranes to perform signal transduction10 and transport functions.11 As such, four-helix bundles have become key targets and scaffolds for de novo protein design.12–18 Usually, these comprise amphipathic  helices. These helices assemble via their hydrophobic faces to form bundles with the hydrophobic side chains buried in consolidated cores. The associations can be of individual helices to form tetramers, of helical hairpins to give dimers, or intramolecularly within the same protein chain.19 However, and particularly for the intermolecular cases, alternative helical arrangements (e.g., all parallel, up-down antiparallel) and even other oligomeric states are possible. Therefore, the specification of helix-helix interactions that direct towards a specified structure and away from unwanted alternatives is critical

for successful design; these two aspects are known as positive and negative design, respectively. Ultimately, what is required are clear sequence-to-structure relationships and/or robust computational methods to augment the amphipathic -helical sequences and specify a target fourhelix bundle. Here we explore the structural plasticity between all-parallel and up-down-up-down antiparallel four-helix coiled-coil structures and some of the other competing states. Four-helix bundles are one of the earliest examples of a recognized protein-structure motif.20 Since then, structures of many four-helix bundles have enabled analyses of features that define the fold. Weber and Salemme expand on previous work, including Crick’s model for packing in helical coiled coils,21 to show that structural similarities between four-helix bundles are a consequence of basic physical properties of the component helices.22 Presnell and Cohen present further examples of four-helix bundles using computational screening and visual inspection of over 300 protein structures.23 This screening identifies 20 putative structures based on buried surface area, but when the 20 are examined visually, some are excluded as they “do not appear to be well packed”, with helix crossing angles diverging from extant four-helix bundles. These examples of

ACS Paragon Plus Environment

Journal of the American Chemical Society 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

true four-helix bundles provide the basis for categorizing the fold by helix-crossing angles, with six categories emerging: square, splinter, x, unicornate, bicornate and splayed.24 The square class contains the previously identified four-helix bundles, where all helices are aligned, while the other classes have packing more similar to previously identified -helical globules.25 The main differentiating factor between these structures is the number and regularity of interactions between helices.

Indeed, there is spectrum of side-chain-mediated helix-tohelix packing in four-helix bundles from few non-specific interactions to repeated regular interactions (Figure 1A).26 Short helices (< 14 residues) tend to be more free to associate in aligned or orthogonal configurations, resulting in a wide range of bundles.25 Whereas, longer helices (>= 14

Page 2 of 15

residues) favor aligned arrangements, packing optimally as extended bundles with regular and repeating interactions.27 Extended bundles can be categorized further based on the predominant mode of helix packing, from less-specific ridges-into-grooves (RIG) to intimate knobs-into-holes (KIH) interactions.21,28,29 These packing modes are characteristics of globular and coiled-coil domains, respectively. Given their ubiquity and this level of understanding, fourhelix bundles offer a fertile ground for the design and engineering of new proteins. DeGrado has pioneered this area:30 initially, designing small synthetic peptides of leucine, glutamate and lysine to create self-assembling amphipathic helices,31 although these were later revealed to form larger globular bundles;32 then joining these together with proline-and-arginine-based loops to generate hairpins that dimerize;33–36 and expressing single-chain four-helix bundles that are monomeric and hyperstable.19 Contemporaneously, but using a larger amino-acid palette, Richardson and co-workers designed a monomeric “nativelike” four-helix protein.37 These bottom-up approaches have led to sophisticated de novo four-helix bundles with ion-transportation and cofactor-binding properties.38,39 For instance, Dutton and co-workers have also built four-helix bundle maquettes that incorporate heme to bind and transport oxygen.40–42 Similar scaffolds have been developed by others to introduce light-harvesting and enzyme-like functions.43–45 Hecht and co-workers have taken a different approach to designing four-helix bundles.14 This pioneers the use of binary patterns of hydrophobic (h) and polar (p) residues to define amphipathic helices linked into single chains by intervening turns.46 Rather than specifying the sequences any further, however, redundant codons are used to place multiple different residues at the h and p sites; i.e., sequence libraries are generated to be compatible with the overall four-helix-bundle fold. Moreover, rather than actively selecting from or evolving these libraries, proteins that survive or operate in cells are picked out passively. In this way, the group have achieved stably folded and structured de novo four-helix bundles.47,48 In turn, these have been endowed with functions such as heme binding,49 abilities to substitute for deleted endogenous proteins,50,51 smallmolecule binding,52 reducing copper toxicity,53 and, impressively, a de novo protein that catalyzes a lifesustaining reaction in a microorganism.54 With some exceptions,55 these de novo four-helix bundles tend to have sequences that promote RIG packing of helices. Thus, while structural data is limited, it is likely that most de novo four-helix bundles have irregular side-chain packing, falling in the middle of the spectrum of Figure 1A. For moreintimate KIH packing and coiled-coil formation, two general features are required: (i) heptad repeats of h and p residues, hpphppp (usually denoted abcdefg), or related repeats with

Figure 1. Four-helix bundles adopt a range of topologies with irregular to highly regular packing. From left to right, the structures are for a region of 3-isopropylmalate dehydrogenase (PDB ID code 1IPD) in the x class with only pairs of helices aligned; apolipoprotein E3 (PDB ID code 1LPE) in the square class, where all are helices aligned and interact via ridges-into-grooves packing; and CC-Tet (PDB ID code 3R4A) also in the square class but with knobsinto-holes helix-helix interactions. (B) The HAMP domain of TSR/AF1503, which forms an Alacoil (left and top right, PDB ID code 3ZX6), with the corresponding helical wheel showing helix packing (bottom right). (C) AACS designed Paragon Plus Environment hexameric coiled coil, CC-Hex (left and top right), also with a helical wheel of helix packing (bottom right).

Page 3 of 15 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Journal of the American Chemical Society

Table 1. Sequences and summary of biophysical data for CC-Hex*-L24 point mutants. Peptide

Sequence and Register

Helix2 (%)

gabcdefgabcdefgabcdefgabcdef

SV3 (mass/

SE4 (mass/

monom er mass)

monom er mass) 6.0

CC-Hex1

Ac-GELKAIAQELKAIAKELKAIAWELKAIAQGAG-NH2

82

-

CC-Hex*-L24D

Ac-GELKAIAQELKAIAKELKAIAWEDKAIAQG-NH2

10

1.6 4.6

&

CC-Hex*-L24E

Ac-GELKAIAQELKAIAKELKAIAWEEKAIAQG-NH2

15

2.0 4.5

&

CC-Hex*-L24H

Ac-GELKAIAQELKAIAKELKAIAWEHKAIAQG-NH2

55

CC-Hex*-L24K

Ac-GELKAIAQELKAIAKELKAIAWEKKAIAQG-NH2

CC-Hex*-L24Dab CC-Hex*-L24Nle

2.1

XRD5 oligomeric state

6 (pH 7.5) 4 (pH 8.5)

2.2

4 (pH 8.5) & 6 (pH 6)

4.2

4.9

4 (pH 6.5)

41

4.3

4.7

4 (pH 8)

Ac-GELKAIAQELKAIAKELKAIAWEΓKAIAQG-NH2

30

4.7

4.5

Ac-GELKAIAQELKAIAKELKAIAWEΔKAIAQG-NH2

91

5.0 & 12.0

6.2

4 (pH 6.5) 6 (pH 6.5)

Complete biophysical analysis can be found in the supplementary information. Single letter codes for L-1,4-diaminobutyruc acid and L-norleucine are Γ and Δ, respectively. Bold letters highlight point mutations in the CC-Hex* sequence. 1Data for CC-Hex from a previous publication.58 2Fraction helix determined from the mean residue ellipticity at 222 nm in circular dichroism experiments. Conditions: 50 M peptide concentration, phosphate-buffered saline (PBS) solution, pH 7.4 and 5 C. 3Sedimentation-velocity analytical ultracentrifugation. Conditions: 150 M peptide concentration, PBS solution, pH 7.4. 4Sedimentation-equilibrium analytical ultracentrifugation. Conditions: 70 M peptide concentration, PBS solution, pH 7.4. 5X-ray crystallography oligomeric state of the biological unit. The hexamers are parallel -helical barrels and the tetramers are offset antiparallel -helical bundles.

3, 4 spacings of h residues;56 and (ii) specific combinations of predominantly aliphatic hydrophobic residues at the a and d sites.57 Designed four-helix bundles that are coiled coils lie far to the right of the spectrum of Figure 1A. Early designs in this area elucidated the rules for the formation of parallel four-helix bundles and also for parallel dimers and trimers.59,60 These rules have been embellished and used in rational de novo design of many parallel coiled coils.57,61 Previously, we have described the rational design and complete characterization through to X-ray protein crystal structures of a basis set of all-parallel dimeric, trimeric and tetrameric coiled coils.62 Serendipitously, a simple permutation to the sequence repeat of the tetramer, CC-Tet, produces an entirely new coiled-coil assembly, namely an all-parallel hexamer, CC-Hex.58 This has a central accessible channel.63 Therefore, it is an -helical barrel. The formation of the structure can be rationalized as the mutations to CCTet expanded the hydrophobic surfaces of the component helices allowing more of them to associate. Moreover, the introduction of complementary charged aspartic acid (Asp, D) and histidine (His, H) residues at the core position Leu24 in the otherwise hydrophobic central channel of CC-Hex to render two peptides—CC-Hex-D24 and CC-Hex-H24—that complement to form a parallel A3B3-type heterohexamer. However, through a number of unpublished studies, we find that the CC-Hex scaffold is not completely robust and its oligomer state changes with other polar mutations. Subsequently, we have developed parametric computational design to deliver a series of -helical barrels, including pentamers, new hexamers and a heptamer.64

These -helical barrels are more-robust to mutation and serve as platforms for rational design to introduce new functions.65,66 Herein, we return to the mutants of CC-Hex and explore structural plasticity in the coiled-coil structural landscape. This must be navigated to deliver robust de novo designs. Specifically, we describe a hitherto unexplored sequence with glutamic acid (Glu, E) at position 24. This assembles as a stable parallel hexamer at low pH, and as an up-down-updown antiparallel tetrameric coiled coil near neutral pH. Moreover, substituting positively charged lysine (Lys, K) or 2,4-diaminobutyric acid (Dab) residues at position 24 gives the antiparallel tetramer. Thus, this study shows that the CC-Hex scaffold is far less mutable than previously believed. We use the X-ray crystal structures for the new state to guide the rational design of a robust and hyperstable antiparallel tetramer, apCC-Tet, which we characterize fully. This required consideration of both the composition of the hydrophobic core and of interhelical salt-bridging in en bloc mutations of the original CC-Hex sequence. apCC-Tet provides an additional de novo protein fold that could be of use in protein design and engineering, materials science, and synthetic biology.

RESULTS AND DISCUSSION Polar mutations at core sites of CC-Hex cause structural switches in solution. To probe how robust CC-Hex-based sequences were to forming -helical barrels, we synthesized variants of CC-

ACS Paragon Plus Environment

Journal of the American Chemical Society 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Hex at position 24. This is an a position of the heptad repeat and, therefore, contributes to the cores of dimeric, trimeric and tetrameric coiled coils, and it points towards the lumens of -helical barrels. We tested additional charged side chains at the site, which we anticipated would be destabilizing. The variants included: negatively charged side chains Asp and Glu, to give L24D and L24E; positively charged side chains His, Lys and L-2,4-diaminobutyric acid (Dab), to giveL24H, L24Dab and L24K; and the non-polar side chain norleucine (Nle), L24Nle (Table 1). N.B., These and similar mutants described herein were made in a modified CC-Hex sequence, CC-Hex*, lacking the C-terminal AlaGly dipeptide, which was included originally as a mass tag. Hence the two nomenclatures used. For simplicity, hereafter mutations at position 24 in the CC-Hex* background are referred to as L24D, L24E etc. Circular dichroism (CD) spectroscopy of L24D and L24E at pH 7.4 showed that the peptides had reduced helicities (Figure 2A) compared to the parent CC-Hex.58 Analytical ultracentrifugation (AUC) experiments indicated that both variants formed a mixture of dimeric and tetrameric species rather than hexamers (Figures S6.1,S6.2; Table 1). For a given concentration, L24E was more -helical than L24D at neutral pH. Therefore, L24E was chosen for further biophysical characterization. Specifically, it was interrogated by CD spectroscopy and AUC over the pH range 3 – 7 (Figure 2B-D). At pH 3 and 4, the peptide was highly stable and did not fully unfold even by 90 °C. At pH 5 there was no significant loss of  helicity, but the peptide showed the start of a thermal unfolding transition. At pH 6 there was

≈30% drop in helicity and the peptide underwent full thermal denaturation. pH 7 saw a further drop in helicity and the peptide showed signs of cold denaturation. (N.B. This cold denaturation was typical of mutants later confirmed as tetramers (Figures S4.1–4.4,S4.6)). Sedimentation-equilibrium AUC experiments revealed that from pH 3 – 7.4 L24E switched from hexameric to dimeric

Page 4 of 15

species (Figure 2D), whereas sedimentation-velocity at pH 4 indicated multiple species (Figure S6.13). At pH 7.4, increasing the peptide concentration above 150 M increased  helicity and shifted the equilibrium towards tetramers (Figures S5.1,S6.12). To test if the tetramer resulted from charged residues at core sites generally, the positively charged variants, L24Dab, L24K and L24H (N.B. His has a pKa of 6.8) and the L24nLeu control were examined.67 Again, CD spectroscopy at pH 7.4 showed that L24H, L24K and L24Dab were significantly less -helical than the parent (Figures S4.34.4,S4.6). By SE AUC, L24K, L24Dab and L24H had weights of ~5 x monomer mass (Figures S6.3-6.4,S6.6). By contrast, L24Nle was highly folded and hexameric in solution at pH 7.4; although an additional higher-order species was observed by sedimentation-velocity experiments accounting for 35% of the sample (Figures S4.5,S6.5). Thus, the CC-Hex background is not robust to charged polar mutations in its hydrophobic channel, in contrast to the more robust computationally designed -helical barrels, including a hexamer.65 Moreover, at pH values where these residues are likely charged the structures are less helical, of lower thermal stability, and form lower oligomeric states. X-ray crystal structures reveal a broader accessible structural landscape. To add to the previously documented hexameric structures of CC-Hex-D24 and CC-Hex-H24 (PDB ID codes 3R46 and 3R47), we obtained X-ray crystal structures for L24E, L24D, L24Dab, L24K, L24H and L24Nle. L24E crystallized in two forms that gave different structures: a hexameric bluntended barrel and an antiparallel tetramer (Figure 3A,C, respectively). The crystallization conditions for the two states were markedly different. The tetramer was only observed at pH 8.5, while the hexamer crystallized from several conditions from pH 5 – 6.5, as well from unbuffered solutions where the pH was low due to residual trifluoroacetic acid from peptide synthesis (pH < 2). In the studies reported herein, L24D and L24H only crystallized as the antiparallel tetramer. However, as mentioned above, previously, we crystallized CC-Hex-D24 and CC-Hex-H24 from unbuffered solution and at pH 7.5, respectively.58 With the amine side chains L24K and L24Dab only crystallized as antiparallel tetramers, while L24Nle only crystallized as the parallel hexamer (Figure 3B); these structures were obtained in the pH range 6.5 – 8 (Table S2). For these three sequences, our inability to obtain other crystal forms does not necessarily mean that other states are inaccessible. Nevertheless, the solution-phase biophysical data corroborate the observed crystal structures (Table 1). Comparing the low-pH hexameric structures of CC-Hex-D24 and CC-Hex*-L24E revealed the shorter aspartic acid could be accommodated within the core of a completely folded structure; whereas, glutamic acid could not be fully accommodated, as two of the six peptide chains frayed at the C terminus allowing the glutamates to extend into solvent (Figure 3A).

Figure 2. Solution-phase biophysical data for the CC-Hex* variants L24D and L24E. (A) Circular dichroism (CD) spectra for both peptides at 5 °C, 200 µM concentration, in phosphatebuffered saline (PBS) solution at pH 7.4. (B) CD spectra of L24E at 5°C, 150 µM concentration in acidic to neutral conditions. (C) CD thermal denaturation profiles of L24E from 5–90 °C in acidic to neutral conditions. (D) Molecular weight of L24E in acidic to ACS Paragon Plus Environment neutral conditions determined by sedimentation-equilibrium AUC experiments. Error bars show standard deviation (n=3).

Page 5 of 15 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Journal of the American Chemical Society

Table 2. Sequences and summary of biophysical data for CC-Hex* en bloc mutants and apCC-Tet. Peptide

Sequence and Register

Helix1 (%)

SV2 (mass/

SE3 (mass/

monomer mass)

monomer mass)

38

-5

-5

4

61

4.6

4.4

4

69

6.0

5.6

6

gabcdefgabcdefgabcdefgabcdef CC-Hex*-KgEb CC-Hex*-LL CC-Hex*-II

Ac-GKLEAIAQKLEAIAKKLEAIAWKLEAIAQG-NH2 Ac-GELKALAQELKALAKELKALAWELKALAQG-NH2 Ac-GEIKAIAQEIKAIAKEIKAIAWEIKAIAQG-NH2

XRD4 oligomeric state

CC-Hex*-LL-KgEb

Ac-GKLEALAQKLEALAKKLEALAWKLEALAQG-NH2

62

3.95

4.05

4

apCC-Tet

Ac-GELEALAQELEALAKKLKALAWKLKALAQG-NH2

86

4.0

4.1

4

Complete biophysical analysis can be found in the supplementary information. 1Fraction helix determined from the mean residue ellipticity at 222 nm in circular dichroism experiments. Bold letters highlight point mutations in the CC-Hex* sequence. For clarity CC-Hex*-LL-KgEb is not highlighted. Conditions: 10 M peptide concentration, phosphate buffered saline (PBS) solution, pH 7.4 and 5 C. 2Sedimentation-velocity analytical ultracentrifugation. Conditions: 150 M peptide concentration, PBS solution, pH 7.4. 3Sedimentation-equilibrium analytical ultracentrifugation. Conditions: 70 M peptide concentration, PBS solution, pH 7.4. 4Xray crystallography oligomeric state of the biological unit. CC-Hex*-II forms a hexameric collapsed complex coiled coil and all others form tetrameric antiparallel -helical bundles. 5CC-Hex*-KgEb and CC-Hex*-LL-KgEb aggregate at 10 M and 40 M peptide concentration, respectively, when span at 3,000 rpm. Data was collected for CC-Hex*-LL-KgEb at 35 M peptide concentration.

Turning to the structures determined near neutral pH, L24D and L24E plus those for L24Dab, L24H and L24K are closely similar antiparallel four-helix coiled coils with backbone and all-atom RMSDs for residues 1 – 23 across of the whole set of 0.44 Å ± 0.12 Å and 1.13 Å ± 0.44 Å, respectively (Figure 3C). They have staggered rather than blunt-end arrangements of the helices, and the helices are frayed or disordered after the polar 24th residue. As a result, the cores are exclusively hydrophobic. These cores have contributions from residues at a (Leu), d (Ile) and e (Ala) sites of the heptad repeats. This gives two distinct helixhelix interfaces: two “wide faces” centered on d = Ile (Figure 3C left) and two “narrow faces” centered on e = Ala (Figure 3C center). The Leu residues at a are directed towards central long axis of the bundle (Figure 3C right) reminiscent of complementary x-da layers observed in Alacoils (Figure 1B).10 In other words, the structures have oblate crosssections. The two interfaces are flanked by pairs of the same amino acid from neighboring helices; i.e., b:b and g:g, respectively. As b = Lys and g = Glu, the narrow and wide faces present seams of positive and negative charge, respectively. Thus, the formation of the hydrophobic core overrides (i) the complete folding of each chain into  helices, and (ii) potential electrostatic repulsion between like residues at b and g. Structural searches of PDBefold and CAME topsearch using the L24E tetramer identified the following: HAMP proteins (PDB ID code 3ZX6; C RMSD = 1.2 Å); variants of the GCN4p1 peptides (PDB ID code 1W5J; C RMSD = 1.3 Å); the ROP proteins (PDB ID code 1QX8; C RMSD = 1.6 Å); Tet (PDB ID code 6C52; C RMSD = 2.1 Å), a non-aggregating variant of a de novo designed cross- amyloid-like structure; and diZn(II)- DF3l (PDB ID code 2KIK; C RMSD = 2.3 Å), an artificial protein designed for phenol oxidase activity. While

all structures are antiparallel tetrameric bundles, only the HAMP proteins contain regions with oblate cross-section due to regular Alacoil. The L24 variants are also structurally similar to a GCN4 leucine-zipper peptide mutant from Kallenbach and Lu (PDB ID code 2R2V), which crystallizes as an oblate antiparallel tetramer with wide interfaces centered on d = Val and Alacoil interfaces centered on e =

Figure 3. X-ray crystal structures of CC-Hex* point mutants. (A) The hexameric crystal form of L24E. (B) Slice through the structure of L24Nle viewed from the N termini. (C) The overlaid backbones of the tetrameric forms of L24E, L24D, L24Dab, L24K and L24H with the side chains of L24E displayed for reference. The chains are colored in rainbow spectrum from the N termini (blue) to the C termini (red)

ACS Paragon Plus Environment

Journal of the American Chemical Society 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Ala. 68 Furthermore, the L24 variants resemble HexCoil-Ala (PDB ID code 3S0R), a de novo designed antiparallel tetramer with a core of a, d and g residues where the Alacoil interfaces center on g = Ala.69 Interestingly, this different core results in topologies with opposite handedness. Blunt-ended and fully folded tetramers can be redesigned from the CC-Hex* sequence. The above structures of the polar mutants at position 24 of the CC-Hex* sequence demonstrate that another coiled-coil topology—i.e., an up-down-up-down antiparallel tetramer—is accessible to our basis-set of de novo designs.62,64,70 However, the tetramers described thus far are not fully folded. To explore how the fold might be optimized, we synthesized en bloc mutants of CC-Hex* at the a & d and g & b sites (Table 2). Core positions have a significant effect on specificity of oligomeric state and helix orientation of associating coiled-coil peptides.59,62,71 Surprisingly, in the heptad repeat of CC-Hex charged Glu residues occupy core g positions. To test how residues at these positions alter specificity, we swapped the order of the charge residues at the g and b sites. In CC-Hex*, the gf repeat is ELKAIAx. Therefore, the swap had the sequence KLEAIAx, which we refer to as CC-Hex*-KgEb. Unexpectedly, this peptide was only soluble up to 40 M concentration in phosphate buffered saline (PBS). It was 38%  helical but could not by analyzed by AUC due to aggregation. Nevertheless, we were able to obtain an X-ray crystal structure of the charge-swapped variant in the original CC-Hex background (CC-Hex-KgEb) that revealed an antiparallel tetramer (Figure 4A). Unlike the L24 variants, this does not have shifted antiparallel helices and, consequently, it is a blunt-ended tetramer. Thus, swapping the potential salt-bridge positions disrupts the coiled-coil assembly. Analysis of KIH interactions in the CC-Hex structure revealed Glu at g as knob residues, with the ethyl unit (C and C) packing into the interface and the carboxylic acid presented at the surface. By contrast, the Lys

Figure 4. X-ray crystal structures of CC-Hex* en bloc mutants. (A, B) CC-Hex-KgEb (orange) and CC-Hex*-LL (green) form antiparallel tetramers. (C) CC-Hex*-II (blue) forms a collapsed parallel hexamer.

Page 6 of 15

side chains at b extend fully into solution. Given this, it is surprising that the swap was not tolerated: it is reasonable to expect that the methylene groups of Lys should pack at g, and the Glu at b should be solvent accessible. We speculate that the entropic cost of reducing the conformational freedom of Lys side chains at g disfavors hexamer formation. Next, we turned to the en bloc hydrophobic mutations at a and d positions of CC-Hex*. One of these mutants has already been reported: swapping the Leu at a and Ile at d results in the formation of a slipped hexameric barrel, CCHex*-IL (PDB ID code 4H8G).72 For the study presented herein, we made the two other permutants with either all Ile or all Leu at both a and d, i.e., CC-Hex*-II and CC-Hex*-LL, respectively. CC-Hex*-II was -helical and completely unfolded upon heating, whereas CC-Hex*-LL was highly  helical and hyperthermal stable (Figure S4.7). CC-Hex*-LL showed the start of a thermal unfolding transition at ≈75 °C and, on cooling, signs of thermal annealing. AUC experiments indicated that CC-Hex*-II was hexameric in solution, whereas CC-Hex*-LL formed species of lower mass (Table 2). X-ray crystal structures of CC-Hex*-II and CC-Hex*-LL were determined (Figure 4B,C). The former revealed a collapsed hexameric coiled coil. This is a complex coiled coil with multiple unique helical environments in the homomeric assembly. Recently, we have described similar homomeric structures in which symmetry is broken.73 Albeit in a different e/g background, we argue that -branched residues at the a/d sites promote barrel structures.73 Clearly, this is not the case for the new peptides described here: the introduction of additional Ile residues to give CCHex*-II results in a structure with a consolidated core. This demonstrates that residues peripheral to core sites also contribute to the final structure that is adopted. Previously, we solved an alternative structure for a variant of the CCHex*-II peptide, which is a parallel tetramer (PDB ID code 4H7R). The crystal structure for CC-Hex*-LL also revealed an antiparallel tetramer. As with CC-Hex-KgEb, the peptide chains are fully helical. However, the backbones of these two structures could not be aligned (Figure 4A,B): the narrow Alacoil interface of CC-Hex*-LL is offset longitudinally to a lesser extent than in CC-Hex-KgEb resulting in more-flushed termini. N.B. All of the CC-Hex* variants reported here and structurally characterized by X-ray crystallography were confirmed to be coiled coils by SOCKET analysis (Figure S8.1).74 These results demonstrate further the complexity of the self-association landscape of CC-Hex-based peptides. And that alternative states can be accessed by changes in solution conditions and/or small changes to the sequence. Nonetheless, they also suggest ways in which structures within this landscape can be targeted, which we illustrate below. Designing apCC-Tet, an optimized antiparallel coiledcoil tetramer. From the above, we chose CC-Hex*-LL as the best starting point for the core in a rational design of a fully folded, stable antiparallel homotetramer. In addition, we swapped the

ACS Paragon Plus Environment

Page 7 of 15 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Journal of the American Chemical Society

Figure 5. (A) Circular dichroism (CD) spectra at 5 °C and (B) thermal unfolding of apCC-Tet (orange) and CC-Hex*-LL (blue). Thermal unfolding of CC-Hex*-LL was performed after thermally annealing the sample. Conditions: 10 µM peptide, PBS. (C) Sedimentation-velocity AUC of apCC-Tet (orange) and CC-Hex*-LL (blue). Fits return masses of 4.0 and 4.6 x monomer mass, respectively. Conditions: 150 µM peptide, PBS. (D) Sedimentation-equilibrium AUC of apCC-Tet (blue, 24k rpm; green, 32k rpm; and red, 40k rpm). Fitting to the experimental data returned a weight of 4.1 x monomer mass. Conditions: 70 µM peptide concentration, PBS. (E,F) X-ray crystal structure of apCC-Tet colored from N (blue) to C terminus (red). Side chains of the g and b residues and core Leu residues are shown as sticks. (G) Backbone comparison of apCC-Tet (orange) and CC-Hex*-LL (blue).

charged residues at g and b positions, as this also favors antiparallel tetramer over hexamer. This gave CC-Hex*-LLKgEb (Table 2). Compared with CC-Hex*-LL, CC-Hex*-LLKgEb gave a sharp single peak in the c(s) distribution in sedimentation-velocity AUC. Sedimentation-equilibrium data and an X-ray crystal structure confirmed CC-Hex*-LLKgEb as a tetramer. However, like CC-Hex*-KgEb, the new variant aggregated in PBS above 40 M concentration. Aggregation of the tetramers could occur along the wide interface of these oblate structures: relative to the narrowinterface, the wide interfaces present a greater exposed hydrophobic surface. It is possible that peripheral lysine residues screen this interface less well than glutamate. Thus, whilst swapping the potential salt bridges helps specify the tetrameric form relative to the hexamer this is at the cost of reduced solubility. Therefore, we sought an alternate and optimized pattern of charge on the exterior of the tetramer. There are several ways to arrange Lys and Glu residues at g and b positions to disfavor parallel association of helices and favor the antiparallel alignment.75–79 For example, Glu could be placed at g and b of heptads 1 and 2, with Lys at these sites in heptads 3 and 4. This arrangement of Glu near to the N terminus and Lys near the C terminus is known to improve -helical stability.80 Retrospectively, we named this charge pattern in the CC-Hex*-LL background apCC-Tet. apCC-Tet was highly  helical at 5 °C with a fraction helicity of 86%, an increase of 25% over CC-Hex*-LL (Figure 5A). apCC-Tet was thermally stable up to 95 °C, whereas CCHex*-LL showed the start of an unfolding transition at ≈75 °C (Figure 5B). This improvement in solution-phase properties of apCC-Tet was also evident in sedimentationvelocity AUC experiments, where apCC-Tet gave a sharper

peak than CC-Hex*-LL (Figure 5C). For apCC-Tet, analyses of these data gave single discrete species with weights of 4.0 and 4.1 x monomer mass by sedimentation velocity and sedimentation equilibrium, respectively; whereas, CC-Hex*LL returned non-integer oligomeric states of 4.6 and 4.4, respectively (Figure 5C,D). Crucially, apCC-Tet did not show any signs of aggregation at 150 M peptide concentration during these experiments. Collectively, these data show that the arrangement of charged residues that should favor antiparallel helices also has improves the folding and thermal stability of apCC-Tet. The X-ray crystal structure for apCC-Tet confirmed an antiparallel coiled-coil tetramer (Figures 5D-F,S8.1). The helix geometry and interfaces are similar to CC-Hex*-LL with backbone and all-atom RMSDs of 0.56 Å and 1.58 Å between the two structures. However, only 7 of the possible 16 pairs of Glu-Lys contacts in the apCC-Tet structure are within 4 Å to form interhelical salt bridges and the sidechain densities are diffuse (e.g. the average temperature factor for the Lys N is 56.9 Å2 in comparison the all-atom average is 40.5 Å2). Therefore, despite improvements in the solution-phase behavior, the X-ray structure reveals that these are not due to altered backbone arrangements or core packing, nor are they from salt-bridge formation. We posit that the changes allow electrostatic steering in helix association and avoid uniformly charged surfaces on apCCTet. To summarize this section, we have achieved the rational design of an antiparallel homomeric coiled-coil tetramer. In the register of the heptad repeat, this has a hydrophobic core of a = d = Leu plus e = Ala, with flanking Glu-Lys pairs at g:g and b:b sites. This sequence pattern directs the assembly of a well-defined, discrete and hyperstable

ACS Paragon Plus Environment

Journal of the American Chemical Society 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

antiparallel tetramer with an up-down-up-down topology and oblate cross-section. This adds to a growing basis set of de novo coiled-coil modules for protein design and synthetic biology.

CONCLUSION CC-Hex was the first reported X-ray crystal structure of a de novo -helical barrel.58 These barrels are a growing class of both natural and designed proteins that are robust and have potential in protein engineering and synthetic biology.57,64– 66,81,82 Nonetheless, CC-Hex was discovered serendipitously: it was a variant of a de novo designed parallel coiled-coil tetramer, CC-Tet.62 Herein, we have described a series of point and en bloc changes to a modified CC-Hex sequence, CC-Hex*, which lacks the original C-terminal AlaGly mass tag. These new variants reveal that the CC-Hex background is somewhat plastic with a variety of coiled-coil states being accessible from it. These include parallel and antiparallel tetramers, and open barrel and collapsed hexamers. Moreover, the structural plasticity is evident even for some of the charge-bearing point variants: charged side chains introduced that are charged are disfavored in the parallel barrel state and, therefore, switch to the offset antiparallel tetramer state. For self-associating systems, core positions strongly specify interface type and we identify Leu at a and d, in the CC-Hex background that preferentially forms antiparallel tetramers. These empirical explorations and observations have allowed us to design de novo a new antiparallel homotetrameric coiled coil, apCC-Tet. A combination of the established positive and negative design rules resulted in a robust and fully characterized structure. This work highlights further the degrees of freedom available to self-associating peptide systems,59,60,73 and the associated challenge of designing specific oligomer states and topologies de novo.12,57,83–85 This problem is particularly acute for less-well-defined peptide-peptide interfaces where many near-isoenergetic states are accessible.12 The energy landscape for -helical coiled-coil assemblies may well be more navigable than the general case of helical bundles. This is for three interrelated reasons: First, the hallmark knobs-into-holes packing of coiled coils dramatically reduces the number of helical arrangements possible. As a result, it is relatively straightforward to enumerate many coiled-coil backbones parametrically.86–91 Second, and directly related to this structural constraint, coiled-coil sequences have relatively simple repeat patterns of hydrophobic and polar residues. Third, and as a consequence of the first two points, only certain residues appear to be tolerated in the helix-helix interfaces of coiled coils.83–85 Thus, as coiled coils have low-complexity sequences, the protein-design problem is more sharply defined for these structures and sequences compared with less-regular structures. That said, the coiled-coil energy landscape is still complex with multiple oligomer states, parallel/antiparallel/mixed arrangements of helices, and homo- and heterotypic assemblies all possible.56,92 Nonetheless, considerable progress has been made to discern sequence-to-structure relationships and to develop computational methods for coiled-coil design. These have led to robust rational and/or

Page 8 of 15

computational designs for parallel dimers through heptamers.58,62,64,81 The work presented herein adds to this effort: it illustrates how alternate states can be distinguished; and it provides guidelines for accessing de novo antiparallel structures, which have been less explored than parallel coiled coils.76–78,93 More subtly, certain polar residues are tolerated at the otherwise hydrophobic coiled-coil interfaces, and these influence partner and oligomer-state selection.83,94,95 This concept has recently been revisited.96,97 Here, we add to this showing that buried charged residues disfavor higholigomer states for alternate antiparallel tetramers in the CC-Hex background. Furthermore, for certain sequences the two states—parallel hexamer and antiparallel tetramer— are sufficiently close in energy to effect switches between them simply by changing the pH of the solution. This presents exciting prospects for developing sequences to switch gross structural state in response to facile perturbations.75,98–102 Coiled-coil peptides that show multiple-defined states have been described by others. Lizatović et al. design a pHresponsive sequence that switches between a pentameric bundle and parallel hexameric bundle.98 Grigoryan et al. present the design of carbon-nanotube solubilizing peptides, which wrap around the surfaces of the nanotubes. In isolation, one of these peptides forms a tetramer, and another forms a dimer/hexamer mixture.69 Others are also discovering that seemingly benign sequence alterations can cause gross structural changes in self-associating coiled coils. Slovic et al. describe the redesign of the membranespanning peptide phospholambin to make it water soluble. Solution-phase biophysics shows that the full-length sequence remains pentameric. However, a truncated peptide (residues 21 – 52) is best modelled as a tetramer/pentamer equilibrium in AUC and a structure reveals an offset antiparallel tetramer.103,104 Finally, Spencer et al. have made variants of parent CC-Hex with phenylalanine in the core that forms a collapsed antiparallel hexameric bundle.105,106 The specification of orientation of peptides in selfassociating systems is a standing challenge in protein design. Antiparallel coiled coils remain a challenging design target with fewer examples of successful designs compared to parallel structures. We have delivered a robust de novo antiparallel tetramer that is characterized thoroughly. This provides another designed module or component adding to the growing basis set of de novo coiled coils with potential applications in the design and assembly of protein origami,107 peptide nanotubes,72 protein colocalization,108,109 and generally as tectons for generating complex self-assembling systems.110,111

ASSOCIATED CONTENT Supporting Information The Supporting Information is available free of charge on the ACS Publications website. Supporting Figures and Tables (PDF)

Crystal-Structure Files

ACS Paragon Plus Environment

Page 9 of 15 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Journal of the American Chemical Society

The crystal structures are all available from the Protein Data Bank using the following accession codes: CC-Hex*-L24D (6Q5H), CC-Hex*-L24E_tet (6Q5I), CC-Hex*-L24E_hex (6Q5J), CC-Hex*-L24K (6Q5K), CC-Hex*-L24H (6Q5L), CC-Hex*L24DAB (6Q5M), CC-Hex*-L24Nle (6Q5N), CC-Hex*-LL (6Q5O), CC-Hex*-II (6Q5P), CC-Hex-KgEb (6Q5Q), CC-Hex*-LL-KgEb (6Q5R), apCC-Tet (6Q5S).

AUTHOR INFORMATION Corresponding Author * [email protected]

Author Contributions G.G.R., C.W.W., J.L.B., A.R.T. and D.N.W. conceived the project and designed the experiments. G.G.R., C.W.W. , J.L.B. and A.J.B. synthesized the peptides and conducted the solution-phase biophysics. G.G.R., C.W.W., J.L.B., N.R.Z., A.J.B. and R.L.B. determined the peptide X-ray crystal structures. G.G.R., C.W.W. and D.N.W. wrote the paper. All authors have read and contributed to the preparation of the manuscript.

Notes The authors declare no competing financial interest.

ACKNOWLEDGMENT G.G.R., C.W.W., J.L.B., A.R.T., A.J.B. and DNW are supported by a European Research Council Advanced Grant to DNW (340764). G.G.R. and A.J.B. thank the Bristol Chemical Synthesis Centre for Doctoral Training funded by the Engineering and Physical Sciences Research Council (EP/G036764/1). C.W.W. and D.N.W thank the Biotechnology and Biological Sciences Research Council for funding the South West Biosciences Doctoral Training Partnership (BB/J014400/1) and a responsive-mode grant (BB/R00661X/1). We thank the University of Bristol School of Chemistry Mass Spectrometry Facility for access to the EPSRC-funded Bruker Ultraflex MALDI-TOF/TOF instrument (EP/K03927X/1). We thank the MX group at Diamond Light Source for their support. D.N.W. holds a Royal Society Wolfson Research Merit Award (WM140008). D.N.W. would like to dedicate this paper to Professor Ronald T Raines on the occasion of his 60th birthday.

REFERENCES (1) Vos, A. M. de; Ultsch, M.; Kossiakoff, A. A. Human Growth Hormone and Extracellular Domain of Its Receptor: Crystal Structure of the Complex. Science 1992, 255 (5042), 306–312. https://doi.org/10.1126/science.1549776. (2) Powers, R.; Garrett, D. S.; March, C. J.; Frieden, E. A.; Gronenborn, A. M.; Clore, G. M. The High-Resolution, Three-Dimensional Solution Structure of Human Interleukin-4 Determined by Multidimensional Heteronuclear Magnetic Resonance Spectroscopy. Biochemistry 1993, 32 (26), 6744–6762. https://doi.org/10.1021/bi00077a030. (3) Lederer, F.; Glatigny, A.; Bethge, P. H.; Bellamy, H. D.; Mathews, F. S. Improvement of the 2.5 Å Resolution Model of Cytochrome B562 by Redetermining the Primary Structure and Using Molecular Graphics. J. Mol. Biol. 1981,

148 (4), 427–448. https://doi.org/10.1016/00222836(81)90185-6. (4) Klabunde, T.; Eicken, C.; Sacchettini, J. C.; Krebs, B. Crystal Structure of a Plant Catechol Oxidase Containing a Dicopper Center. Nat. Struct. Mol. Biol. 1998, 5 (12), 1084– 1090. https://doi.org/10.1038/4193. (5) Lawson, D. M.; Artymiuk, P. J.; Yewdall, S. J.; Smith, J. M. A.; Livingstone, J. C.; Treffry, A.; Luzzago, A.; Levi, S.; Arosio, P.; Cesareni, G.; Thomas, C.D.; Shaw, W.V.; Harrison, P.M. Solving the Structure of Human H Ferritin by Genetically Engineering Intermolecular Crystal Contacts. Nature 1991, 349 (6309), 541–544. https://doi.org/10.1038/349541a0. (6) Friedman, A. M.; Fischmann, T. O.; Steitz, T. A. Crystal Structure of Lac Repressor Core Tetramer and Its Implications for DNA Looping. Science 1995, 268 (5218), 1721–1727. https://doi.org/10.1126/science.7792597. (7) Ferré-D’Amaré, A. R.; Prendergast, G. C.; Ziff, E. B.; Burley, S. K. Recognition by Max of Its Cognate DNA through a Dimeric b/HLH/Z Domain. Nature 1993, 363 (6424), 38–45. https://doi.org/10.1038/363038a0. (8) Sutton, R. B.; Fasshauer, D.; Jahn, R.; Brunger, A. T. Crystal Structure of a SNARE Complex Involved in Synaptic Exocytosis at 2.4 Å Resolution. Nature 1998, 395 (6700), 347–353. https://doi.org/10.1038/26412. (9) Wynne, S. A.; Crowther, R. A.; Leslie, A. G. W. The Crystal Structure of the Human Hepatitis B Virus Capsid. Mol. Cell 1999, 3 (6), 771–780. https://doi.org/10.1016/S1097-2765(01)80009-5. (10) Hulko, M.; Berndt, F.; Gruber, M.; Linder, J. U.; Truffault, V.; Schultz, A.; Martin, J.; Schultz, J. E.; Lupas, A. N.; Coles, M. The HAMP Domain Structure Implies Helix Rotation in Transmembrane Signaling. Cell 2006, 126 (5), 929–940. https://doi.org/10.1016/j.cell.2006.06.058. (11) Thomaston, J. L.; Alfonso-Prieto, M.; Woldeyes, R. A.; Fraser, J. S.; Klein, M. L.; Fiorin, G.; DeGrado, W. F. HighResolution Structures of the M2 Channel from Influenza A Virus Reveal Dynamic Pathways for Proton Stabilization and Transduction. Proc. Natl. Acad. Sci. 2015, 112 (46), 14260–14265. https://doi.org/10.1073/pnas.1518493112. (12) Hill, R. B.; Raleigh, D. P.; Lombardi, A.; DeGrado, W. F. De Novo Design of Helical Bundles as Models for Understanding Protein Folding and Function. Acc. Chem. Res. 2000, 33 (11), 745–754. https://doi.org/10.1021/ar970004h. (13) Woolfson, D. N. Core-Directed Protein Design. Curr. Opin. Struct. Biol. 2001, 11 (4), 464–471. https://doi.org/10.1016/S0959-440X(00)00234-7. (14) Hecht, M. H.; Das, A.; Go, A.; Bradley, L. H.; Wei, Y. De Novo Proteins from Designed Combinatorial Libraries. Protein Sci. 2004, 13 (7), 1711–1723. https://doi.org/10.1110/ps.04690804.

ACS Paragon Plus Environment

Journal of the American Chemical Society 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

(15) Horne, W. S.; Gellman, S. H. Foldamers with Heterogeneous Backbones. Acc. Chem. Res. 2008, 41 (10), 1399–1408. https://doi.org/10.1021/ar800009n. (16) Chino, M.; Maglio, O.; Nastri, F.; Pavone, V.; DeGrado, W. F.; Lombardi, A. Artificial Diiron Enzymes with a De Novo Designed Four-Helix Bundle Structure. Eur. J. Inorg. Chem. 2015, 2015 (21), 3371–3390. https://doi.org/10.1002/ejic.201500470. (17) Regan, L.; Caballero, D.; Hinrichsen, M. R.; Virrueta, A.; Williams, D. M.; O’Hern, C. S. Protein Design: Past, Present, and Future. Pept. Sci. 2015, 104 (4), 334– 350. https://doi.org/10.1002/bip.22639. (18) Grayson, K. J.; Anderson, J. L. R. Designed for Life: Biocompatible de Novo Designed Proteins and Components. J. Royal Soc. Interface 2018, 15 (145), 20180472. https://doi.org/10.1098/rsif.2018.0472. (19) Regan, L.; DeGrado, W. F. Characterization of a Helical Protein Designed from First Principles. Science 1988, 241 (4868), 976–978. https://doi.org/10.1126/science.3043666. (20) Argos, P.; Rossmann, M. G.; Johnson, J. E. A FourHelical Super-Secondary Structure. Biochem. Biophys. Res. Commun. 1977, 75 (1), 83–86. https://doi.org/10.1016/0006-291X(77)91292-X. (21) Crick, F. H. The Packing of α-Helices: Simple Coiled-Coils. Acta Crystallogr. 1953, 6 (8–9), 689–697. https://doi.org/10.1107/S0365110X53001964 (22) Weber, P. C.; Salemme, F. R. Structural and Functional Diversity in 4-α-Helical Proteins. Nature 1980, 287 (5777), 82–84. https://doi.org/10.1038/287082a0. (23) Presnell, S. R.; Cohen, F. E. Topological Distribution of Four-Alpha-Helix Bundles. Proc. Natl. Acad. Sci. 1989, 86 (17), 6592–6596. https://doi.org/10.1073/pnas.86.17.6592. (24) Harris, N. L.; Presnell, S. R.; Cohen, F. E. Four Helix Bundle Diversity in Globular Proteins. J. Mol. Biol. 1994, 236 (5), 1356–1368. https://doi.org/10.1016/00222836(94)90063-9. (25) Murzin, A. G.; Finkelstein, A. V. General Architecture of the α-Helical Globule. J. Mol. Biol. 1988, 204 (3), 749–769. https://doi.org/10.1016/00222836(88)90366-X. (26) Cohen, C.; Parry, D. A. D. α-Helical Coiled Coils and Bundles: How to Design an α-Helical Protein. Proteins: Struct., Funct., Genet. 1990, 7 (1), 1–15. https://doi.org/10.1002/prot.340070102. (27) Bryson, J. W.; Betz, S. F.; Lu, H. S.; Suich, D. J.; Zhou, H. X.; O’Neil, K. T.; DeGrado, W. F. Protein Design: A Hierarchic Approach. Science 1995, 270 (5238), 935–941. https://doi.org/10.1126/science.270.5238.935. (28) Chothia, C.; Levitt, M.; Richardson, D. Structure of Proteins: Packing of Alpha-Helices and Pleated Sheets.

Page 10 of 15

Proc. Natl. Acad. Sci. 1977, 74 (10), 4130–4134. https://doi.org/10.1073/pnas.74.10.4130 (29) Chothia, C.; Levitt, M.; Richardson, D. Helix to Helix Packing in Proteins. J. Mol. Biol. 1981, 145 (1), 215– 250. https://doi.org/10.1016/0022-2836(81)90341-7 (30) DeGrado, W. F.; Wasserman, Z. R.; Lear, J. D. Protein Design, a Minimalist Approach. Science 1989, 243 (4891), 622–628. https://doi.org/10.1126/science.2464850. (31) Ho, S. P.; DeGrado, W. F. Design of a 4-Helix Bundle Protein: Synthesis of Peptides Which Self-Associate into a Helical Protein. J. Am. Chem. Soc. 1987, 109 (22), 6751–6758. https://doi.org/10.1021/ja00256a032. (32) Hill, C. P.; Anderson, D. H.; Wesson, L.; DeGrado, W. F.; Eisenberg, D. Crystal Structure of Alpha 1: Implications for Protein Design. Science 1990, 249 (4968), 543–546. https://doi.org/10.1126/science.2382133. (33) Handel, T.; DeGrado, W. F. De Novo Design of a Zn2+-Binding Protein. J. Am. Chem. Soc. 1990, 112 (18), 6710–6711. https://doi.org/10.1021/ja00174a039. (34) Raleigh, D. P.; DeGrado, W. F. A de Novo Designed Protein Shows a Thermally Induced Transition from a Native to a Molten Globule-Like State. J. Am. Chem. Soc. 1992, 114 (25), 10079–10081. https://doi.org/10.1021/ja00051a061. (35) Raleigh, D. P.; Betz, S. F.; DeGrado, W. F. A de Novo Designed Protein Mimics the Native State of Natural Proteins. J. Am. Chem. Soc. 1995, 117 (28), 7558–7559. https://doi.org/10.1021/ja00133a035. (36) Hill, R. B.; DeGrado, W. F. Solution Structure of α2D, a Nativelike de Novo Designed Protein. J. Am. Chem. Soc. 1998, 120 (6), 1138–1145. https://doi.org/10.1021/ja9733649. (37) Hecht, M. H.; Richardson, J. S.; Richardson, D. C.; Ogden, R. C. De Novo Design, Expression, and Characterization of Felix: A Four-Helix Bundle Protein of Native-like Sequence. Science 1990, 249 (4971), 884–891. https://doi.org/10.1126/science.2392678. (38) Joh, N. H.; Wang, T.; Bhate, M. P.; Acharya, R.; Wu, Y.; Grabe, M.; Hong, M.; Grigoryan, G.; DeGrado, W. F. De Novo Design of a Transmembrane Zn2+-Transporting Four-Helix Bundle. Science 2014, 346 (6216), 1520–1524. https://doi.org/10.1126/science.1261172. (39) Polizzi, N. F.; Wu, Y.; Lemmin, T.; Maxwell, A. M.; Zhang, S.-Q.; Rawson, J.; Beratan, D. N.; Therien, M. J.; DeGrado, W. F. De Novo Design of a Hyperstable NonNatural Protein–Ligand Complex with Sub-Å Accuracy. Nat. Chem. 2017, 9 (12), 1157–1164. https://doi.org/10.1038/nchem.2846. (40) Gibney, B. R.; Dutton, P. L. De Novo Design and Synthesis of Heme Proteins. Adv. Inorg. Chem. 2000, 51,

ACS Paragon Plus Environment

Page 11 of 15 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Journal of the American Chemical Society

409–456. https://doi.org/10.1016/S08988838(00)51008-3. (41) Koder, R. L.; Dutton, P. L. Intelligent Design: The de Novo Engineering of Proteins with Specified Functions. Dalton Trans. 2006, 0 (25), 3045–3051. https://doi.org/10.1039/B514972J. (42) Koder, R. L.; Anderson, J. L. R.; Solomon, L. A.; Reddy, K. S.; Moser, C. C.; Dutton, P. L. Design and Engineering of an O2 Transport Protein. Nature 2009, 458 (7236), 305–309. https://doi.org/10.1038/nature07841.

(51) Murphy, G. S.; Greisman, J. B.; Hecht, M. H. De Novo Proteins with Life-Sustaining Functions Are Structurally Dynamic. J. Mol. Biol. 2016, 428 (2), 399–411. https://doi.org/10.1016/j.jmb.2015.12.008. (52) Cherny, I.; Korolev, M.; Koehler, A. N.; Hecht, M. H. Proteins from an Unevolved Library of de Novo Designed Sequences Bind a Range of Small Molecules. ACS Synth. Biol. 2012, 1 (4), 130–138. https://doi.org/10.1021/sb200018e. (53) Hoegler, K. J.; Hecht, M. H. A de Novo Protein Confers Copper Resistance in Escherichia Coli. Protein Sci. 2016, 25 (7), 1249–1259. https://doi.org/10.1002/pro.2871.

(43) Farid, T. A.; Kodali, G.; Solomon, L. A.; Lichtenstein, B. R.; Sheehan, M. M.; Fry, B. A.; Bialas, C.; Ennist, N. M.; Siedlecki, J. A.; Zhao, Z.; Stetz, M.A.; Valentine, K.G.; Anderson, J.L.R.; Wand, A.J.; Discher, B.M.; Moser, C.C.; Dutton, P.L. Elementary Tetrahelical Protein Design for Diverse Oxidoreductase Functions. Nat. Chem. Biol. 2013, 9 (12), 826–833. https://doi.org/10.1038/nchembio.1362.

(54) Donnelly, A. E.; Murphy, G. S.; Digianantonio, K. M.; Hecht, M. H. A de Novo Enzyme Catalyzes a Life-sustaining Reaction in Escherichia Coli. Nat. Chem. Biol. 2018, 14 (3), 253–255. https://doi.org/10.1038/nchembio.2550.

(44) Kodali, G.; A. Mancini, J.; A. Solomon, L.; V. Episova, T.; Roach, N.; J. Hobbs, C.; Wagner, P.; A. Mass, O.; Aravindu, K.; Barnsley, J.E.; Gordon, K.C.; Officer, D.L.; Dutton, P.L.; Moser, C. C. Design and Engineering of Water-Soluble Light-Harvesting Protein Maquettes. Chem. Sci. 2017, 8 (1), 316–324. https://doi.org/10.1039/C6SC02417C.

(55) Lombardi, A.; Summa, C. M.; Geremia, S.; Randaccio, L.; Pavone, V.; DeGrado, W. F. Retrostructural Analysis of Metalloproteins: Application to the Design of a Minimal Model for Diiron Proteins. Proc. Natl. Acad. Sci. 2000, 97 (12), 6298–6305. https://doi.org/10.1073/pnas.97.12.6298.

(45) Watkins, D. W.; Jenkins, J. M. X.; Grayson, K. J.; Wood, N.; Steventon, J. W.; Le Vay, K. K.; Goodwin, M. I.; Mullen, A. S.; Bailey, H. J.; Crump, M. P.; MacMillan, F.; Mulholland, A.J.; Cameron, G.; Session, R.B.; Mann, S.; Anderson, J.L.R. Construction and in Vivo Assembly of a Catalytically Proficient and Hyperthermostable de Novo Enzyme. Nat. Commun. 2017, 8 (1), 358. https://doi.org/10.1038/s41467-017-00541-4.

(56) Lupas, A. N.; Bassler, J.; Dunin-Horkawicz, S. The Structure and Topology of α-Helical Coiled Coils. Fibrous Proteins: Structures and Mechanisms 2017, 95–129. https://doi.org/10.1007/978-3-319-49674-0_4.

(46) Kamtekar, S.; Schiffer, J. M.; Xiong, H.; Babik, J. M.; Hecht, M. H. Protein Design by Binary Patterning of Polar and Nonpolar Amino Acids. Science 1993, 262 (5140), 1680–1685. https://doi.org/10.1126/science.8259512. (47) Wei, Y.; Liu, T.; Sazinsky, S. L.; Moffet, D. A.; Pelczer, I.; Hecht, M. H. Stably Folded de Novo Proteins from a Designed Combinatorial Library. Protein Sci. 2003, 12 (1), 92–102. https://doi.org/10.1110/ps.0228003. (48) Wei, Y.; Kim, S.; Fela, D.; Baum, J.; Hecht, M. H. Solution Structure of a de Novo Protein from a Designed Combinatorial Library. Proc. Natl. Acad. Sci. 2003, 100 (23), 13270–13273. https://doi.org/10.1073/pnas.1835644100.

(57) Woolfson, D. N. Coiled-Coil Design: Updated and Upgraded. Fibrous Proteins: Structures and Mechanisms 2017, 35–61. https://doi.org/10.1007/978-3-319-496740_2. (58) Zaccai, N. R.; Chi, B.; Thomson, A. R.; Boyle, A. L.; Bartlett, G. J.; Bruning, M.; Linden, N.; Sessions, R. B.; Booth, P. J.; Brady, R. L.; Woolfson, D.N. A de Novo Peptide Hexamer with a Mutable Channel. Nat. Chem. Biol. 2011, 7 (12), 935–941. https://doi.org/10.1038/nchembio.692. (59) Harbury, P. B.; Zhang, T.; Kim, P. S.; Alber, T. A Switch between Two-, Three-, and Four-Stranded Coiled Coils in GCN4 Leucine Zipper Mutants. Science 1993, 262 (5138), 1401–1407. https://doi.org/10.1126/science.8248779 (60) Harbury, P. B.; Kim, P. S.; Alber, T. Crystal Structure of an Isoleucine-Zipper Trimer. Nature 1994, 371 (6492), 80-83.

(49) Hecht, M. H.; Vogel, K. M.; Spiro, T. G.; Rojas, N. R. L.; Kamtekar, S.; Simons, C. T.; Mclean, J. E.; Farid, R. S. De Novo Heme Proteins from Designed Combinatorial Libraries. Protein Sci. 1997, 6 (12), 2512–2524. https://doi.org/10.1002/pro.5560061204.

(61) Woolfson, D. N. The Design of Coiled-Coil Structures and Assemblies. Advances in Protein Chemistry 2005 70, 79–112. https://doi.org/10.1016/S00653233(05)70004-8.

(50) Fisher, M. A.; McKinley, K. L.; Bradley, L. H.; Viola, S. R.; Hecht, M. H. De Novo Designed Proteins from a Library of Artificial Sequences Function in Escherichia Coli and Enable Cell Growth. PLoS One 2011, 6 (1), e15364. https://doi.org/10.1371/journal.pone.0015364.

(62) Fletcher, J. M.; Boyle, A. L.; Bruning, M.; Bartlett, G. J.; Vincent, T. L.; Zaccai, N. R.; Armstrong, C. T.; Bromley, E. H.; Booth, P. J.; Brady, R. L. A Basis Set of de Novo CoiledCoil Peptide Oligomers for Rational Protein Design and Synthetic Biology. ACS Synth. Biol. 2012, 1 (6), 240–250.

ACS Paragon Plus Environment

Journal of the American Chemical Society 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

(63) Burton, A. J.; Thomas, F.; Agnew, C.; Hudson, K. L.; Halford, S. E.; Brady, R. L.; Woolfson, D. N. Accessibility, Reactivity, and Selectivity of Side Chains within a Channel of de Novo Peptide Assembly. J. Am. Chem. Soc. 2013, 135 (34), 12524–12527. https://doi.org/10.1021/ja4053027. (64) Thomson, A. R.; Wood, C. W.; Burton, A. J.; Bartlett, G. J.; Sessions, R. B.; Brady, R. L.; Woolfson, D. N. Computational Design of Water-Soluble α-Helical Barrels. Science 2014, 346 (6208), 485–488. https://doi.org/10.1126/science.1257452. (65) Burton, A. J.; Thomson, A. R.; Dawson, W. M.; Brady, R. L.; Woolfson, D. N. Installing Hydrolytic Activity into a Completely de Novo Protein Framework. Nat. Chem. 2016, 8 (9), 837–844. https://doi.org/10.1038/nchem.2555. (66) Thomas, F.; Dawson, W. M.; Lang, E. J. M.; Burton, A. J.; Bartlett, G. J.; Rhys, G. G.; Mulholland, A. J.; Woolfson, D. N. De Novo-Designed α-Helical Barrels as Receptors for Small Molecules. ACS Synth. Biol. 2018, 7 (7), 1808–1816. https://doi.org/10.1021/acssynbio.8b00225. (67) Hass, M. A. S.; Mulder, F. A. A. Contemporary NMR Studies of Protein Electrostatics. Annu. Rev. Biophys. 2015, 44 (1), 53–75. https://doi.org/10.1146/annurev-biophys083012-130351.

Page 12 of 15

(74) Walshaw, J.; Woolfson, D.N. SOCKET: a program for identifying and analysing coiled-coil motifs within protein structures. J. Mol. Biol. 2001, 307 (5), 1427–1450. https://doi.org/10.1006/jmbi.2001.4545. (75) Pandya, M. J.; Cerasoli, E.; Joseph, A.; Stoneman, R. G.; Waite, E.; Woolfson, D. N. Sequence and Structural Duality:  Designing Peptides to Adopt Two Stable Conformations. J. Am. Chem. Soc. 2004, 126 (51), 17016– 17024. https://doi.org/10.1021/ja045568c. (76) Gurnon, D. G.; Whitaker, J. A.; Oakley, M. G. Design and Characterization of a Homodimeric Antiparallel Coiled Coil. J. Am. Chem. Soc. 2003, 125 (25), 7518–7519. https://doi.org/10.1021/ja0357590. (77) McClain, D. L.; Woods, H. L.; Oakley, M. G. Design and Characterization of a Heterodimeric Coiled Coil That Forms Exclusively with an Antiparallel Relative Helix Orientation. J. Am. Chem. Soc. 2001, 123 (13), 3151–3152. https://doi.org/10.1021/ja004099l. (78) McClain, D. L.; Binfet, J. P.; Oakley, M. G. Evaluation of the Energetic Contribution of Interhelical Coulombic Interactions for Coiled Coil Helix Orientation Specificity. J. Mol. Biol. 2001, 313 (2), 371–383. https://doi.org/10.1006/jmbi.2001.5044.

(68) Liu, J.; Zheng, Q.; Deng, Y.; Li, Q.; Kallenbach, N.R.; Lu, M. Conformational Specificity of the Lac Repressor Coiled Coil Tetramerization Domain. Biochemisty 2007, 46 (51), 14951-14959. https://doi.org/10.1021/bi701930d.

(79) Kohn, W. D.; Kay, C. M.; Hodges, R. S. Protein Destabilization by Electrostatic Repulsions in the TwoStranded α-Helical Coiled-Coil/Leucine Zipper. Protein Sci. 1995, 4 (2), 237–250. https://doi.org/10.1002/pro.5560040210.

(69) Grigoryan, G.; Kim, Y. H.; Acharya, R.; Axelrod, K.; Jain, R. M.; Willis, L.; Drndic, M.; Kikkawa, J. M.; DeGrado, W. F. Computational Design of Virus-Like Protein Assemblies on Carbon Nanotube Surfaces. Science 2011, 332 (6033), 1071–1076. https://doi.org/10.1126/science.1198841.

(80) Baker, E. G.; Bartlett, G. J.; Crump, M. P.; Sessions, R. B.; Linden, N.; Faul, C. F. J.; Woolfson, D. N. Local and Macroscopic Electrostatic Interactions in Single α-Helices. Nat. Chem. Biol. 2015, 11 (3), 221–228. https://doi.org/10.1038/nchembio.1739.

(70) Thomas, F.; Boyle, A. L.; Burton, A. J.; Woolfson, D. N. A Set of de Novo Designed Parallel Heterodimeric Coiled Coils with Quantified Dissociation Constants in the Micromolar to Sub-Nanomolar Regime. J. Am. Chem. Soc. 2013, 135 (13), 5161–5166. https://doi.org/10.1021/ja312310g.

(81) Huang, P.-S.; Oberdorfer, G.; Xu, C.; Pei, X. Y.; Nannenga, B. L.; Rogers, J. M.; DiMaio, F.; Gonen, T.; Luisi, B.; Baker, D. High Thermodynamic Stability of Parametrically Designed Helical Bundles. Science 2014, 346 (6208), 481–485. https://doi.org/10.1126/science.1257481.

(71) Oakley, M. G.; Kim, P. S. A Buried Polar Interaction Can Direct the Relative Orientation of Helices in a Coiled Coil. Biochemistry 1998, 37 (36), 12603–12610. https://doi.org/10.1021/bi981269m.

(82) Egelman, E. H.; Xu, C.; DiMaio, F.; Magnotti, E.; Modlin, C.; Yu, X.; Wright, E.; Baker, D.; Conticello, V. P. Structural Plasticity of Helical Nanotubes Based on CoiledCoil Assemblies. Structure 2015, 23 (2), 280–289. https://doi.org/10.1016/j.str.2014.12.008.

(72) Burgess, N. C.; Sharp, T. H.; Thomas, F.; Wood, C. W.; Thomson, A. R.; Zaccai, N. R.; Brady, R. L.; Serpell, L. C.; Woolfson, D. N. Modular Design of Self-Assembling Peptide-Based Nanotubes. J. Am. Chem. Soc. 2015, 137 (33), 10554–10562. https://doi.org/10.1021/jacs.5b03973. (73) Rhys, G. G.; Wood, C. W.; Lang, E. J. M.; Mulholland, A. J.; Brady, R. L.; Thomson, A. R.; Woolfson, D. N. Maintaining and Breaking Symmetry in Homomeric CoiledCoil Assemblies. Nat. Commun. 2018, 9 (1), 4132. https://doi.org/10.1038/s41467-018-06391-y.

(83) Woolfson, D. N.; Alber, T. Predicting Oligomerization States of Coiled Coils. Protein Sci. 1995, 4 (8), 1596–1607. https://doi.org/10.1002/pro.5560040818. (84) Woolfson, D. N.; Bartlett, G. J.; Burton, A. J.; Heal, J. W.; Niitsu, A.; Thomson, A. R.; Wood, C. W. De Novo Protein Design: How Do We Expand into the Universe of Possible Protein Structures? Curr. Opin. Struct. Biol. 2015, 33, 16– 26. https://doi.org/10.1016/j.sbi.2015.05.009.

ACS Paragon Plus Environment

Page 13 of 15 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Journal of the American Chemical Society

(85) Niitsu Ai; Heal Jack W.; Fauland Kerstin; Thomson Andrew R.; Woolfson Derek N. Membrane-Spanning αHelical Barrels as Tractable Protein-Design Targets. Philos. Trans. R. Soc. B Biol. Sci. 2017, 372 (1726), 20160213. https://doi.org/10.1098/rstb.2016.0213. (86) Crick, F. H. The Fourier Transform of a Coiled-Coil. Acta Crystallogr. 1953, 6 (8–9), 685–689. https://doi.org/10.1107/S0365110X53001952 (87) Grigoryan, G.; DeGrado, W. F. Probing Designability via a Generalized Model of Helical Bundle Geometry. J. Mol. Biol. 2011, 405 (4), 1079–1100. https://doi.org/10.1016/j.jmb.2010.08.058. (88) Harbury, P. B.; Tidor, B.; Kim, P. S. Repacking Protein Cores with Backbone Freedom: Structure Prediction for Coiled Coils. Proc. Natl. Acad. Sci. 1995, 92 (18), 8408–8412. https://doi.org/10.1073/pnas.92.18.8408. (89) Offer, G.; Sessions, R. Computer Modelling of the α-Helical Coiled Coil: Packing of Side-Chains in the Inner Core. J. Mol. Biol. 1995, 249 (5), 967–987. https://doi.org/10.1006/jmbi.1995.0352. (90) Wood, C. W.; Bruning, M.; Ibarra, A. Á.; Bartlett, G. J.; Thomson, A. R.; Sessions, R. B.; Brady, R. L.; Woolfson, D. N. CCBuilder: An Interactive Web-Based Tool for Building, Designing and Assessing Coiled-Coil Protein Assemblies. Bioinformatics 2014, 30 (21), 3029–3035. https://doi.org/10.1093/bioinformatics/btu502. (91) Wood, C. W.; Woolfson, D. N. CCBuilder 2.0: Powerful and Accessible Coiled-Coil Modeling. Protein Sci. 2018, 27 (1), 103–111. https://doi.org/10.1002/pro.3279. (92) Lupas, A. N.; Bassler, J. Coiled Coils – A Model System for the 21st Century. Trends Biochem. Sci. 2017, 42 (2), 130–140. https://doi.org/10.1016/j.tibs.2016.10.007. (93) Negron, C.; Keating, A. E. A Set of Computationally Designed Orthogonal Antiparallel Homodimers That Expands the Synthetic Coiled-Coil Toolkit. J. Am. Chem. Soc. 2014, 136 (47), 16544–16556. https://doi.org/10.1021/ja507847t. (94) Gonzalez Jr, L.; Woolfson, D. N.; Alber, T. Buried Polar Residues and Structural Specificity in the GCN4 Leucine Zipper. Nat. Struct. Mol. Biol. 1996, 3 (12), 1011– 1018. https://doi.org/10.1038/nsb1296-1011. (95) Lumb, K. J.; Kim, P. S. A Buried Polar Interaction Imparts Structural Uniqueness in a Designed Heterodimeric Coiled Coil. Biochemistry 1998, 37 (37), 13042–13042. https://doi.org/10.1021/bi9850468. (96) Fletcher, J. M.; Bartlett, G. J.; Boyle, A. L.; Danon, J. J.; Rush, L. E.; Lupas, A. N.; Woolfson, D. N. N@a and N@d: Oligomer and Partner Specification by Asparagine in Coiled-Coil Interfaces. ACS Chem. Biol. 2017, 12 (2), 528– 538. https://doi.org/10.1021/acschembio.6b00935.

(97) Boyken, S. E.; Chen, Z.; Groves, B.; Langan, R. A.; Oberdorfer, G.; Ford, A.; Gilmore, J. M.; Xu, C.; DiMaio, F.; Pereira, J. H.; Sankaran, B.; Seelig, G.; Zwart, P.H.; Baker, D. De Novo Design of Protein Homo-Oligomers with Modular Hydrogen-Bond Network–Mediated Specificity. Science 2016, 352 (6286), 680–687. https://doi.org/10.1126/science.aad8865. (98) Lizatović, R.; Aurelius, O.; Stenström, O.; Drakenberg, T.; Akke, M.; Logan, D. T.; André, I. A De Novo Designed Coiled-Coil Peptide with a Reversible pH-Induced Oligomerization Switch. Structure 2016, 24 (6), 946–955. https://doi.org/10.1016/j.str.2016.03.027. (99) Ambroggio, X. I.; Kuhlman, B. Computational Design of a Single Amino Acid Sequence That Can Switch between Two Distinct Protein Folds. J. Am. Chem. Soc. 2006, 128 (4), 1154–1161. https://doi.org/10.1021/ja054718w. (100) Aupič, J.; Lapenta, F.; Jerala, R. SwitCCh: Metal-Site Design for Controlling the Assembly of a Coiled-Coil Homodimer. ChemBioChem 2018, 19 (23), 2453–2457. https://doi.org/10.1002/cbic.201800578. (101) Cerasoli, E.; Sharpe, B. K.; Woolfson, D. N. ZiCo:  A Peptide Designed to Switch Folded State upon Binding Zinc. J. Am. Chem. Soc. 2005, 127 (43), 15008–15009. https://doi.org/10.1021/ja0543604. (102) Ciani, B.; Hutchinson, E. G.; Sessions, R. B.; Woolfson, D. N. A Designed System for Assessing How Sequence Affects α to β Conformational Transitions in Proteins. J. Biol. Chem. 2002, 277 (12), 10150–10155. https://doi.org/10.1074/jbc.M107663200. (103) Slovic, A. M.; Lear, J. D.; DeGrado, W. F. De Novo Design of a Pentameric Coiled-Coil: Decoding the Motif for Tetramer versus Pentamer Formation in Water-Soluble Phospholamban. J. Pept. Res. 2005, 65 (3), 312–321. https://doi.org/10.1111/j.1399-3011.2005.00244.x. (104) Slovic, A. M.; Stayrook, S. E.; North, B.; DeGrado, W. F. X-Ray Structure of a Water-Soluble Analog of the Membrane Protein Phospholamban: Sequence Determinants Defining the Topology of Tetrameric and Pentameric Coiled Coils. J. Mol. Biol. 2005, 348 (3), 777– 787. https://doi.org/10.1016/j.jmb.2005.02.040. (105) Spencer, R. K.; Hochbaum, A. I. The Phe-Ile Zipper: A Specific Interaction Motif Drives Antiparallel Coiled-Coil Hexamer Formation. Biochemistry 2017, 56 (40), 5300– 5308. https://doi.org/10.1021/acs.biochem.7b00756. (106) Spencer, R. K.; Hochbaum, A. I. X-Ray Crystallographic Structure and Solution Behavior of an Antiparallel Coiled-Coil Hexamer Formed by de Novo Peptides. Biochemistry 2016, 55 (23), 3214–3223. https://doi.org/10.1021/acs.biochem.6b00201. (107) Ljubetič, A.; Lapenta, F.; Gradišar, H.; Drobnak, I.; Aupič, J.; Strmšek, Ž.; Lainšček, D.; Hafner-Bratkovič, I.; Majerle, A.; Krivec, N.; Benčina, M.; Pisanski, T.; Veličković, T. Ć.; Round, A.; Carazo, J.M.; Melero, R.; Jerala, R Design of

ACS Paragon Plus Environment

Journal of the American Chemical Society 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Coiled-Coil Protein-Origami Cages That Self-Assemble in Vitro and in Vivo. Nat. Biotechnol. 2017, 35 (11), 1094– 1101. https://doi.org/10.1038/nbt.3994. (108) Swainsbury, D. J. K.; Harniman, R. L.; Di Bartolo, N. D.; Liu, J.; Harper, W. F. M.; Corrie, A. S.; Jones, M. R. Directed Assembly of Defined Oligomeric Photosynthetic Reaction Centres through Adaptation with Programmable Extra-Membrane Coiled-Coil Interfaces. Biochim. Biophys. Acta 2016, 1857 (12), 1829–1839. https://doi.org/10.1016/j.bbabio.2016.09.002. (109) Cristie-David, A. S.; Sciore, A.; Badieyan, S.; Escheweiler, J. D.; Koldewey, P.; Bardwell, J. C. A.; Ruotolo, B. T.; Marsh, E. N. G. Evaluation of de Novo-Designed Coiled Coils as off-the-Shelf Components for Protein Assembly. Mol. Syst. Des. Eng. 2017, 2 (2), 140–148. https://doi.org/10.1039/C7ME00012J. (110) Sciore, A.; Su, M.; Koldewey, P.; Eschweiler, J. D.; Diffley, K. A.; Linhares, B. M.; Ruotolo, B. T.; Bardwell, J. C. A.; Skiniotis, G.; Marsh, E. N. G. Flexible, Symmetry-Directed Approach to Assembling Protein Cages. Proc. Natl. Acad. Sci. U. S. A. 2016, 113 (31), 8681–8686. https://doi.org/10.1073/pnas.1606013113. (111) Fletcher, J. M.; Harniman, R. L.; Barnes, F. R. H.; Boyle, A. L.; Collins, A.; Mantell, J.; Sharp, T. H.; Antognozzi, M.; Booth, P. J.; Linden, N.; Miles, M.J.; Sessions, R.B.; Verkade, P.; Woolfson, D.N. Self-Assembling Cages from Coiled-Coil Peptide Modules. Science 2013, 1226558. https://doi.org/10.1126/science.1233936.

ACS Paragon Plus Environment

Page 14 of 15

Page 15 of 15 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Journal of the American Chemical Society

ACS Paragon Plus Environment