Refined Crystal Structure of Samia cynthia ... - ACS Publications

May 15, 2017 - Refined Crystal Structure of Samia cynthia ricini Silk Fibroin Revealed ...... The calculations were supported by supercomputer system ...
0 downloads 0 Views 4MB Size
Subscriber access provided by UB + Fachbibliothek Chemie | (FU-Bibliothekssystem)

Article

A Refined Crystal Structure of Samia cynthia ricini Silk Fibroin Revealed by Solid-State NMR Investigations Tetsuo Asakura, Akio Nishimura, Shunsuke Kametani, Shuto Kawanishi, Akihiro Aoki, Furitsu Suzuki, Hironori Kaji, and Akira Naito Biomacromolecules, Just Accepted Manuscript • Publication Date (Web): 15 May 2017 Downloaded from http://pubs.acs.org on May 16, 2017

Just Accepted “Just Accepted” manuscripts have been peer-reviewed and accepted for publication. They are posted online prior to technical editing, formatting for publication and author proofing. The American Chemical Society provides “Just Accepted” as a free service to the research community to expedite the dissemination of scientific material as soon as possible after acceptance. “Just Accepted” manuscripts appear in full in PDF format accompanied by an HTML abstract. “Just Accepted” manuscripts have been fully peer reviewed, but should not be considered the official version of record. They are accessible to all readers and citable by the Digital Object Identifier (DOI®). “Just Accepted” is an optional service offered to authors. Therefore, the “Just Accepted” Web site may not include all articles that will be published in the journal. After a manuscript is technically edited and formatted, it will be removed from the “Just Accepted” Web site and published as an ASAP article. Note that technical editing may introduce minor changes to the manuscript text and/or graphics which could affect content, and all legal disclaimers and ethical guidelines that apply to the journal pertain. ACS cannot be held responsible for errors or consequences arising from the use of information contained in these “Just Accepted” manuscripts.

Biomacromolecules is published by the American Chemical Society. 1155 Sixteenth Street N.W., Washington, DC 20036 Published by American Chemical Society. Copyright © American Chemical Society. However, no copyright claim is made to original U.S. Government works, or works produced by employees of any Commonwealth realm Crown government in the course of their duties.

Page 1 of 43

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biomacromolecules

A Refined Crystal Structure of Samia cynthia ricini Silk Fibroin Revealed by Solid-State NMR Investigations Tetsuo Asakura,* †Akio Nishimura, † Shunsuke Kametani, † Shuto Kawanishi †Akihiro Aoki, †Furitsu Suzuki,§ Hironori Kaji§ and Akira Naito †



Department of Biotechnology, Tokyo University of Agriculture and Technology, Koganei, Tokyo 184-8588 JAPAN

§

Institute for Chemical Research, Kyoto University, Uji, Kyoto 611-0011, JAPAN

*Correspondence to: Tetsuo Asakura

Tel & FAX: +84-42-383-7733

Email: [email protected]

Key words: Silk Fibroin / Samia cynthia ricini / Solid state NMR/NMR chemical shift calculation / Poly-L-alanine

Abstract

1 ACS Paragon Plus Environment

Biomacromolecules

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 2 of 43

Samia cynthia ricini is one of the wild silkworms and its silk fibroin (SF) consists of alternatively repeating poly-L-alanine (PLA) sequences as crystalline domain and glycine-rich sequences as non-crystalline domain; the structure is similar to those of spider silk and other wild silkworm silks. In this paper, we proposed a new staggered model for the packing arrangement of the PLA sequence through the use of the Cambridge Serial Total Energy Package program and a comparison of the observed and calculated chemical shifts of the PLA sequence with the Gauge Including Projector Augmented Wave method. The new model was supported by the inter-atomic distance information from the cross peaks of Ala Cβ dipolar-assisted rotational resonance (DARR) spectrum of the PLA sequences in S. c. ricini SF fiber. In addition, three

13

C

NMR peaks observed in the β-sheet region were assigned to the carbons with different environments in the same model, but not assigned to different β-sheet structures.

2 ACS Paragon Plus Environment

Page 3 of 43

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biomacromolecules

Introduction There are a variety of silkworms and spiders, each producing silk with a unique primary and higher order structures.1,2 Their excellent properties such as high strength and high toughness have attracted researchers in diverse fields, such as biology, biochemistry, biophysics, analytical chemistry, polymer technology, textile technology and biomaterials.3-9 In order to optimize the information content of multidisciplinary approaches, it is useful to study the structure-property relationships in silks, particularly with respect to their primary and higher order structures. The most known silk, Bombyx mori (B. mori) silk fibroin (SF) has been much studied, and ample knowledge has been gathered on its structure and dynamics, including structure-property relationships.4,5,9-11 The amino-acid composition of B. mori SF is known and comprises 42.9% Gly, 30.0% Ala, 12.2% Ser, 4.8% Tyr, and 2.5% Val.12,13 Its primary structure consists largely of a repeating sequence of six residues (GAGAGS)n which forms the crystal domains of the SF. The conformation of these sequences is mainly anti-parallel β-sheet structure, which was elucidated by X-ray diffraction analysis,14,15 infrared spectroscopy (IR),16-18 Raman17,19 and solid-state nuclear magnetic resonance (NMR).20-23 However, X-ray diffraction analysis gives only limited structural information because SF exists in the fiber form, not as a single crystal, and the packing structure is heterogeneous and not easy to study. IR and Raman experiments are also 3 ACS Paragon Plus Environment

Biomacromolecules

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 4 of 43

difficult to yield the packing structure. Instead, the combination of several solid-state NMR techniques can give detailed information on the packing structure of such heterogeneous SF samples. For example, the previous packing model of B. mori SF fiber after spinning (Silk II) initially proposed by Marsh and Pauling et al.14on the basis of X-ray diffraction analysis has been revised through the use of several solid state NMR techniques.23 In addition, a precise packing model of poly (Ala-Gly) as a model peptide for B. mori SF proposed by Takahashi et al.15 using X-ray diffraction data has also revised by us using solid-state NMR and conformational energy calculations.23 Recently much attention has been paid to the wild silkworm SF and spider silk as new biomaterials.24-26 Thus, in this paper, we aimed to determine the packing structure of the SF fiber from a wild silkworm, Samia cynthia ricini (S.c.ricini). The S. c. ricini SF typically contains polyalanine sequence Ala12-13 (PLA), embedded in a Gly-rich amorphous matrix, and the amino acid composition includes Ala (45.4%), Gly (31.7%), Ser (6.7%) and Tyr(5.8%).27 The primary structure is close to those of SF’s from other wild silkworms such as Antheraea pernyi (A. pernyi),

28

Antheraea yamamai (A.

yamamai) 29and Antheraea mylitta (A. mylitta),30 and also the major ampullate silk from the spider Nephila clavipes.31-34 The determination of the packing structure of the crystalline PLA domain can give packing information that are common for these silks. The PLA sequences in S. c. ricini SF fiber are considered to have anti-parallel β-sheet 4 ACS Paragon Plus Environment

Page 5 of 43

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biomacromolecules

structure, as revealed by X-ray diffraction35,36 and solid-state NMR

36-38

, and more

knowledge of this structure is useful for determining the backbone torsion angles of the SF fiber. In our previous paper,39 we did a systematic structural analysis of alanine oligomers with anti-parallel β-sheet structure as a model for the crystalline region of spider dragline silk and S. c. ricini SF, by using solid-state NMR spectroscopy and X-ray crystallography. The alanine oligomers pack into two different arrangements, depending on the lengths of the sequences. Short sequences (n = 6 or less) pack into a rectangular arrangement. Longer sequences pack in a staggered arrangement. Thus, the packing arrangement of PLA sequences in S. c. ricini SF seems to be staggered. In this paper we report the atomic co-ordinates of the packing structure of PLA sequences in S. c. ricini SF using solid-state NMR techniques, such as dipolar-assisted rotational resonance (DARR),40-47 1H-detection in the double CP 1H-13C correlation48 and 1H and

13

C NMR

chemical shift calculation. In particular, the chemical shift calculation methods [e.g., the geometry optimization under periodic boundary conditions using the Cambridge Serial Total Energy Package (CASTEP) program49 and the Gauge Including Projector Augmented Wave (GIPAW) method50] have been successfully used to determine the packing structures of B. mori SF before and after spinning23,51 and also applied to the determination of the packing structures of (Ala)n (n = 3,4).52,53 In order to obtain the 5 ACS Paragon Plus Environment

Biomacromolecules

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

observed 13C and 1H chemical shifts of S. c. ricini SF, 13C DARR and 1H-detection in the double CP 1H-13C correlation measurements have been performed for [U-13C] S. c. ricini SF and their crystalline fraction.54 Experimental Section Preparation of Alanine oligomers 13

C selectively labeled 34-mer peptide, GGAGGGYGGDGG(A)6[3-13C] A19(A)5GG-

AGDGYGAG as a typical tandem sequence of S. c. ricini SF was synthesized with Fmoc-Ala-PEG-PS resin (PE Biosystems) on a PioneerTM Peptide Synthesizer using Fmoc chemistry in our laboratory.55 After synthesis, the samples were dissolved in 9 M LiBr and then dialyzed against water for 4 days. The precipitate was obtained and dried. After this treatment, the peptide powder with β-sheet structure was obtained. The [3-13C] Ala (99%

13

C enrichment) was purchased from Cambridge Isotopes Laboratories,

Andover, MA, USA. Preparation of [U-13C] S. c. ricini SF fiber and the crystalline fraction Eggs of S. c. ricini silkworms were kindly given by Prof. Saito (Kyoto Institute of Technology, Japan), and the silkworms were reared with an artificial diet, silk mate L4M (Nosan Co., Japan), in our laboratory. [U-13C] S. c. ricini SF fiber was prepared as follows. Artificial diet with a mixture of [U-13C] glucose (99%, Cambridge Isotopes Laboratories) was fed to the 5th instar larvae from 3 to 6 days old for 4 days twice per 6 ACS Paragon Plus Environment

Page 6 of 43

Page 7 of 43

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biomacromolecules

day (morning and evening). A total of approximately 600 mg of [U-13C] glucose was fed per silkworm. The cocoons obtained from these S. c. ricini silkworms were degummed three times with 0.1% (w/w) sodium peroxide (Na2O2) solution at 100˚C for 30 min and washed with distilled water in order to remove silk sericin, another silk protein, from the surface of the silk fibers.36 The SF fibers were dried at room temperature prior to

13

C

DARR NMR measurements. The crystalline fraction of [U-13C] S. c. ricini SF was prepared as before.54 Hydrochloric acid solution (20ml, 5 mol/l) was added to 200 mg of the degummed [U-13C] SF and kept for 5 h under 80°C. Then the reaction was stopped by adding aqueous sodium hydroxide. The solution was centrifuged at 8,500 rpm for 30 min at 4°C. Distilled water was added to the precipitate and then centrifuged again under the same condition. This treatment was repeated three times and then the precipitate was freeze-dried. The crystalline fraction of [U-13C] S. c. ricini SF was obtained as a powder form of the precipitate. The non-labeled S. c. ricini SF fiber was also used for

13

C

CP/MAS NMR observation. 13

C CP/MAS NMR and 13C DARR measurements.

The

13

C CP/MAS NMR spectra of S. c. ricini SF and the 34-mer peptide,

GGAGGGYGGDGG(A)6[3-13C] A19(A)5GGAGDGYGAG were recorded at room temperature on a Varian Unity Infinity 400 MHz spectrometer with a

13

C operating

frequency of 100.6 MHz. Samples were spun at a MAS frequency of 10 kHz. The 7 ACS Paragon Plus Environment

Biomacromolecules

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 8 of 43

number of acquisitions was 8,000, and the recycle delay was 5 s. Radio-frequency (rf) field strength at 50 kHz was used for 1H-decoupling during the acquisition period of 12.8 ms. A 90° pulse width of 5 µs and CP contact pulse of 1 ms were also employed. Phase cycling was used to minimize spectral artifacts. The chemical shifts were referenced to TMS, using adamantane as a secondary standard (13CH peak at 28.8ppm). The

13

DARR spectrum of [U-13C] S.c.ricini SF fiber was obtained after 32 scans at a

13

C C

resonance frequency of 99.5 MHz, using a JEOL ECX400 spectrometer at a spinning speed of 8 kHz with a 4 mm OD rotor.47 The π/2 pulse was 3.8 µs for 13C, and 3.4 µs for 1

H. TPPM 1H decoupling was performed with a contact time of 2 ms. The mixing time

was changed every 100 ms from 100 ms to 500 ms, combined with a relaxation delay of 2 s. In addition, the RF field was set at 8 kHz. The indirect dimension consisted of 256 data points. 1

H-detection in the double CP 1H-13C correlation measurements

The crystalline fraction of [U-13C] S. c. ricini SF was used for 1H-detection in the double CP 1H-13C correlation measurements. This experiment was performed at a 1H resonance frequency of 920 MHz, using a JEOL JNM-ECA920 spectrometer equipped with a 1H-X double resonance and ultra-high speed MAS probe at the Institute for Molecular Science in Okazaki, Japan.23,51-53 The sample spinning speed was actively stabilized by a pneumatic solenoid valve such that the spinning fluctuations were less 8 ACS Paragon Plus Environment

Page 9 of 43

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biomacromolecules

than ±10 Hz at a spinning rate of 70 kHz. The temperature of the samples increases due to friction under fast MAS was estimated to be around 333 K at 70 kHz MAS according to Pb(NO3)2 temperature calibration.56 The 1H rf field strength for the excitation π/2 pulse (1.29 µs) was 194 kHz. The 1H chemical shift was referenced to the peak of silicon rubber and set to 0.12 ppm from TMS. For 1H-detection in the double CP 1H-13C correlation measurements, the pulse sequence 90Hy-CPx- t1C-90Cφ-τd-90Cy-CPx-t2H was used.39 Here, 90° is a π/2 pulse, CP is a 4-ms cross-polarization period with a 10% (first) and -10% (second) ramp of dephasing of transverse

13

C, t1 is the evolution period, τd is a 5-ms period for

13

C magnetization and 1H magnetization suppression, and t2 is

the detection period. Superscripts H and C indicate 1H and 13C, and subscripts x, y, and φ indicate rf phases, with φ = x and y for quadrature detection in t1. The 1H decoupling amplitude during t1C was 27 kHz. The spectrum was obtained after 64 scans at each period in the y domain with 512 points. 13

C and 1H NMR chemical shift calculations of the packing structure of PLA

sequence in S. c. ricini SF fiber The

characteristic

torsion

angles

of

(φ,ϕ)

=

(-138.6°,

134.7°)

for

an

anti-parallel β-sheet structure were used for Ala residues in PLA chains.35,37,38 The initial packing model of PLA was prepared using the cell dimensions reported by Arnott et al.35; a = 6.890 Å, b =10.535 Å, c = 9.468 Å and α = 90.0, β = 90.0, γ = 90.0. 9 ACS Paragon Plus Environment

The

Biomacromolecules

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

packing model was energy-minimized using the pcff force field of Discover software (Dassault Systems Biovia Corp., San Diego, CA, USA). Then, the geometry optimization was performed under periodic boundary conditions using CASTEP program (Dassault Systems Biovia Corp., San Diego, CA, USA).49 We used the generalized gradient approximation (GGA) for the exchange correlation energy based on the Perdew, Bruke and Ernzerhof (PBE) functional and ultrasoft pseudopotentials with a plane-wave energy cutoff of 380 eV. A 4 × 2 × 3 Monkhorst- Pack k-point grid was used for Brillouin zone sampling. The

13

C and 1H chemical shifts were then calculated using the GIPAW

method.50 The PBE approximation and “on the fly” pseudo potentials were used. The energy cutoff of the plane wave was set to 610 eV and a 4 ×2 × 3 Monkhorst-Pack k-point grid was used as described above. All calculations were carried out using the NMR-CASTEP program. The calculated

13

C and 1H chemical shifts of the PLA model

with anti-parallel β-sheet structure were found to be consistent with the observed Ala Cβ carbon and Ala Hβ proton chemical shifts at the highest field of the β-sheet peaks respectively without changing the relative chemical shift difference among all peaks. Thus, the references of the calculated smallest chemical shift of Ala Cβ carbon will be 19.7 ppm and 1.00 ppm for Ala 1Hβ proton. Results

10 ACS Paragon Plus Environment

Page 10 of 43

Page 11 of 43

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biomacromolecules

1. Expanded Ala Cβ β peaks in the 13C CP/MAS NMR spectra of (a) S. c. ricini SF fiber

and

(b)

13

C

selectively

labeled

34-mer

peptide,

GGAGGGYGGDGG(A)6[3-13C]A19 (A)5GGAGDGYGAG In our previous papers,20-23,36,57 we reported that the packing effect was clearly observed for Ala Cβ peaks in the 13C CP/MAS NMR spectra of B. mori and S. c. ricini SF fibers.

Fig. 1 shows expanded Ala Cβ peaks of the 13C CP/MAS NMR spectra of (a)

non-labeled S. c. ricini SF fiber and (b)13C single-labeled [3-13C] A19- 34-mer peptide, GGAGGGYGGDGG(A)6[3-13C] A19(A)5GGAGDGYGAG as the model for a typical tandem sequence of S. c. ricini SF. The broad peak observed at 16.6 ppm was assigned to random coil conformation, and other lower field peaks from 19.7 to 22.7 ppm were assigned to β-sheet structure.36,39,57 The peak pattern looked essentially the same between the spectra (a) and (b) although the latter peak was slightly sharper. The

13

C

single-labeled 34-mer peptides with the same molecular structure but with different 13C labeling sites were also synthesized and the 13C CP/MAS NMR spectra observed.58 The Ala Cβ peak patterns of the

13

C CP/MAS NMR spectra were essentially the same as

reported previously. This was an important starting point in the NMR structural analysis of the SF fiber. We must first judge whether the multiplet observed in the lower field Ala Cβ peak came from (a) inter-molecular packing effect of the PLA sequences or (b) intra-molecular heterogenous distribution of the local conformations of the PLA 11 ACS Paragon Plus Environment

Biomacromolecules

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 12 of 43

sequences along one SF chain. The essentially same pattern between them meant that the multiplet clearly came from the packing effect. It is noted that a particular residue might exist in multiple packing sites in the entire system. The chemical shifts of the highest field peak A and the lowest field peak C in the β-sheet region were easily determined to be 19.7 ppm and 22.7 ppm, respectively. However, the central peak B was slightly broader than the other peaks.

Fig. 1 The expanded Ala Cβ regions in 13C CP/MAS NMR spectra of non-labeled Samia cynthia

ricini

silk

fibroin

fiber

(a)

and

GGAGGGYGGDGG(A)6[3-13C]

A19(A)5GGAGDGYGAG (b).

2. Determination of Ala Cα α and C=O chemical shifts of [U-13C] S. c. ricini SF fiber Although the Ala Cα and Ala C=O peaks looked like a single peak (Fig. 2), it was necessary to examine the chemical shift difference for each peak which corresponded to the multiplet in the β-sheet region of the Ala Cβ peak. Fig.2 shows the

13

C DARR

spectrum of [U-13C] S. c. ricini SF fiber. However, there were no splits in the Ala Cα and 12 ACS Paragon Plus Environment

Page 13 of 43

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biomacromolecules

Ala C=O peaks. Thus, the packing effect of the SF chains could be observed exclusively in the Ala Cβ peak. The Ala Cβ carbons are located outside of β-sheet planes of SF molecule and therefore only these Cβ carbons seem to be sensitive to the arrangement of the adjacent SF molecules, namely, the packing effect in the solid state. The observed 13C chemical shifts of PLA carbons of S. c. ricini SF fiber are summarized in Table 1 and compared with the calculated chemical shifts.

Fig. 2 13C DARR spectrum of [U-13C] Samia cynthia ricini silk fibroin fiber.

The correlations between Ala Cβ and Ala C=O carbons, and between Ala Cβ and Ala Cα carbons are also shown as the inserted Figs.

Table 1. Observed chemical shifts of Samia cynthia ricini silk fibroin fiber (13C) and their crystalline fraction (1H). The calculated

13

C and 1H chemical shifts of the PLA

model with anti-parallel β-sheet structure are also listed and consistent with the observed

13 ACS Paragon Plus Environment

Biomacromolecules

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Ala Cβ carbon and Ala Hβ proton chemical shifts at the highest field of the β-sheet peaks without changing the relative chemical shift difference among all peaks. Thus, the calculated smallest chemical shift will be 19.7 ppm for Ala Cβ carbon and 1.0 ppm for Ala 1Hβ proton.

13

C

S. c. ricini fiber

Calc.

1

H

S. c. ricini crystal

Calc.

Ala

Ala

C=O



175.5

50.0

Ala Cβ

r.c.

16.6

A

19.7

B

21.2

C

22.9

176.6

52.6

1 ○

19.7

177.4

50.4

2 ○

20.7

177.2

51.1

3 ○

21.6

178.9

50.5

4 ○

24.4

Ala

Ala

HN



8.95

5.00

Ala Hβ

A

1.00

B

1.20

C

1.40

10.48

5.62

1 ○

1.01

10.14

5.93

2 ○

0.99

14 ACS Paragon Plus Environment

Page 14 of 43

Page 15 of 43

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biomacromolecules

10.80

5.88

3 ○

1.53

10.82

5.82

4 ○

1.40

3. Determination of Ala Hβ β, Hα α and NH chemical shifts of the crystalline fraction of [U-13C] S. c. ricini SF The resolution of one-dimensional 1H solid state NMR was generally poor, but the use of higher field magnet like 920 MHz equipped with ultra-high speed MAS probe at 70 kHz could attain remarkably well-resolved1H solid-state NMR spectrum.10,23,51-53 The latter ultra-high speed MAS probe for solid-state NMR that needs a small amount of sample was developed by us and the details were already reported elsewhere.52,59-61 The 1

H nuclei are located outside the SF molecules and therefore expected to be more

sensitive to the packing effect. In order to ensure higher resolution 1H solid-state NMR spectrum, we prepared the crystalline fraction of [U-13C] S. c. ricini SF with anti-parallel β-sheet structure.54 In addition, the powder form was suitable for filling the sample into the high speed MAS probe with a 1.5-mm outer diameter. Fig.3 shows 1H-detection in the double CP 1H-13C correlation NMR spectrum of the crystalline fraction. The correlations between Ala Cβ β−sheet Α peak and the top of the Ala Hβ peak and also between Ala Cβ β−sheet C peak and the Ala Hβ C peak were detected in the 2D spectrum, although the Ala Hβ C peak apparently did not show up. A clear correlation could not be observed between Ala Cβ β−sheet B peak and the Ala Hβ B peak in the 2D

15 ACS Paragon Plus Environment

Biomacromolecules

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

spectrum. There were no such correlations between Ala Cβ (Α and C) and Ala Hα peaks and also between Ala Cβ and Ala NH peaks. These results indicate that chemical shift difference between Ala Cα (A and C) is quite small. The observed 1H chemical shifts are summarized in Table 1 and compared with the calculated chemical shifts.

Fig.3 1H-detection in the double CP 1H-13C correlation spectrum of the crystalline fraction of Samia cynthia ricini silk fibroin.

4. Preparation of new packing structure The original Arnott model35 for PLA molecule is shown in Fig.4(a). It consists of interleaved anti-parallel β-sheets with neighboring sheets randomly displaced ±1/2 a (a being the interchain distance). Optimum values of the packing and conformational

16 ACS Paragon Plus Environment

Page 16 of 43

Page 17 of 43

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biomacromolecules

parameters for the statistical structure were determined by the linked-atom least-squares method for polymer crystalline fibers.62 In addition, the model claimed that there was essentially no significant difference between the unit cell dimensions of the PLA molecules obtained by Arnott and those of the crystalline region, i.e., the PLA sequences of tussah SF fiber. (Tussah silk is produced from larvae of several species of silkworms belonging to the moth genus Antheraea, including A. mylitta, A. pernyi and A. yamamai.) Thus, the primary structure of tussah SF consisted of polyalanine sequence of Ala12-13 (PLA), embedded in a Gly-rich amorphous matrix, which was similar to that of S. c. ricini SF.27-30

Earlier, we have already reported that the

13

C CP/MAS NMR spectra

were very similar between S. c. ricini SF and A. pernyi SF.63 Thus, the structural model of PLA molecules proposed by Arnott et al.

35

seemed to be suitable as an initial model

for the structural analysis of the crystal structure of S. c. ricini SF fiber. The energy optimization of the original Arnott model was done by CASTEP calculation, and a new staggered model was obtained as shown in Fig.4(b). The packing structures looked slightly different between two structural models.

17 ACS Paragon Plus Environment

Biomacromolecules

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

(a)

(b)

Fig.4 (a) Arnott model and (b) new staggered model. Details are described in the text. 1 to ○ 4 , and also The Ala Cβ carbons with different environments were noted as from ○

1 * to ○ 4 * in Fig.4(b) used in the peak assignments. The inter-molecular direct from ○

hydrogen bonding pairs of NH…O=C bonds were noted as from I to IV, and also from I* to IV* in Fig.4(b).

In order to discuss the difference of two models more quantitatively, the geometric parameters of the inter-molecular direct hydrogen bonding between the NH group in one 18 ACS Paragon Plus Environment

Page 18 of 43

Page 19 of 43

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biomacromolecules

molecule and the C=O group in another molecule were summarized in Table 2. Here the numbers I-IV and I’-IV’ correspond to the inter-molecular direct hydrogen bonding pairs of NH…O=C bonds in Fig.4(b). For Arnott model, the averaged distances of the NH…O=C bonds were 1.88 Å and the angles of N-H…O were 163°, while the corresponding averaged distances were 1.81 Å and the angles of N-H…O were 165° for the new staggered model. Thus, the hydrogen bonding networks of the latter model are more packed and the structure is expected to be more stable.57 From the atomic distance of the NH…O=C bond, it is possible to predict the chemical shift of NH protons.52 The predicted values are also listed in Table 2. Table 2. The distances (Å) of the inter-molecular direct hydrogen bonding between NH group in one molecule and C=O group in another molecule together with the related angles (degree) for the hydrogen bonding of both packing structures, i.e., Arnott model in Fig.4(a) and new staggered model in Fig.4(b). Predicted chemical shifts of NH protons (δNH) were evaluated from the hydrogen bond distances of NH…O=C. Arnott model NH-O

New staggered model

distance / Å

angle / deg

δNH / ppm

distance / Å

angle / deg

δNH / ppm

I

1.885

156.7

8.8

1.853

161.0

8.9

II

1.859

168.1

8.9

1.768

168.3

9.4

III

1.887

158.7

8.8

1.808

162.9

9.2

IV

1.905

168.7

8.7

1.817

166.3

9.1

III'

1.887

158.7

8.8

1.809

162.9

9.2

IV'

1.905

168.7

8.7

1.817

166.2

9.1

I'

1.885

156.7

8.8

1.859

160.9

8.9

19 ACS Paragon Plus Environment

Biomacromolecules

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 20 of 43

II'

1.859

168.1

8.9

1.770

168.2

9.4

Average

1.884

163.1

8.8

1.813

164.6

9.1

5. 13C and 1H NMR chemical shift calculations The

13

C and 1H chemical shifts were calculated using GIPAW method for the new

staggered model obtained here. The calculated chemical shifts are listed in Table 1 1 to ○ 4 for different Ala together with the observed chemical shifts. Here the numbers ○

Cβ carbon correspond to the numbers in Figs. 4(b). The observed and calculated chemical shifts are also shown in Fig. 5 as stick spectra. In order to emphasize the assignments of the Ala Cβ and Ala Hβ peaks which are sensitive to the packing effect, the calculated smallest chemical shifts of Ala Cβ carbon and Hβ proton have been adjusted to be consistent with the observed highest field peaks in the β-sheet regions. These are 19.7 ppm for the 13C peaks and 1.00 ppm for the 1H peaks, respectively. Thus, the apparent difference between the calculated and observed chemical shift tends to be larger for Ala C=O and Ala NH peaks.

20 ACS Paragon Plus Environment

Page 21 of 43

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biomacromolecules

Fig.5 Observed chemical shifts of Samia cynthia ricini silk fibroin fiber (13C) and their crystalline fraction(1H) together with the assignments. The calculated

13

C and 1H

chemical shifts of the PLA model were also shown which have been adjusted to be consistent with the respective observed Ala Cβ carbon and Ala Hβ proton chemical shifts at the highest field of the β-sheet peaks without changing the relative chemical shift difference among all peaks. Thus, the calculated smallest chemical shift will be 19.7 ppm

21 ACS Paragon Plus Environment

Biomacromolecules

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

for Ala Cβ carbon and 1.0 ppm for Ala 1Hβ proton. Details are described in the text. The chemical shifts are shown as stick spectra.

1 to ○ 4 , correspond to four carbons The four calculated Ala Cβ chemical shifts, ○

with different environments in the new staggered model shown in the Fig.4 (b). The four 1 * to ○ 4 *, are also shown in Fig.4(b), but the environments are essentially the carbons, ○

1 to ○ 4 , respectively. There are three observed peaks, from peak A to same as those of ○

peak C. The central peak, B, located between peaks A and C seems to be broad. Thus, we 1 and the lowest field peak C to carbon ○ 4 assigned the highest field peak A to carbon ○

2 to peak A . In view of the observed highest intensity of peak A, we assigned carbon ○

3 was assigned to peak B. As for Ala Cα and C=O too. Thus, the remaining carbon ○

1 -○ 4 were also slightly different. carbons, the calculated chemical shifts of the carbons ○

However, the calculated maximum difference was 2.2 ppm for both Ala Cα and Ala C=O carbons, which were less than half of the corresponding chemical shift variation of Ala Cβ carbon at 4.7 ppm (Table 1). Therefore, it seemed that the chemical shift distribution could not be observed in the 13C CP/MAS NMR spectra of Ala Cα and Ala C=O peaks, contrary to the case of Ala Cβ peak.

As shown in Fig.3, there are two

correlations between Ala Cβ β−sheet Α peak and the top of the Ala Hβ peak, and also between Ala Cβ β−sheet C peak and the Ala Hβ C peak in the observed double CP 1

H-13C correlation spectrum. From the comparison with the observed and calculated 22 ACS Paragon Plus Environment

Page 22 of 43

Page 23 of 43

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biomacromolecules

1 and ○ 2 in Fig.4(b) can be assigned to the Ala Hβ A chemical shifts, the protons ○

3 and ○ 4 are assigned to the lower field peak at 1.0 ppm, and other two protons ○

shoulder of the peak at 1.00 ppm, although the Ala Hα C peaks can be assigned to either one. The calculated peaks are slightly different among four carbons in both Ala Hα and Ala NH regions, but we have not tried to make further assignments. 6. Confirmation of the assignment of the peaks, A, B and C in Ala Cβ β region from the 13C DARR spectrum of S.c.ricini SF fiber Fig. 6(a) shows the expanded Ala Cβ region in the 13C-13C DARR spectrum of [U-13C] S. c. ricini SF fiber shown in Fig. 2. The correlation between the β-sheet peaks, A and C 1 and ○ 4 was clearly observed, indicating that the distance between the carbons ○

assigned to the peaks A and C were shorter than 5 Å under our NMR experimental condition

40,41,47

. In addition, the distance between the carbons assigned to the peaks A

and B was also shorter than 5 Å because the correlation between these two peaks was also observed. However, the correlation between the peaks B and C could not be observed. The inter-atomic distances between two Ala Cβ carbons in the new staggered model shown in Fig.4(b) were systematically calculated and listed in Table 3. The distances less 1 and ○ 1 * atoms, than 5 Å are marked in Table 3. The distances between the carbons ○

were less than 4 Å, but the correlation could not be observed because of the same 23 ACS Paragon Plus Environment

Biomacromolecules

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

1 and ○ 1 *. Similarly, the correlations between ○ 2 and chemical shifts of the carbons ○

2 * (and between ○ 3 and ○ 3 * and between ○ 4 and ○ 4 *) could not be observed. ○

1 (○ 1 *) Except for these, the distances less than 5 Å were observed between carbons ○

3 (○ 3 *) , and between the carbons ○ 2 (○ 2 *) and ○ 4 (○ 4 *). Judging from the and ○

calculated chemical shifts and peak intensities, we assigned the highest field peak, Ala 1 and ○ 2 . Thus, the observation of the correlation between Hβ β-sheet A, to carbons ○

Ala Hβ β-sheet A and Ala Hβ β-sheet C peaks indicates that the distance between 1 or ○ 2 and the carbon ○ 3 or ○ 4 is less than 5 Å. Similarly, the correlation carbons ○

observed between the Ala Hβ β-sheet A and Ala Hβ β-sheet C peaks indicates that the 1 or ○ 2 and the carbon ○ 3 or ○ 4 was less than 5 Å. Thus, distance between carbons ○

these observations support the assignment of the peaks, A, B and C, in the β-sheet region of the Ala Cβ peak.

24 ACS Paragon Plus Environment

Page 24 of 43

Page 25 of 43

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biomacromolecules

Fig. 6 (a) Expanded Ala Cβ region in 13C-13C DARR spectrum of [U-13C] Samia cynthia ricini silk fibroin fiber. The correlations between the Ala Cβ and Ala Cα carbons, and between the Ala Cβ and Ala C=O carbons are also shown in the figure. The distances 1 and ○ 2 ) and carbon ○ 4 , and also those less than 5 Å between the group (carbons ○

1 and ○ 2 ) and carbon ○ 3 in the new staged model are between the group (carbons ○

shown as a histogram in Fig. 6(b) used for interpretation of the observed correlations in Fig.6(a).

Table 3 The calculated distances (Å) between two Ala Cβ atoms together with the calculated chemical shifts of the marked carbons in Fig.4(b) in the new staggered model. 25 ACS Paragon Plus Environment

Biomacromolecules

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 26 of 43

chemical shift Ala Cβ

1 * ○

1 ○

2 * ○

2 ○

3 ○

3 * ○

4 ○

/ ppm 19.7

1 ○

19.8

1 * ○

3.94

20.7

2 * ○

5.98

7.17

20.7

2 ○

7.17

5.98

4.07

21.6

3 ○

4.22

4.45

5.15

5.62

21.6

3 * ○

4.44

4.23

5.62

5.15

3.68

24.4

4 ○

5.72

5.45

3.86

4.74

6.45

6.22

24.4

4 * ○

5.46

5.72

4.74

3.85

6.22

6.46

4.06

Discussion PLA sequences, (Ala)n (n= 12,13) with anti-parallel β-sheet structure are found in the crystalline fraction of S. c. ricini SF fiber.27 (Ala)n also occurs in the crystalline fraction of other wild silk worm silks

2,28-30

and the major ampullates of the silks from many

spiders7,31-34,42-46 although the length n is relatively shorter for the latter spider silks. Thus, the PLA sequences seem to be a key element in the structures of these silk fibers with high strength. In our previous paper,39 we determined the packing structures of a series of (Ala)n (n= 3,4,5,6,7,8,12) with anti-parallel β-sheet structure using 13C CP/MAS NMR and X-ray diffraction powder patterns. The (Ala)n peptides pack into two different arrangements, depending on the length of the sequence. Thus, short sequences (n=6 or 26 ACS Paragon Plus Environment

Page 27 of 43

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biomacromolecules

less) pack into a rectangular arrangement, but longer sequences pack in a staggered arrangement. Both the line shapes of Ala Cβ carbons in the

13

C CP/MAS NMR spectra

and X-ray powder diffraction patterns show a similar pattern for the short, antiparallel oligopeptides (Ala)4, (Ala)5 and (Ala)6, indicating that these peptides have similar crystal structures to that of antiparallel (Ala)4 determined by us.39 using X-ray single crystal structural analysis (Supporting information Fig. S1). On the other hand, for (Ala)7, (Ala)8, (Ala)12 and S. c. ricini SF fiber with anti-parallel β-sheet structures, both the

13

C

CP/MAS NMR spectra and X-ray diffraction data are markedly different from those for short PLA sequences. Thus, if we can determine the latter packing arrangement, which is different from the former packing arrangement that is typical for (Ala)4, we can cover all of the packing structures of PLA sequences that appear in the crystalline domains of spider silks and wild silkworm silks. In this paper, the atomic co-ordinates of the packing structures of S. c. ricini SF fiber was reported. The line shapes of the Ala Cβ carbons were essentially the same between S. c. ricini SF and [3-13C] A19 34-mer peptide with anti-parallel β-sheet structure. This meant that the multiplet observed in the Ala Cβ peaks was due to short range packing effect along the PLA sequence of S. c. ricini SF. Indeed, this was also supported from the observation that the Ala Cβ spectral pattern was essentially the same among (Ala)n (n=7,8,12) and S. c. ricini silk fiber. In our previous paper,36 the structural transition from 27 ACS Paragon Plus Environment

Biomacromolecules

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

α-helix (the structure before spinning) to β-sheet (the structure after spinning) of S. c. ricini SF film was monitored by stretching of the SF film using 13C CP/MAS NMR. The film was prepared from SF stored in the silk gland. The β-sheet peak was observed with increasing stretching ratio, starting at the x6 stretching ratio. At the initial stage of appearance of β-sheet, the relative intensity of the highest field peak at 19.7 ppm in Ala Cβ region was higher than in the spectrum of the SF fiber. Thus, the β-sheet structure observed at the initial stage of the stretching seems to be incomplete and different from the new staggered model. The structural model of PLA molecule with anti-parallel β-sheet structure reported by Arnott et al.35 using X-ray diffraction analysis was considered to be the initial structure for further structural analysis performed here. The energetically optimized model calculated by CASTEP program and GIPAW chemical shift calculation was effectively used to determine the new staggered model. The advantage of these approaches has been proved in the determination of the packing structures of (Ala)n (n=3,4) where the atomic co-ordinates were known 52,53 and also the packing structures of B. mori SF fibers before and after spinning.23,51 The packing structure as well as the conformation of silk fibers is considered to be the origin of high fiber strength and toughness1-6, and therefore it is very important to determine the packing structure. However, it is not easy to determine the

28 ACS Paragon Plus Environment

Page 28 of 43

Page 29 of 43

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biomacromolecules

packing structure because SF is heterogenerous10,20,36,39 and it is difficult to obtain single crystals from SF for single crystal X-ray analysis35,39. There are other advantages of these approaches. The new staggered model proposed here can explain the observed 13C and 1H spectra of S. c. ricini SF fiber, especially, the Ala methyl region that is sensitive to the packing structure. The model was also supported by the calculated distances. Although it is difficult to point out the origin of the difference between the observed and calculated chemical shifts correctly, we think that the use of the initial structural model of poly(L-alanine) determined by Arnott et al.35 might be one of the main origins of the discrepancies. Thus, there are Gly-rich regions other than (Ala)12 sequences in real S.c. ricini silk fibroin and the presence of such Gly-rich regions in the chain may modify the structure of (Ala)12 sequences. We believe that the solid state NMR is at present the best analytical method to address this complicated problem and to determine the inter-molecular arrangement of the native silk fibers at atomic level. The 13C DARR spectral pattern of the Cβ region of Ala residue in the PLA sequences of the silk fiber was also compatible with the DARR pattern for distances less than 5 Å predicted from the model. Thus, at present we can propose the packing structures for all PLA sequences that appear in the crystalline domains of spider silks and wild silkworm silks. Our structural analysis gives also an important lesson in the NMR analysis of SF 29 ACS Paragon Plus Environment

Biomacromolecules

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

fiber. In general, the observation of multiple peaks in the β-sheet region of Ala Cβ peak in the solid-state NMR spectrum of SF fiber may imply the presence of the corresponding number of different packing structure. However, our NMR study showed that multiple peaks were assigned to the carbons with different environments in the same packing structure. This was done by the combination of solid-state NMR and chemical shift calculation with CASTEP and GIPAW methods. Thus, peak assignments must be done very carefully, especially for the β-sheet structure where inter-molecular atomic distances are relatively close. Conclusions From the detailed studies conducted in this work, a new staggered model (Fig.7) could be proposed for the packing arrangement of the poly-L-alanine sequence of Samia cynthia ricini silk fibroin fiber using the Cambridge Serial Total Energy Package program and the chemical shift calculation with Gauge Including Projector Augmented Wave method. This model was supported by the 13C solid-state NMR results of the silk fiber.

30 ACS Paragon Plus Environment

Page 30 of 43

Page 31 of 43

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biomacromolecules

Fig. 7 A new staggered model proposed in this paper. The details are described in the text.

Supporting Information. Fig.S1 Atomic co-ordinate of (Ala)4 with anti-parallel β-sheet structure determined by single crystal X-ray diffraction analysis.

Acknowledgements

T.A. acknowledges support by a Grant-in-Aid for Scientific Research from the Ministry of Education, Science, Culture and Supports of Japan (26248050) and Impulsing Paradigm Change through Disruptive Technologies Program (ImPACT). The calculations were supported by supercomputer system at ICR, Kyoto University. We also thank Dr H. N. Cheng (Southern Regional Research Center, USDA Agricultural Research Service, New Orleans, LA 70124, USA) for discussions.

References 31 ACS Paragon Plus Environment

Biomacromolecules

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

(1) Fu, C.; Shao, Z.; Vollrath, F. Animal silks: their structures, properties and artificial production. Chem. Commun. 2009, 6515- 6529.

(2) Malay, A.D.; Sato, R.; Yazawa, K.; Watanabe, H.; Ifuku, N.; Masunaga, H.; Hikima, T.; Guan, J.; Mandal, B.B.; Damrongsakkul, S.; Numata, K. Relationships between physical properties and sequence in silkworm silks. Sci. Rep. 2016, 6, 27573.

(3) Vepari, C.; Kaplan, D.L. Silk as a Biomaterial. Prog. Polym. Sci. 2007, 32, 991-1007.

(4) Hakimi, O.; Knight, D.P., Vollrath, F.; Vadgama, P. Spider and mulberry silkworm silks as compatible biomaterials. Composites: Part B 2007, 38, 324-337.

(5) Vollath, F.; Porter, D. Silks as ancient models for modern polymers. Polymer 2009, 50, 5623-5632.

(6) Humenik, M., Smith, A.M., Scheibel, T. Recombinant Spider Silks-Biopolymers with Potential for Future Applications. Polymers 2011, 3, 640-661.

(7) Asakura, T.; Miller, T. Eds. Biotechnology of Silk; Springer, Dordrecht, 2014.

(8) Tokareva, O.; Jacobsen, M.; Buehler, M.; Wongb, J.; Kaplan, D.L. Structure–function–property–design interplay in biopolymers: Spider silk. Acta Biomaterialia, 2014, 10, 1612-1626. 32 ACS Paragon Plus Environment

Page 32 of 43

Page 33 of 43

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biomacromolecules

(9) Koh, L.D.; Cheng, Y.; Teng, C.P.; Khin, Y.W.; Loh, X.J.; Tee, S.Y.; Low, M.; Ye, E.; Yu, H.D.; Zhang, Y.W.; Han, M.Y. Structures, mechanical properties and applications of silk fibroin materials. Prog. Polym. Sci. 2015, 46, 86-110.

(10) Asakura, T.; Okushita, K.; Williamson, M.P. Analysis of the Structure of Bombyx mori Silk Fibroin by NMR. Macromolecules 2015, 48, 2345-2357.

(11) Pereira, R.F.P.; Silva, M.M.; Bermudez, V.d.Z. Bombyx mori Silk Fibers: Am Outstanding Family of Materials. Macromol. Mater. Eng. 2015, 300, 1171-1198.

(12) Zhou, C.Z.; Confalonieri, F.; Medina, N.; Zivanovic, Y.; Esnault, C.; Yang, T.; Jacquet, M.; Janin, J.; Duguet, M.; Perasso, R.; Li, Z.G. Fine organization of Bombyx mori fibroin heavy chain gene. Nucleic Acids Res. 2000, 28, 2413-2419.

(13) Zhou C.Z.; Confalonieri, F.; Jacquet, M.; Perasso, R.; Li, Z.G.; Janin, J. Silk fibroin: structural implications of a remarkable amino acid sequence. Proteins: Struct., Funct., Genet. 2001, 44, 119-122.

(14) Marsh, R.E.; Corey, R.B.; Pauling, L. An investigation of the structure of silk fibroin. Biochim. Biophys. Acta. 1955, 16, 1-34.

(15) Takahashi, Y.; Gehoh, M.;Yuzuriha, K. Structure refinement and diffuse streak scattering of silk (Bombyx mori). Int. J. Biol. Macromol. 1999, 24, 127-138.

33 ACS Paragon Plus Environment

Biomacromolecules

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 34 of 43

(16) Asakura, T.; Kuzuhara, A.; Tabeta, R.; Saito, H. Conformation Characterization of Bombyx mori Silk Fibroin in the Solid State by High-Frequency

13

C Cross

Polarization-Magic Angle Spinning NMR, X-ray Diffraction, and Infrared Spectroscopy. Macromolecules, 1985, 18, 1841-1845.

(17) Percot, A.; Colomban, P.; Paris, C.; Dinh, H.M.; Wojcieszak, M.; Mauchamp, B. Water dependent structural changes of silk from Bombyx mori gland to fibre as evidenced by Raman and IR spectroscopies. Vib. Spectrosc. 2014, 73, 79-89.

(18) Boulet-Audet, M.; Vollrath, F.; Holland, C. Identification and classification of silks using infrared spectroscopy. J. Exp. Biol. 2015, 218, 3138-3149.

(19) Lefevre, T.; Paquet-Mercier, F.; Lesage, S.; Rousseau, Marie-Eve; Bedard, S.; Pezolet, M. Study by Raman spectromicroscopy of the effect of tensile defomation on the molecular structure of Bombyx mori silk. Vib. Spectrosc. 2009, 51,136-141.

(20) Asakura, T.; Yao, J.; Yamane, T.; Umemura, K.; Ultrich, A.S. Heterogeneous structure of silk fibers from Bombyx mori resolved by 13C solid-state NMR spectroscopy. J. Am. Chem. Soc. 2002, 124, 8794-8795.

34 ACS Paragon Plus Environment

Page 35 of 43

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biomacromolecules

(21) Asakura, T; Suzuki, Y.; Nakazawa, Y.; Yazawa, K.; Holland, G.P.; Yarger, J.L. Silk structure studied with nuclear magnetic resonance. Prog. Nucl. Magn. Reson. Spectrosc. 2013, 69, 23-68.

(22) Asakura, T.; Suzuki, Y.; Nakazawa, Y.; Holland, G.P.; Yager, J.L. Elucidating silk structure using solid-state NMR. Soft Matter 2013, 9, 11440-11450.

(23) Asakura, T.; Ohata, T.; Kametani, S.; Okushita, K.; Yazawa, K.; Nishiyama, Y.; Nishimura, K.; Aoki, A.; Suzuki, F.; Kaji, H.; Ulrich, A.S.; Williamson, M.P. Intermolecular Packing in B. mori Silk Fibroin: Multinuclear NMR Study of the Model Peptide (Ala-Gly)15 Defines a Heterogeneous Antiparallel Antipolar Mode of Assembly in the Silk II Form. Macromolecules 2015, 48, 28-36.

(24) Pal, S.; Kudu, J.; Talukdar, S.; Thomas, T.; Kundu, S.C. An Emerging Functional Natural Silk Biomaterial from the only Domesticated Non-mulberry Silkworm Samia ricini. Macromol. Biosci. 2013, 13, 1020-1035.

(25) Kundu, B.; Kurland, N.E.; Bano, S.; Patra, C.; Engel, F.B.; Yadavalli, V.K.; Kundu, S.C. Silk proteins for biomedical applications: Bioengineering perspectives. Prog. Polym. Sci. 2014, 39, 251-267.

35 ACS Paragon Plus Environment

Biomacromolecules

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

(26) Silva, S.S.; Oliveira, N.M.; Oliveira, M.B.; Soares da Costa, D.P.; Naskar, D.; Mano, J.F.; Kundu, S.C.; Reis, R.L. Fabrication and characterization of Eri silk fibers-based sponges for biomedical application. Acta Biomaterialia 2016, 32, 178-189.

(27) Sezutsu, H.; Yukuhiro, K. The complete nucleotide sequence of the Eri-silkworm (Samia cynthia ricini) fibroin gene. J. Insect Biotechnol. Sericol. 2014, 83, 59-70.

(28) Sezutsu, H.; Yukuhiro, K. Dynamic Rearrangement Within the Antheraea pernyi Silk Fibroin Gene Is Associated with Four Types of Repetitive Units. J. Mol. Evol. 2000, 51, 329-338.

(29) Numata, K.; Sato, R.; Yazawa, K.; Hikima, T.; Masunaga, H. Crystal structure and physical properties of Antheraea yamamai silk fibers: Long poly(alanine) sequences are partially in the crystalline region. Polymer 2015, 77, 87-94.

(30) Datta, A.; Ghosh, A.K.; Kundu, S.C. Purification and characterization of fibroin from the tropical Saturniid silkworm, Antheraea mylitta. Insect Biochem. Mol. Biol. 2001, 31, 1013-1018.

(31) Gosline, J.M.; Guerette, P.A.; Ortlepp, C.S.; Savage, K.N. The mechanical design of spider silks: from fibroin sequence to mechanical function. J. Exp. Biol. 1999, 202, 3295-3303.

36 ACS Paragon Plus Environment

Page 36 of 43

Page 37 of 43

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biomacromolecules

(32) Hayashi, C.Y.; Shipley, N.H.; Lewis, R.V. Hypotheses that correlate the sequence, structure, and mechanical properties of spider silk proteins. Int. J. Biol. Macromol. 1999, 24, 271-275.

(33) Bonev, B.; Grieve, S.; Herberstein, M.E.; Kishore, A.I.; Watts, A.; Separovic, F. Orientational order of Australian spider silks as determined by solid-state NMR. Biopolymers 2006, 82, 134-143.

(34) Holland, G.P.; Jenkins, J.E.; Creager, M.S.; Lewis, R.V.; Yarger, J.L. Solid-State NMR Investigation of Major and Minor Ampullate Spider Silk in the Native and Hydrated States. Biomacromolecules 2008, 9, 651-657.

(35) Arnott, S.; Dover, S.D.; Elliott, A. Structure of β-poly-L-alanine: refined atomic co-ordinates for an anti-parallel β-pleated sheet. J. Mol. Biol. 1967, 30, 201-208.

(36) Yang, M.; Yao, J.; Sonoyama, M.; Asakura, T. Spectroscopic characterization of heterogeneous structure of Samia cynthia ricini silk fibroin induced by stretching and molecular dynamics simulation. Macromolecules 2004, 37, 3497-3504.

(37) Asakura, T.; Ito, T.; Okudaira, M.; Kameda, T. Structure of Alanine and Glycine Residues of Samia cynthia ricini Silk Fibers Studied with Solid-State 15N and 13C NMR. Macromolecules 1999, 32, 4940-4946.

37 ACS Paragon Plus Environment

Biomacromolecules

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 38 of 43

(38) van Beek, J.D.; Beaulieu, L.; Schäfer, H.; Demura, M.; Asakura, T.; Meier, B.H. Solid-state NMR determination of the secondary structure of Samia cynthia ricini silk. Nature 2000, 405, 1077-1079.

(39) Asakura, T.; Okonogi, M.; Horiguchi, K.; Aoki, A.; Saito, H.; Knight, D.P.; Williamson, M.P. Two Different Packing Arrangements of Antiparallel Polyalanine. Angew. Chem., Int. Ed. 2012, 51, 1212-1215.

(40) Takegoshi, K.; Nakamura, S.; Terao, T.

13

C–1H dipolar-assisted rotational

resonance in magic-angle-spinning NMR. Chem. Phys. Lett. 2001, 344, 631–637.

(41) Takegoshi, K.; Nakamura, S.; Terao, T. without

13

13

C–1H dipolar-driven13C–13recoupling

C RF irradiation in nuclear magnetic resonance of rotating solids. J. Chem.

Phys. 2003, 118, 2325–2341.

(42) Bayro, M. J.; Hube, M.; Ramachandran, R.; Davenport, T. C.; Meier, B. H.; Ernst, M.; Griffin, R.G. Dipolar truncation in magic-angle spinning NMR recoupling experiments, J. Chem. Phys. 2009, 130, 114506.

(43) Holland, G.P.; Creager, M.S.; Jenkins, J.E.; Lewis, R.V.; Yager, J.L. Determining Secondary Structure in Spider Dragline Silk by Carbon−Carbon Correlation Solid-State NMR Spectroscopy. J. Am. Chem. Soc. 2008, 130, 9871-9877.

38 ACS Paragon Plus Environment

Page 39 of 43

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biomacromolecules

(44). Jenkins, J.E.; Creager, M.S.; Lewis, R.V.; Holland, G.P.; Yarger, J.L. Quantitative correlation between the protein primary sequences and secondary structures in spider dragline silks. Biomacromolecules 2010, 11, 192-200.

(45). Izdebski, T.; Akhenblit, P.; Jenkins, J. E.; Yarger, J. L.; Holland, G. P. Structure and Dynamics of Aromatic Residues in Spider Silk: 2D Carbon Correlation NMR of Dragline Fibers. Biomacromolecules, 2010, 11, 168-174.

(46) Jenkins, J.E.; Holland, G.P.; Yarger, J.L. Characterizing the Secondary Protein Structure of Black Widow Dragline Silk Using Solid -State NMR and X-ray Diffraction. Biomacromolecules 2013, 14, 3472-3483.

(47) Okushita, K.; Asano, A.; Williamson, M.P.; Asakura, T. Local Structure and Dynamics of Serine in the Heterogeneous Structure of the Crystalline Domain of Bombyx mori Silk Fibroin in Silk II Form Studied by 2D 13C–13C Homonuclear Correlation NMR and Relaxation Time Observation. Macromolecules 2014, 47, 4308-4316.

(48) Agarwal,

V.; Faelber,

K.; Schmieder, P.; Reif, B.

High-Resolution

Double-Quantum Deuterium Magic Angle Spinning Solid-State NMR Spectroscopy of Perdeuterated Proteins. J. Am. Chem. Soc. 2009, 131, 2–3.

39 ACS Paragon Plus Environment

Biomacromolecules

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 40 of 43

(49) Segall, M.D.; Lindan, P.J.D.; Probert, M.J.; Pickard, C.J.; Hasnip, P.J.; Clark, S.J.; Payne, M.C. First-principles simulation: ideas, illustrations and the CASTEP code. J. Phys.: Condens. Matter 2002, 14, 2717–2744.

(50) Pickard, C.J.; Mauri, F. All-electron magnetic response with pseudopotentials: NMR chemical shifts. Phys.Rev.B 2001, 63, 245101.

(51) Asakura, T.; Suzuki, Y.; Yazawa, K.; Aoki, A.; Nishiyama, Y.; Nishimura, K.; Suzuki, F.; Kaji, H. Determination of Accurate 1H Positions of (Ala-Gly)

n

as a

Sequential Peptide Model of Bombyx mori Silk Fibroin before Spinning (Silk I). Macromolecules 2013, 46, 8046-8050.

(52) Yazawa, K.; Suzuki, F.; Nishiyama, Y.; Ohata, T.; Aoki, A.; Nishimura, K.; Kaji, H.; Shimizu, T.; Asakura, T. Determination of accurate 1H positions of an alanine tripeptide with anti-parallel and parallel β-sheet structures by high resolution 1H solid state NMR and GIPAW chemical shift calculation. Chem. Comm. 2012, 48, 11199-11201.

(53) Asakura, T.; Yazawa, K.; Horiguchi, K.; Suzuki, F.; Nishiyama, Y.; Nishimura, K.; Kaji, H. Difference in the structures of alanine tri- and tetra-peptides with antiparallel β-sheet assessed by X-ray diffraction, solid-state NMR and chemical shift calculations by GIPAW. Biopolymers 2014, 101, 13-20. 40 ACS Paragon Plus Environment

Page 41 of 43

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biomacromolecules

(54)Asakura,T.; Kashiba, H.; Yoshimizu, H. NMR of silk fibroin. 8. Carbon-13 NMR analysis of the conformation and the conformational transition of Philosamia cynthia ricini silk fibroin protein on the basis of Bixon-Scheraga-Lifson theory. Macromolecules 1988, 21, 644-648.

(55) Nakazawa, Y.; Asakura, T. Structure Determination of a Peptide Model of the Repeated Helical Domain in Samia cynthia ricini Silk Fibroin before Spinning by a Combination of Advanced Solid-State NMR Methods. J. Am. Chem. Soc., 2003, 125, 7230-7237.

(56) Parrilla, F.A.; Wehrle, B.; Bräunling, H.; Limbach,H.H. Temperature gradients and sample heating in variable temperature high speed MAS NMR spectroscopy. J. Magn. Res., 1969, 87, 592-597.

(57) Asakura, T.; Okonogi, M.; Nakazawa, Y.; Yamauchi, K. Structural Analysis of Alanine Tripeptide with Antiparallel and Parallel β-Sheet Structures in Relation to the Analysis of Mixed β-Sheet Structures in Samia cynthia ricini Silk Protein Fiber Using Solid-State NMR Spectroscopy. J. Am. Chem. Soc. 2006, 128, 6231-6238.

(58) Asakura, T.; Miyazawa, K.; Tasei, Y.; Kametani, S.; Nakazawa, Y.; Aoki, A.; Naito, A. Packing Arragement of

13

C Selectively Labeled Sequence Model Peptides of

41 ACS Paragon Plus Environment

Biomacromolecules

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 42 of 43

Samia cynthia ricini Silk Fibroin Fiber studied by Solid state NMR. Phys. Chem. Chem. Phys., 2017, in press.

(59) Yamauchi,K.; Imada, T.; Asakura, T. Use of Microcoil Probehead for Determination of the Structure of Oriented Silk Fibers by Solid-State NMR. J. Phys. Chem. B 2005, 109, 17689-17692.

(60) Yamauchi, K.; Asakura, T. Development of MicroMAS NMR Probehead for Mass-limited Solid-state Samples. Chem. Lett. 2006, 35, 426-427.

(61) Yamauchi, K.; Yamasaki, S.; Takahashi, R.; Asakura, T. Microscopic structural analysis of fractured silk fibers from Bombyx mori and Samia cynthia ricini using

13

C

CP/MAS NMR with a 1mm microcoil MAS NMR probehead. Solid State NMR 2010, 38 27-30.

(62) Arnott, S.; Wonacott, A.J.; Atomic co-ordinates for an α-helix: Refinement of the crystal structure of α-poly-L-alanine. J. Mol.Biol. 1966, 21, 371-383.

(63) Nakazawa,Y.; Asakura, T. High-Resolution

13

C CP/MAS NMR Study on

Structure and Structural Transition of Antheraea pernyi Silk Fibroin Containing Poly(L-alanine) and Gly-Rich Regions. Macromolecules 2002, 35, 2393-2400.

42 ACS Paragon Plus Environment

Page 43 of 43

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Biomacromolecules

For Table of Contents Use Only

Table of Contents Graphic

A Refined Crystal Structure of Samia cynthia ricini Silk Fibroin Revealed by Solid-State NMR Investigations Tetsuo Asakura,* †Akio Nishimura, † Shunsuke Kametani, † Shuto Kawanishi †Akihiro Aoki, †Furitsu Suzuki,§ Hironori Kaji§ and Akira Naito †



Department of Biotechnology, Tokyo University of Agriculture and Technology, Koganei, Tokyo 184-8588 JAPAN

§

Institute for Chemical Research, Kyoto University, Uji, Kyoto 611-0011, JAPAN

*Correspondence to: Tetsuo Asakura

43 ACS Paragon Plus Environment