
A pseudo-targeted MS method for the sensitive analysis of protein phosphorylation in protein complexes Jiawen Lyu, Yan Wang, Jiawei Mao, Yating Yao, Shujuan Wang, Yong Zheng, and Mingliang Ye Anal. Chem., Just Accepted Manuscript • DOI: 10.1021/acs.analchem.8b00749 • Publication Date (Web): 16 Apr 2018 Downloaded from http://pubs.acs.org on April 16, 2018


Analytical Chemistry

A pseudo-targeted MS method for the sensitive analysis of protein phosphorylation in protein complexes

Jiawen Lyu1,2, Yan Wang1,2, Jiawei Mao1,2, Yating Yao1,2, Shujuan Wang3, Yong Zheng3, Mingliang Ye1,2*

1 CAS Key Laboratory of Separation Sciences for Analytical Chemistry, National Chromatographic R&A Center, Dalian Institute of Chemical Physics, Chinese Academy of Sciences (CAS), Dalian 116023, China;

2 University of Chinese Academy of Sciences, Beijing 100049, China

3 State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China

* To whom correspondence should be addressed: (M.L. Ye) Phone: +86-411-84379610. Fax: +86-411-84379620. E-mail: [email protected].

Abstract

In this study, we present an enrichment-free approach for the sensitive analysis of protein phosphorylation in minute amounts of sample, such as purified protein complexes. The method takes advantage of the high sensitivity of parallel reaction monitoring (PRM). Specifically, low-confidence phosphopeptides identified from the data-dependent acquisition (DDA) dataset were used to build a pseudo-targeted list for PRM analysis, which allowed the identification of additional phosphopeptides with high confidence. Developing this targeted approach is straightforward because the same sample and the same LC system were used for the discovery and the targeted analysis phases. No sample fractionation or enrichment was required for the discovery phase, which allowed the method to analyze minute amounts of sample. We applied this pseudo-targeted MS method to quantitatively examine phosphopeptides in affinity-purified endogenous Shc1 protein complexes at four temporal stages of EGF signaling and identified 82 phospho-sites. To our knowledge, this is the highest number of phospho-sites identified from the protein complexes. This pseudo-targeted MS method is highly sensitive in the identification of low-abundance phosphopeptides and could be a powerful tool for studying the phosphorylation-regulated assembly of protein complexes.

1. Introduction

Proteins rarely perform biological functions on their own; instead, they interact with each other to carry out a task together1. A protein complex is a group of interacting proteins that work together to execute a specific biological function2. Protein phosphorylation plays a crucial role in regulating the assembly of protein complexes3,4. It is therefore important to analyze the dynamic changes of phosphorylation in a protein complex. Direct analysis of a protein digest by shotgun proteomics has very poor sensitivity for protein phosphorylation because phosphopeptides are of low abundance in the sample and their ionization is seriously suppressed by the coexisting non-phosphopeptides5. To reduce the interference of highly abundant non-phosphopeptides, specific enrichment of phosphopeptides is often performed prior to LC-MS/MS analysis6,7, which can identify and quantify over 10,000 phosphopeptides in a typical phosphoproteome analysis8,9. Although enrichment performs very well for phosphoproteomic analysis, it may not be suited to trace amounts of sample, especially protein complex samples, which are often available only at the level of a few micrograms. This is because substantial sample loss may occur when enrichment is performed on trace amounts of sample. Alternatively, sample complexity can be reduced by fractionating the peptide sample prior to LC-MS/MS analysis, which allows the identification of low-abundance peptides10. To our knowledge, this multidimensional separation scheme has never been applied to the analysis of protein complexes, probably because it also requires a large amount of sample.

Shotgun proteomics with the mass spectrometer operated in data-dependent acquisition (DDA) mode is a powerful tool for the discovery of new proteins but faces serious issues with reproducibility and sensitivity11. More importantly, this acquisition method has difficulty identifying low-abundance peptides because they are less likely to be selected for MS2 fragmentation12. Targeted proteomics using either parallel reaction monitoring (PRM) or multiple reaction monitoring (MRM) has gained popularity in recent years because of its high sensitivity, high reproducibility and high accuracy in quantification13,14. It is therefore of interest to develop a targeted MS approach for the analysis of protein phosphorylation in protein complexes. A classical targeted MS workflow includes two phases. First, in a pilot experiment for discovery, a large quantity of sample is subjected to either 2D-LC-MS/MS analysis or phosphopeptide enrichment to maximize the number of identifications. Second, in the targeted analysis phase, the identifications of interest are specifically monitored by 1D-LC-MS with MRM or PRM analysis and quantified across different samples. The sample amount for the pilot experiment was always over 100 µg for phosphorylation analysis15-17. This amount is clearly too large for the development of a targeted approach for the analysis of protein complexes. In this study, we aim to develop a sensitive method for the identification and quantification of p-sites in protein complexes.

The recently developed quadrupole-Orbitrap instrument offers specific trapping capacities that enhance the analysis of low-abundance peptides18,19. Compared with the MRM method on a triple quadrupole MS, PRM on a quadrupole-Orbitrap has higher resolution and selectivity20. In addition to quantitative proteomics, PRM can also be applied to targeted peptide identification because it records all fragment ions of the pre-selected precursor instead of only the 3-5 pre-selected transitions monitored in MRM21. Taking advantage of the high sensitivity of PRM and the availability of full fragment-ion information in its tandem spectra for peptide identification, we propose a pseudo-targeted MS method for the sensitive analysis of protein phosphorylation in protein complexes. Developing this targeted approach is straightforward because the same sample and the same LC system, with different acquisition modes (DDA and PRM), were used for the two phases. No sample fractionation or enrichment was required for the discovery phase, which allowed the method to analyze minute amounts of sample. The method was applied to analyze phosphorylation in endogenous Shc1 complexes. With three DDA runs in the discovery phase and three PRM runs in the targeted analysis, about 3 µg of sample in total was sufficient for the whole workflow, and the number of identified phospho-sites increased by around 50%. Taking advantage of its accurate quantification, the PRM-based method enabled comprehensive mapping of the dynamic changes of the Shc1 complexes upon EGF stimulation.

2. Method

Chemicals.
DMEM medium was from Gibco (Thermo Fisher Scientific). Anti-Flag M2 agarose beads were from Sigma. The Flag-Shc1 HeLa cell line was a gift from the Beijing Proteome Research Center (BPRC). Wild-type HeLa cells were purchased from the National Infrastructure of Cell Line Resource and were used as the negative control. The following antibodies were used in western blotting: anti-EGFR pY1068, anti-ERBB2 and anti-ERBB3 from Cell Signaling Technology; anti-SHC and anti-GRB2 from Abcam; anti-PIK3C2B from Proteintech; and anti-Flag from Sigma Aldrich. All other reagents were purchased from Sigma Aldrich.

Cell culture and western blotting verification.
The cells were seeded in 15 cm dishes (Corning) and cultured at 37 °C in 5% CO2 in DMEM supplemented with 10% FBS. To generate phosphopeptide samples for method comparisons, cells were cultured in serum-free medium for 12 h and then stimulated with EGF (100 ng/mL) for 0, 2, 5 or 20 min. Western blotting verification was performed with anti-Flag and anti-EGFR pY1068 antibodies.

Immunoprecipitation and digestion of Shc1 complexes.
The immunoprecipitation of Shc1 complexes was performed as previously described22. Briefly, cells stimulated with EGF were immediately washed three times with pre-cooled PBS to quench signal transduction. Each dish of cells was then lysed on ice in NP40 lysis buffer (50 mM HEPES-NaOH, pH 8.0, 150 mM NaCl, 0.5% NP40, 2.5 mM MgCl2, 1 mM DTT, 2% protease inhibitor cocktail, 10% glycerol, 1 mM NaF, 1 mM β-glycerophosphate, 10 mM Na4P2O7, 1 mM NaVO4). The total cell lysate was centrifuged at 20800 g and 4 °C for 20 min and the supernatant was collected. The protein concentration of the lysate was determined by Bio-Rad protein assay. Flag-Shc1 was pulled down by incubating the lysate with 6 µL (bed volume) of anti-Flag M2 agarose beads for 4 h or overnight at 4 °C. The beads were spun down at 5000 g and then washed 3 times with lysis buffer and 2 times with 20 mM ammonium bicarbonate (ABC). The washed beads were suspended in 20 µL of 20 mM ABC. For on-bead digestion, 400 ng of trypsin (Sigma) was added to the suspension, which was then incubated at 37 °C overnight with shaking at 800 rpm. The digestion was stopped by adding 3% formic acid, and the supernatant was transferred into a clean tube and dried down.

Enrichment of phosphorylated peptides by Ti-IMAC.
The sample pretreatment followed a previously published procedure6. Briefly, Ti-IMAC beads were added to the digest at a ratio of 1:20 (peptides:beads). After 30 min of incubation, the beads were washed with buffer 1 (50% ACN, 6% TFA, 200 mM NaCl) to remove nonspecifically bound peptides, followed by a wash with buffer 2 (30% ACN, 0.1% TFA) for desalting. Finally, the phosphopeptides were eluted from the beads with 10% ammonium hydroxide and dried down.

Liquid Chromatography.
The dried samples were redissolved in 0.1% FA (1% FA for the Ti-IMAC enriched samples only) and loaded onto a 209-µm inner diameter, 3-cm trap column (packed in-house with 5 µm, 120 Å C18 resin from Sunchrom) at a flow rate of 5 µL/min of mobile phase A and washed for 10 min. Mobile phase A was H2O containing 0.1% formic acid, and mobile phase B was acetonitrile containing 0.1% formic acid. The peptides were then eluted from the trap column and separated by reversed-phase chromatography on a 150-µm inner diameter, 20-cm analytical column with a pulled tip (packed in-house with ReproSil-Pur C18-AQ 1.9 µm resin). The 130 min separation gradient was run at a flow rate of 600 nL/min and was set as follows: 2% mobile phase B from 0 to 10 min, 7% B at 11 min, 27% B at 75 min, 45% B at 105 min, 90% B at 108 min, 90% B at 118 min, 2% B at 120 min and 2% B at 130 min.
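As an illustration only, the gradient program can be written down as a table of (time, %B) setpoints. The sketch below assumes that the pump ramps linearly between consecutive setpoints, which the text does not state explicitly; the names `GRADIENT` and `percent_b` are hypothetical.

```python
from bisect import bisect_right

# (time in min, % mobile phase B) setpoints taken from the gradient description.
GRADIENT = [(0, 2), (10, 2), (11, 7), (75, 27), (105, 45),
            (108, 90), (118, 90), (120, 2), (130, 2)]


def percent_b(t_min):
    """Return %B at time t_min, assuming linear ramps between setpoints."""
    if not GRADIENT[0][0] <= t_min <= GRADIENT[-1][0]:
        raise ValueError("time is outside the 0-130 min gradient")
    times = [t for t, _ in GRADIENT]
    i = bisect_right(times, t_min) - 1  # index of the last setpoint at or before t_min
    if i == len(GRADIENT) - 1:
        return float(GRADIENT[-1][1])
    (t0, b0), (t1, b1) = GRADIENT[i], GRADIENT[i + 1]
    return b0 + (b1 - b0) * (t_min - t0) / (t1 - t0)
```

Under this assumption, the main separation ramp from 7% B to 27% B covers 11 to 75 min, so `percent_b(43)` falls halfway along it.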

Mass spectrometry.
All data were acquired on a Q-Exactive HF hybrid quadrupole-Orbitrap mass spectrometer (Thermo, San Jose, CA), except for the comparison with iDDA, where a Q-Exactive hybrid quadrupole-Orbitrap mass spectrometer was used (refer to the supporting information). The DDA runs used a TopN method in which a high-resolution full MS acquisition (resolution of 60000 at 200 m/z) was followed by 20 fast dd-MS2 acquisitions (resolution of 15000). The FTMS full MS acquisition was set as follows: automatic gain control (AGC) target, 3e6; maximum injection time (Max IT), 20 ms; scan range, 350 to 2000 m/z. The dd-MS2 acquisition was set as follows: AGC target, 5e5; Max IT, 50 ms; isolation window, 1.6 m/z; normalized collision energy (NCE), 27%; centroid mode. Ions carrying charges lower than +1 or higher than +8 were excluded, and dynamic exclusion was set to 20 s.

The PRM runs used a combined full MS and PRM method with inclusion mode on. The isolation list for the identification of phosphopeptides was generated by sorting the potential phosphopeptides identified in the preceding DDA runs, as described in the next section. The MS was run in positive mode. For full MS scans, the resolution was set to 60000, the AGC target to 3e6 and the Max IT to 20 ms, and spectra were recorded in profile mode. For MS2, which refers to PRM in this method, the resolution was set to 30000, the AGC target to 5e5, the Max IT to 503 ms, the isolation window to 1.6 m/z and the NCE to 27; MS2 spectra were also recorded in profile mode. For details of PRM quantification, please refer to the supporting information.

Building of pseudo-targeted library.
The human proteome was obtained from UniProt (http://www.uniprot.org/) and included 20157 protein sequences. All *.raw files generated by the DDA method were converted to *.MGF files with Thermo Proteome Discoverer (version 1.4). Mascot was used to search the *.MGF files against the human database with the following parameters: precursor-ion mass tolerance, 10 ppm; fragment-ion mass tolerance, 0.05 Da; protease, trypsin with up to 2 missed cleavages. The variable modifications were oxidation of methionine (M, +15.9949 Da) and phosphorylation of serine/threonine/tyrosine (S/T/Y, +79.9663 Da). After database searching, the proteins were filtered at < 1% FDR. The proteins identified in each sample were used to generate a new focused database for re-analysis of the *.MGF files to identify potential phosphopeptides. For each matched phosphopeptide, the top-scored spectrum was extracted from the original search DAT file with an in-house Java package. Thus the retention time, m/z, peptide sequence, peptide score and protein accession of the phosphopeptides were available for building the PRM inclusion list. Phosphopeptides that did not belong to the protein complexes were removed, and phosphopeptides that scored higher than 30 were also discarded because they were already confidently identified. The phosphopeptide identifications that met these criteria were considered targets for the subsequent PRM acquisition. The retention-time isolation windows were set to ±3 min around the retention time of the top-scored spectrum.
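The selection rules described in this section can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the in-house Java package used in the study: the `Candidate` record, its field names and the `build_inclusion_list` function are hypothetical, and candidates are assumed to be already parsed from the search results.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    """One phosphopeptide-spectrum match from the focused-database search (hypothetical record)."""
    sequence: str   # modified peptide sequence
    protein: str    # protein accession
    mz: float       # precursor m/z
    charge: int     # precursor charge
    rt_min: float   # retention time of the spectrum, in minutes
    score: float    # peptide score reported by the search engine


def build_inclusion_list(candidates, complex_proteins,
                         score_cutoff=30.0, rt_half_window=3.0):
    """Select low-confidence phosphopeptides of the complex as PRM targets."""
    # Keep only the top-scored spectrum for each phosphopeptide.
    best = {}
    for c in candidates:
        if c.sequence not in best or c.score > best[c.sequence].score:
            best[c.sequence] = c

    targets = []
    for c in best.values():
        if c.protein not in complex_proteins:
            continue  # not a component of the protein complex
        if c.score > score_cutoff:
            continue  # already confidently identified in DDA
        targets.append({
            "sequence": c.sequence,
            "mz": c.mz,
            "charge": c.charge,
            "start_min": c.rt_min - rt_half_window,  # window of +/- 3 min
            "end_min": c.rt_min + rt_half_window,
        })
    return sorted(targets, key=lambda t: t["start_min"])
```

Grouping by modified sequence alone is a simplification chosen for the sketch; a real inclusion list might treat each precursor charge state as a separate target.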

Data analysis.
All phosphorylation sites (abbreviated as phospho-sites or p-sites) were identified with MaxQuant23 because it has a module to evaluate p-site localization confidence. Mascot does not have this function, but its output makes it easy to extract the retention time, m/z and peptide sequence needed to generate the pseudo-targeted library. The reviewed human proteome mentioned above was also used as the database for the identification of phospho-sites. Trypsin was set as the specific enzyme, with a maximum of 2 missed cleavages. Besides phosphorylation of S/T/Y, oxidation of methionine and acetylation of the protein N-terminus were also set as variable modifications. The default settings were retained for all other parameters. Exported p-sites with a localization probability below 0.75 or a score below 40 were discarded. Quantitative data were processed with Skyline24, and MS2 ion chromatograms of the peptides of interest were used as surrogates for the quantification of proteins or phospho-sites. For details of label-free quantification (LFQ) and PRM quantification, please refer to the supporting information. The raw data were uploaded to the jPOST repository. The accession numbers are PXD008923 for ProteomeXchange and JPST000385 for jPOST.
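The p-site acceptance rule can be illustrated with a short Python sketch. It assumes that a site is kept only when it passes both thresholds (localization probability of at least 0.75 and score of at least 40), and that the exported sites have been loaded as dictionaries; the keys `loc_prob` and `score` and the function name are hypothetical, not MaxQuant column names.

```python
def filter_psites(sites, min_prob=0.75, min_score=40.0):
    """Keep p-sites that pass both the localization and the score threshold."""
    return [
        s for s in sites
        if s["loc_prob"] >= min_prob and s["score"] >= min_score
    ]


# Hypothetical example records, for illustration only.
example_sites = [
    {"site": "siteA", "loc_prob": 0.99, "score": 85.0},  # kept
    {"site": "siteB", "loc_prob": 0.60, "score": 90.0},  # ambiguous localization
    {"site": "siteC", "loc_prob": 0.95, "score": 25.0},  # low score
]
confident = filter_psites(example_sites)
```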

3. Results and discussion

Protein phosphorylation plays an important role in the dynamic assembly of protein complexes. It is well known that the EGF receptor tyrosine kinase (EGFR) is autophosphorylated upon EGF stimulation, which provides binding sites for the scaffold protein Shc122. Once Shc1 is associated with EGFR, it is phosphorylated at multiple sites to provide docking sites for cytoplasmic targets. Therefore, Shc1 recruits many proteins to form protein complexes upon EGF stimulation. Using the EGF-dependent Shc1 complex as a test example, we first applied the conventional method to analyze its protein components and their phosphorylation after 2 min of EGF stimulation. A HeLa cell line expressing Flag-tagged Shc1 was used for this study. The presence of Flag-tagged Shc1 and the activation of EGFR were verified by western blotting (Figure S1). When Shc1 was pulled down by immunoprecipitation with anti-Flag antibody, its bona fide interacting proteins as well as a large number of nonspecifically bound proteins were also pulled down25. To distinguish these nonspecifically bound proteins, HeLa cells that also express a Flag-tagged protein, e.g. Flag-GFP, would be an ideal control. However, such a cell line was not available in our lab, so wild-type HeLa cells were used as the negative control in this study. After stimulation with EGF for 2 min, the cells from these two cell lines were lysed and the Shc1 complexes were immunoprecipitated under the same conditions. The obtained proteins were then digested and analyzed by LC-MS/MS in DDA mode as in the standard shotgun proteomics workflow. The proteins identified from the two samples were compared by label-free quantification using MaxLFQ26, and the proteins quantified with a significant difference (p < 0.05) were obtained by t test (permutation-based FDR control, 1%) using Perseus27. Only the proteins with significantly higher abundance in the Flag-Shc1 HeLa cell line were considered potential components of the protein complexes (Figure S2). This resulted in the identification of 29 potential protein components.

Since Shc1 recruits proteins to form protein complexes upon EGF stimulation, the bona fide complex components should increase in abundance in the pulled-down sample with EGF stimulation compared with the unstimulated one. Because of its high sensitivity, PRM was applied to monitor the change in abundance of the potential components of the Shc1 complex between cells treated with EGF for 2 min and untreated cells. In addition to the 29 candidate proteins identified above, 7 additional proteins (PPP1R12A, PEAK1, AP2A2, PIK3CB, ERRFI1, ASAP3 and PPP1CB) that were reported to be related to EGF-dependent Shc1 complexes22 were also added to the potential component list for PRM analysis (see supplementary results). After the analysis, we compared the abundances of these 36 proteins in the IP samples derived from Flag-Shc1 HeLa cells with and without 2 min of EGF stimulation. We found that 33 proteins were up-regulated by over 50% (Figure S3a). These proteins together with Shc1, 34 in total, were confirmed as components of the EGF-dependent Shc1 complex. Among these 34 components, 20 had been reported previously22. The increases in abundance of five proteins, i.e. PIK3C2B, ERBB3, ERBB2, SHC and GRB2, after EGF stimulation were verified by western blotting (Figure S4).

After the component proteins were determined, we identified the phosphorylation sites on these proteins using DDA data acquired from the IP sample with 2 min of EGF stimulation. We searched the DDA data against the human proteome database using MaxQuant, with phosphorylation of Ser/Thr/Tyr residues set as a variable modification. A total of 2166 unique peptides with score > 40 were identified from 592 proteins, including 94 unique phosphopeptides. Of these, 52 unique phosphopeptides were derived from the 34 component proteins, which resulted in the identification of 42 p-sites with a localization probability higher than 0.75.
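The criterion used above to confirm complex components (abundance up-regulated by over 50% after EGF stimulation) can be sketched as a simple ratio filter. This is an illustration under assumptions, not the quantification pipeline of the study: the function name, the input layout and the abundance values are hypothetical, and treating a protein detected only after stimulation as up-regulated is a choice made for the sketch.

```python
def upregulated(abundance, min_increase=0.5):
    """Return proteins whose abundance rose by more than min_increase (0.5 = 50%).

    abundance maps a protein name to (unstimulated, stimulated) abundances.
    """
    hits = []
    for protein, (before, after) in abundance.items():
        if before <= 0:
            # Assumption: detected only after stimulation counts as up-regulated.
            if after > 0:
                hits.append(protein)
            continue
        if after / before > 1.0 + min_increase:
            hits.append(protein)
    return sorted(hits)


# Hypothetical abundance values, for illustration only.
example = {
    "GRB2": (100.0, 400.0),    # 4-fold increase: kept
    "ProtX": (100.0, 120.0),   # +20%: not kept
    "ProtY": (0.0, 50.0),      # only seen after stimulation: kept
    "ProtZ": (100.0, 150.0),   # exactly +50%: not "over 50%"
}
components = upregulated(example)
```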

Using the conventional approach, we identified 42 p-sites from the Shc1 complex after EGF stimulation for 2 min. We then devised a pseudo-targeted MS method, shown in Figure 1, to identify more p-sites from the protein complex. It is well known that targeted analysis using PRM or MRM is more sensitive than conventional shotgun proteomics using DDA. Unlike MRM, PRM provides full tandem spectra for peptide identification. PRM is usually used in targeted proteomics rather than in discovery proteomics, because the peptides to be analyzed must be specified before LC-MS/MS analysis. How, then, can we take advantage of its high sensitivity to identify additional low-abundance phosphopeptides? The key issue is to build a library of potential phosphopeptides to be monitored. We reasoned that this list could be obtained from the DDA data. DDA is known to be biased toward identifying highly abundant peptides28. It has poor sensitivity for low-abundance phosphopeptides, especially when unphosphorylated and phosphorylated peptides are loaded together for LC-MS/MS analysis. However, we believe that some low-abundance phosphopeptides are also fragmented in DDA mode but fail to be identified under strict filtering criteria. To discover these potential low-abundance phosphopeptides, we searched the DDA data against a focused database containing only the proteins identified in the sample. For the Shc1 complex sample, we first identified the proteins present in the 2 min IP sample by searching the DDA data against the human proteome database with Mascot. These identified proteins were then used to construct a focused database, which was used to search the same DDA data to identify potential phosphopeptides. No filtering criteria were applied at this step, so as to maximize the number of potential phosphopeptide identifications. From this step, we identified 1757 potential phosphopeptides. Among these, 222 phosphopeptides were derived from the component proteins of the Shc1 complex. The high-scoring phosphopeptide identifications were removed because they were already confidently identified. After this step, 152 phosphopeptide identifications with scores below 30 were left. These identifications were of low confidence, but some true positive phosphopeptide identifications should be present in this list because the peptide mass and a few fragment ions were matched. Their masses, charges and retention times were used to build an inclusion list for PRM analysis. Because the protein complex sample is not as complex as a proteome sample, a relatively wide retention-time isolation window of 6 min was set to capture as many of the potential phosphopeptides as possible.

After the targeted analysis of these peptides by PRM, the resulting tandem spectra were searched against the human database. In this step, we used MaxQuant for database searching to identify phosphopeptides and localize p-sites. It should be noted that two search engines, Mascot and MaxQuant, were used in this study. Mascot was used to generate the potential phosphopeptide library for PRM analysis because the retention time was easier to extract with our in-house Java program, while MaxQuant was used to identify phosphopeptides and phospho-sites in the final results because it has a module to evaluate the confidence of identified p-sites. After filtering with the same strict criteria as for the DDA data, 167 unique non-phosphopeptides and 43 unique phosphopeptides were confidently identified from the PRM dataset. Because of the presence of falsely matched phosphopeptides in the low-scored phosphopeptide target list, some non-phosphopeptides and phosphopeptides were identified from proteins other than those present in the protein complex. These identifications were removed and only the 37 unique phosphopeptides derived from the Shc1 protein complex were kept. These phosphopeptides led to the mapping of 34 p-sites on 13 component proteins. Among the 34 p-sites, 19 were among the 42 p-sites identified by DDA. Therefore, 15 high-confidence p-sites were newly identified by this method, and the total number of p-sites identified from the 2 min stimulated sample increased to 57.

We named this method the pseudo-targeted MS method because we do not know whether the targets we set are true positive target peptides. Among the 152 potential phosphopeptides from the Shc1 complex that were targeted during PRM analysis, only 37 unique phosphopeptides were confidently identified from the protein complex after searching the PRM spectra. Clearly, most of the targets are not true positives. However, it should be noted that the scores of 49 peptides improved (points above the identity line) after PRM analysis (Figure 2a). One example is given in Figure 2b. A very poor MS/MS spectrum was generated in DDA, while a fragment-rich spectrum was generated in PRM mode for the same phosphopeptide. The DDA spectrum could not yield a confident phosphopeptide identification (Mascot score of 6), while the PRM spectrum yielded a high-confidence phosphopeptide identification with a score of 34. Clearly, the improved spectral quality in PRM enabled the confident identification of extra p-sites in the Shc1 protein complex.
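The site counts reported above follow from simple set arithmetic: 34 PRM p-sites minus the 19 shared with DDA gives 15 new sites, and 42 + 15 = 57 in total. The sketch below reproduces this bookkeeping with placeholder site identifiers (the names `site0`, `site1`, ... are hypothetical stand-ins, not real p-sites).

```python
def combine_sites(dda_sites, prm_sites):
    """Return (sites newly found by PRM, union of DDA and PRM sites)."""
    new_sites = prm_sites - dda_sites
    return new_sites, dda_sites | prm_sites


# Placeholder identifiers chosen so that the overlap matches the reported counts.
dda = {f"site{i}" for i in range(42)}        # 42 p-sites from DDA
prm = {f"site{i}" for i in range(23, 57)}    # 34 p-sites from PRM, 19 shared with DDA

new_sites, all_sites = combine_sites(dda, prm)
shared = dda & prm
```

With these placeholders, `len(shared)`, `len(new_sites)` and `len(all_sites)` match the 19 shared, 15 new and 57 total p-sites stated in the text.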

In the above experiments, three replicate runs were performed and searched together for the analysis of the 2 min stimulated IP sample in both the DDA and PRM modes. Because of the high complexity of proteome samples and the random sampling in DDA mode, running LC-MS/MS multiple times is an effective way to improve analysis coverage in shotgun proteomics29. We investigated whether this is also the case for the analysis of p-sites in a protein complex. For ease of comparison, the MS data for the three replicate runs were searched separately. As shown in Figure 3a, the three replicate DDA runs identified 35, 33 and 36 p-sites on the Shc1 complex, respectively. Combining the results of these three runs led to the identification of 43 p-sites (Figure 3a). The number did not increase significantly because most of the identifications were the same across runs (Figure 3b). This is probably because the IP sample is much simpler than a typical complex proteome sample, so the mass spectrometer can reproducibly fragment these phosphopeptides. For the PRM analysis, the three replicate runs led to the identification of 30, 32 and 32 p-sites from the Shc1 protein complex (Figure 3c). Among them, 12, 12 and 14 p-sites were newly identified from the Shc1 protein complex compared with the p-sites already identified from the DDA data. Combining these three PRM runs, the number of newly identified p-sites on the Shc1 complex increased to 17. The increase across the three replicate PRM runs was even lower than for the DDA data because the reproducibility of the targeted analysis was better (Figure 3d). Clearly, replicate runs, in either DDA or PRM mode, do not contribute many new identifications. However, compared with the 42 sites identified by DDA, the PRM analysis operated in the pseudo-targeted strategy led to 15 new site

identifications, which improved by 35.7%. Clearly the pseudo-targeted strategy is

3

able to identify low abundant phosphopeptides that cannot be achieved by

4

conventional DDA method.
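As a rough worked example of the comparison above, the following sketch contrasts what pooling replicates adds with what the pseudo-targeted step adds. The counts are those quoted in the text; measuring the replicate benefit as "combined minus best single run" is an assumption made here for illustration, not a metric defined by the authors.

```python
# Per-run p-site counts quoted in the text (2 min sample, runs searched separately).
dda_runs = [35, 33, 36]       # p-sites per DDA replicate
dda_combined = 43             # union of the three DDA runs
prm_new_runs = [12, 12, 14]   # PRM p-sites not already found by DDA, per replicate
prm_new_combined = 17         # union of newly identified sites over the PRM runs


def gain_over_best_run(per_run, combined):
    """Sites added by pooling replicates, relative to the best single run."""
    return combined - max(per_run)


dda_gain = gain_over_best_run(dda_runs, dda_combined)
prm_gain = gain_over_best_run(prm_new_runs, prm_new_combined)

# Gain from the pseudo-targeted step itself (replicates searched together).
pseudo_targeted_gain_pct = 100 * 15 / 42

print(dda_gain, prm_gain, round(pseudo_targeted_gain_pct, 1))  # 7 3 35.7
```

Under this reading, pooling replicates adds 7 sites for DDA and 3 for PRM, whereas the pseudo-targeted step adds 15 sites on top of the 42 found by DDA.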

An inclusion list can also be used in DDA mode, termed iDDA, to enhance the detection sensitivity for the peptides of interest (30). We therefore compared the performance of iDDA with that of PRM using the 2 min stimulated IP sample. A filling time of 503 ms was set in PRM because it yielded the maximum number of phosphopeptides during our initial optimization of PRM (data not shown). For a fair comparison, a filling time of 503 ms was also set in iDDA, rather than the 50 ms used in the conventional DDA described above. The comparison experiments were performed in parallel under identical conditions, i.e., the same sample, the same inclusion list and the same instrument (a QE rather than a QE HF was used here) with the same parameter settings. As can be seen from Figure S5a, both iDDA and PRM yielded many more p-site identifications than conventional DDA did. However, PRM identified 37.5% more p-sites than iDDA, indicating that PRM has higher sensitivity. This could be partly attributed to their different trigger mechanisms. In iDDA, an MS2 scan is triggered only when a peak corresponding to an m/z value on the inclusion list is present in the full MS scan. In PRM, by contrast, MS2 scans are triggered by the preset retention times and m/z values, independent of the MS1 scan. Low-abundance phosphopeptides may not yield peaks strong enough in MS1 to trigger MS2 in iDDA, whereas their MS2 spectra can still be acquired in PRM mode. This explains why more MS2 spectra from the protein complex were collected in PRM mode than in iDDA mode (Figure S5b).
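The difference between the two trigger mechanisms can be illustrated with a toy model. This is a deliberately simplified sketch, not instrument control code: the `Target` class, the intensity threshold, the m/z tolerance and all numeric values are hypothetical and chosen only to show the logic.

```python
from dataclasses import dataclass


@dataclass
class Target:
    mz: float        # precursor m/z on the inclusion list
    rt_start: float  # scheduled retention-time window (min)
    rt_end: float


def idda_triggers(target, ms1_peaks, intensity_threshold, tol=0.01):
    """iDDA: MS2 fires only if a matching MS1 peak is intense enough."""
    return any(
        abs(mz - target.mz) <= tol and intensity >= intensity_threshold
        for mz, intensity in ms1_peaks
    )


def prm_triggers(target, current_rt):
    """PRM: MS2 fires whenever the run is inside the scheduled window."""
    return target.rt_start <= current_rt <= target.rt_end


# Hypothetical low-abundance precursor: present, but below the MS1 threshold.
target = Target(mz=650.30, rt_start=30.0, rt_end=34.0)
ms1_peaks = [(650.30, 2.0e3), (712.85, 5.0e6)]  # (m/z, intensity)

idda_fired = idda_triggers(target, ms1_peaks, intensity_threshold=1.0e4)
prm_fired = prm_triggers(target, current_rt=32.0)
print(idda_fired, prm_fired)  # False True
```

In this toy case the precursor is too weak in MS1 to trigger iDDA, yet PRM still acquires its MS2 spectrum because acquisition depends only on the schedule.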

This PRM-based pseudo-targeted method was further applied to analyze the Shc1 complexes at other EGF stimulation times. Clear increases in the number of identified p-sites were observed in all samples, as shown in Table 1. In particular, 21 additional p-sites were identified by PRM in the 20 min sample, equivalent to 50% of the 42 p-sites identified by DDA. In this study, we identified 10, 57, 63 and 63 p-sites in the Shc1 complexes at four states of the EGF signaling pathway (rest stage/0 min, early stage/2 min, medium stage/5 min and late stage/20 min), respectively. In total, 82 unique p-sites were identified. The identified p-sites are listed in Table S2. Over 96% of these p-sites are included in PhosphoSitePlus, a database of experimentally observed post-translational modifications (31). To our knowledge, this is the highest number of p-sites identified from the EGF-dependent Shc1 complexes; in a previous work (22), 22 p-sites were identified on the Shc1 complexes. Benefiting from the high sensitivity and high resolution of PRM, the pseudo-targeted strategy dramatically enhanced the identification of p-sites in the protein complex.
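The per-sample numbers in Table 1 can be cross-checked with a few lines of arithmetic. The sketch below only re-computes values transcribed from Table 1; the variable names are chosen here for illustration.

```python
# Values transcribed from Table 1 (number of p-sites per EGF stimulation time).
times = ["0 min", "2 min", "5 min", "20 min"]
dda = [6, 42, 46, 42]
prm_gain = [4, 15, 17, 21]
reported_total = [10, 57, 63, 63]

# Totals should equal DDA sites plus sites newly identified by PRM.
computed_total = [d + g for d, g in zip(dda, prm_gain)]

# PRM gain expressed relative to the DDA count for the same sample.
relative_gain_pct = [round(100 * g / d, 1) for d, g in zip(dda, prm_gain)]

for t, tot, pct in zip(times, computed_total, relative_gain_pct):
    print(f"{t}: total {tot}, PRM adds {pct}% relative to DDA")
```

The 82 unique p-sites reported overall is a union across time points, so it cannot be derived from these per-sample counts alone.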

In the conventional phosphoproteomics workflow, phosphopeptides are specifically enriched from the protein digest prior to the LC-MS/MS run. Because the interference from non-phosphopeptides is almost eliminated, the phosphoproteome coverage can be dramatically improved. We also tested the performance of this approach for the analysis of p-sites in the protein complex. The digest of the 20 min EGF-stimulated Shc1 complex IP sample, which yielded among the highest numbers of p-sites in both acquisition modes, was subjected to Ti-IMAC enrichment followed by LC-MS/MS analysis. Only 5 p-sites on the Shc1 complex were identified, far fewer than were identified in the sample without Ti-IMAC treatment (see Table 1). This is not difficult to understand. The pulled-down complex amounts to only a few µg, which is much less than the usual starting amount for Ti-IMAC enrichment of a proteome sample (typically >100 µg) (15). With such a minute amount of sample, the enrichment step caused substantial sample loss, which resulted in poor identification. In contrast, the method presented in this study, which requires no enrichment, yielded better results.

Owing to its advantage of accurate quantification, the PRM-based method enabled comprehensive mapping of the dynamic changes of the Shc1 complex upon EGF stimulation (Figure 4). After EGF stimulation, a series of proteins were recruited to the complex. A total of 33 proteins were up-regulated after EGF stimulation, among which 12, 15 and 6 proteins reached their peak at 2 min, 5 min and 20 min, respectively (Figure S3b). Some proteins were down-regulated at later stages: their intensity decreased by 50% compared with the peak value but remained at a remarkably high level. Similarly, dynamic changes in protein phosphorylation on the protein complexes were observed at different stages of EGF stimulation. The numbers of p-sites identified on each protein are labeled in Figure 4. With the assembly of the protein complex and signal transduction, the total number of p-sites reached its maximum at the late stage (20 min after EGF stimulation). About 42.7% of the p-sites were identified in all EGF-stimulated states (2 min, 5 min and 20 min) (Figure S6), suggesting that these sites were probably activated immediately after the cells were stimulated and that many of the p-sites remained activated from the early stage to the late stage. The peptides carrying the p-sites were monitored by PRM, and the dynamic changes of 45 p-sites across the stimulation were observed (Table S1 and Figure S7a). Changes in the levels of phosphopeptides were further normalized to changes in protein expression to derive the trend in occupancy of the phosphorylation sites (Figure S7b). The normalized fold change indicates the change in phosphosite occupancy. The occupancy of many sites changed during EGF stimulation, suggesting phosphorylation-mediated dynamic regulation of the protein complex. For example, the quantitative trends of Y1172 and Y1197 on EGFR were observed to peak after 5 min of stimulation, which is similar to the result of previous work in which the two p-sites peaked at an early stage and then decreased (22). After normalization to the protein change, the occupancy of these two sites started to increase at 2 min and was down-regulated at 5 min. The occupancies that appeared high at 20 min may be the result of down-regulation at the protein level. The dynamic occupancy changes at these two sites suggest that the sites were activated immediately after stimulation and did not increase in the same proportion as the protein level. Taken together, the above results indicate that the PRM-based method could be a powerful tool to reveal the dynamic assembly of protein complexes.
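The occupancy normalization described above can be sketched as a simple ratio. This is a minimal illustration that assumes the normalization is a division of the phosphopeptide fold change by the protein fold change; the exact procedure is not spelled out in this section, and the function name and all numbers below are hypothetical rather than data from the study.

```python
def occupancy_fold_change(phospho_fc, protein_fc):
    """Phosphopeptide fold change corrected for the protein-level fold change."""
    if protein_fc <= 0:
        raise ValueError("protein fold change must be positive")
    return phospho_fc / protein_fc


# Hypothetical fold changes relative to a reference time point (not paper data).
# time point -> (phosphopeptide fold change, protein fold change)
example = {"2 min": (4.0, 2.0), "5 min": (6.0, 4.0), "20 min": (3.0, 1.0)}

normalized = {t: occupancy_fold_change(p, q) for t, (p, q) in example.items()}
print(normalized)  # {'2 min': 2.0, '5 min': 1.5, '20 min': 3.0}
```

In this made-up example the raw phosphopeptide signal peaks at 5 min, but the normalized occupancy is already elevated at 2 min, dips at 5 min because the protein level rose faster, and looks high at 20 min mainly because the protein level dropped.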


4. Conclusion

In this study, we presented a pseudo-targeted MS method to identify low-abundance phosphorylation in minute amounts of sample. This targeted approach is easy to develop because the same sample and the same LC system are used for the two phases. No sample fractionation or enrichment is required for the discovery phase, which allows the method to analyze minute amounts of sample. We have demonstrated that this method has higher sensitivity for identifying phosphorylation sites on endogenous Shc1 protein complexes. PRM is typically used in targeted proteomics to quantify peptides of interest, whereas the pseudo-targeted MS method presented in this study allows PRM to identify new peptides. In this new method, low-confidence peptides identified from the DDA dataset are used to build a pseudo-targeted library for PRM analysis, which enables the identification of new high-confidence peptides. This strategy is therefore not limited to the analysis of protein phosphorylation; it is also applicable to the analysis of other low-abundance peptides in trace amounts of sample.
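The library-building step summarized above can be sketched as a small filter over DDA search results. This is a hypothetical outline, not the authors' implementation: the `Psm` fields, the score cutoff, the retention-time window and the placeholder peptide sequences are all assumptions introduced for illustration.

```python
from typing import NamedTuple


class Psm(NamedTuple):
    sequence: str
    mz: float
    charge: int
    rt_min: float  # retention time observed in the DDA run (min)
    score: float   # search-engine score


def build_pseudo_target_list(psms, confident_cutoff, rt_half_window=2.0):
    """Turn sub-threshold DDA identifications into scheduled PRM targets."""
    best = {}
    for psm in psms:
        if psm.score >= confident_cutoff:
            continue  # already confident: no need to target again
        key = (psm.sequence, psm.charge)
        if key not in best or psm.score > best[key].score:
            best[key] = psm  # keep the best-scoring PSM per precursor
    return [
        {
            "sequence": p.sequence,
            "mz": p.mz,
            "charge": p.charge,
            "rt_start": p.rt_min - rt_half_window,
            "rt_end": p.rt_min + rt_half_window,
        }
        for p in sorted(best.values(), key=lambda p: p.rt_min)
    ]


# Placeholder PSMs (hypothetical values, not data from the study).
psms = [
    Psm("PEPTIDEK", 500.25, 2, 20.0, 6.0),
    Psm("PEPTIDEK", 500.25, 2, 20.4, 9.0),
    Psm("SAMPLER", 420.70, 2, 35.0, 40.0),
]
targets = build_pseudo_target_list(psms, confident_cutoff=20.0)
print(targets)
```

Here the confidently identified peptide is skipped, and the two low-scoring matches to the same precursor collapse into a single scheduled target. A real workflow would likely also apply a lower score bound and restrict targets to proteins of the complex, but those details are not given in this section.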

Acknowledgments

This work was supported, in part, by funds from the China State Key Basic Research Program Grants (2016YFA0501402) and the National Natural Science Foundation of China (21605140, 21235006, 21535008, 81600046). MY is a recipient of the National Science Fund of China for Distinguished Young Scholars (21525524).

Competing financial interests: The authors declare no competing financial interest.

Supporting information available

Additional information as noted in text. This material is available free of charge via the Internet at http://pubs.acs.org.

References

(1) Gavin, A. C.; Bosche, M.; Krause, R.; Grandi, P.; Marzioch, M.; Bauer, A.; Schultz, J.; Rick, J. M.; Michon, A. M.; Cruciat, C. M.; Remor, M.; Hofert, C.; Schelder, M.; Brajenovic, M.; Ruffner, H.; Merino, A.; Klein, K.; Hudak, M.; Dickson, D.; Rudi, T., et al. Nature 2002, 415, 141-147.
(2) Guruharsha, K. G.; Rual, J. F.; Zhai, B.; Mintseris, J.; Vaidya, P.; Vaidya, N.; Beekman, C.; Wong, C.; Rhee, D. Y.; Cenaj, O.; McKillip, E.; Shah, S.; Stapleton, M.; Wan, K. H.; Yu, C.; Parsa, B.; Carlson, J. W.; Chen, X.; Kapadia, B.; VijayRaghavan, K., et al. Cell 2011, 147, 690-703.
(3) Vazquez, F.; Grossman, S. R.; Takahashi, Y.; Rokas, M. V.; Nakamura, N.; Sellers, W. R. J. Biol. Chem. 2001, 276, 48627-48630.
(4) van Attikum, H.; Fritsch, O.; Hohn, B.; Gasser, S. M. Cell 2004, 119, 777-788.
(5) Lawrence, R. T.; Searle, B. C.; Llovet, A.; Villen, J. Nature Methods 2016, 13, 431-434.
(6) Zhou, H.; Ye, M.; Dong, J.; Corradini, E.; Cristobal, A.; Heck, A. J.; Zou, H.; Mohammed, S. Nature Protocols 2013, 8, 461-480.
(7) Kang, T.; Kim, J. H.; Hong, I.; Park, N.; Heinsen, H.; Lee, J.-Y.; Ravid, R.; Ferrer, I.; Yoo, J. S.; Kwon, K.-H.; Park, Y. M. Anal. Bioanal. Chem. 2014, 406, 5433-5446.
(8) Roumeliotis, T. I.; Williams, S. P.; Goncalves, E.; Alsinet, C.; Del Castillo Velasco-Herrera, M.; Aben, N.; Ghavidel, F. Z.; Michaut, M.; Schubert, M.; Price, S.; Wright, J. C.; Yu, L.; Yang, M.; Dienstmann, R.; Guinney, J.; Beltrao, P.; Brazma, A.; Pardo, M.; Stegle, O.; Adams, D. J., et al. Cell Reports 2017, 20, 2201-2214.
(9) Bekker-Jensen, D. B.; Kelstrup, C. D.; Batth, T. S.; Larsen, S. C.; Haldrup, C.; Bramsen, J. B.; Sorensen, K. D.; Hoyer, S.; Orntoft, T. F.; Andersen, C. L.; Nielsen, M. L.; Olsen, J. V. Cell Systems 2017, 4, 587-599 e584.
(10) Yang, F.; Shen, Y.; Camp, D. G., 2nd; Smith, R. D. Expert Review of Proteomics 2012, 9, 129-134.
(11) Gillet, L. C.; Navarro, P.; Tate, S.; Roest, H.; Selevsek, N.; Reiter, L.; Bonner, R.; Aebersold, R. Molecular & Cellular Proteomics 2012, 11.
(12) Nahnsen, S.; Kohlbacher, O. BMC Bioinformatics 2012, 13.
(13) Pan, S.; Aebersold, R.; Chen, R.; Rush, J.; Goodlett, D. R.; McIntosh, M. W.; Zhang, J.; Brentnall, T. A. J. Proteome Res. 2009, 8, 787-797.
(14) Elschenbroich, S.; Kislinger, T. Molecular BioSystems 2011, 7, 292-303.
(15) de Graaf, E. L.; Kaplon, J.; Mohammed, S.; Vereijken, L. A.; Duarte, D. P.; Redondo Gallego, L.; Heck, A. J.; Peeper, D. S.; Altelaar, A. F. J. Proteome Res. 2015, 14, 2906-2914.
(16) Soste, M.; Hrabakova, R.; Wanka, S.; Melnik, A.; Boersema, P.; Maiolica, A.; Wernas, T.; Tognetti, M.; von Mering, C.; Picotti, P. Nature Methods 2014, 11, 1045-1048.
(17) Degryse, S.; de Bock, C. E.; Demeyer, S.; Govaerts, I.; Bornschein, S.; Verbeke, D.; Jacobs, K.; Binos, S.; Skerrett-Byrne, D. A.; Murray, H. C.; Verrills, N. M.; Van Vlierberghe, P.; Cools, J.; Dun, M. D. Leukemia 2017.
(18) Gallien, S.; Domon, B. Bioanalysis 2014, 6, 2159-2170.
(19) Bourmaud, A.; Gallien, S.; Domon, B. Proteomics 2016, 16, 2146-2159.
(20) Sherman, J.; McKay, M. J.; Ashman, K.; Molloy, M. P. Proteomics 2009, 9, 1120-1123.
(21) Peterson, A. C.; Russell, J. D.; Bailey, D. J.; Westphall, M. S.; Coon, J. J. Molecular & Cellular Proteomics 2012, 11, 1475-1488.
(22) Zheng, Y.; Zhang, C.; Croucher, D. R.; Soliman, M. A.; St-Denis, N.; Pasculescu, A.; Taylor, L.; Tate, S. A.; Hardy, W. R.; Colwill, K.; Dai, A. Y.; Bagshaw, R.; Dennis, J. W.; Gingras, A. C.; Daly, R. J.; Pawson, T. Nature 2013, 499, 166-171.
(23) Cox, J.; Mann, M. Nat. Biotechnol. 2008, 26, 1367-1372.
(24) MacLean, B.; Tomazela, D. M.; Shulman, N.; Chambers, M.; Finney, G. L.; Frewen, B.; Kern, R.; Tabb, D. L.; Liebler, D. C.; MacCoss, M. J. Bioinformatics 2010, 26, 966-968.
(25) Mellacheruvu, D.; Wright, Z.; Couzens, A. L.; Lambert, J. P.; St-Denis, N. A.; Li, T.; Miteva, Y. V.; Hauri, S.; Sardiu, M. E.; Low, T. Y.; Halim, V. A.; Bagshaw, R. D.; Hubner, N. C.; Al-Hakim, A.; Bouchard, A.; Faubert, D.; Fermin, D.; Dunham, W. H.; Goudreault, M.; Lin, Z. Y., et al. Nature Methods 2013, 10, 730-736.
(26) Cox, J.; Hein, M. Y.; Luber, C. A.; Paron, I.; Nagaraj, N.; Mann, M. Molecular & Cellular Proteomics 2014, 13, 2513-2526.
(27) Tyanova, S.; Temu, T.; Sinitcyn, P.; Carlson, A.; Hein, M. Y.; Geiger, T.; Mann, M.; Cox, J. Nature Methods 2016, 13, 731-740.
(28) Lesur, A.; Domon, B. Proteomics 2015, 15, 880-890.
(29) Liu, H. B.; Sadygov, R. G.; Yates, J. R. Anal. Chem. 2004, 76, 4193-4201.
(30) Jaffe, J. D.; Keshishian, H.; Chang, B.; Addona, T. A.; Gillette, M. A.; Carr, S. A. Molecular & Cellular Proteomics 2008, 7, 1952-1962.
(31) Hornbeck, P. V.; Kornhauser, J. M.; Tkachev, S.; Zhang, B.; Skrzypek, E.; Murray, B.; Latham, V.; Sullivan, M. Nucleic Acids Res. 2012, 40, D261-D270.

Table 1. P-sites identified in the Shc1 protein complexes by DDA and PRM

EGF stimulation time    0 min    2 min    5 min    20 min
DDA                     6        42       46       42
PRM                     +4       +15      +17      +21
Total                   10       57       63       63

Figure legends

Figure 1. The pseudo-targeted strategy for the sensitive analysis of protein phosphorylation in protein complexes. The low-confidence phosphopeptide identifications from DDA data were used to build the pseudo-targeted library for PRM analysis, which enabled the identification of additional high-confidence phosphopeptides.

Figure 2. The pseudo-targeted strategy improves spectral quality. (a) Comparison of peptide matching scores in DDA and PRM for the same ions. The scores of 49 peptides improved after PRM and lie above the identity line. (b) The matching score for the identification of a phosphopeptide, SPFGSPSAEAVSSR, was 6 in DDA and improved to 34 after PRM.

Figure 3. Multiple runs of DDA or PRM analysis do not significantly improve the coverage. The numbers of p-sites identified by the three replicate (a) DDA runs and their combination and (c) PRM runs and their combination. The overlap of p-site identifications for the three replicate (b) DDA and (d) PRM runs. The Shc1 complex immunoprecipitated from cells stimulated with EGF for 2 min was used as the sample for all analyses. The p-sites shown were all identified from the Shc1 complex. The numbers of p-sites for the combined results differ slightly from those in Table 1 because the MS data for each run were searched separately here.

Figure 4. The PRM-based method enabled comprehensive mapping of the dynamic changes of the Shc1 complex upon EGF stimulation. The binding partners of Shc1 at the rest phase are labeled in blue. The proteins recruited after EGF stimulation are shown in orange. The proteins down-regulated by over 50% compared with their peak values are labeled in white. The number in each circle indicates the number of p-sites identified from that protein.

Figure 1

Figure 2. Legend entries: phosphopeptide, non-phosphopeptide, unmatched ion. (a) PRM score versus DDA score, both axes from 0 to 120. (b) Score: 6 (DDA); Score: 34 (PRM).

Figure 3. (a) P-sites per run: 2 min DDA1, 35; 2 min DDA2, 33; 2 min DDA3, 36; 3 DDA runs combined, 43. (c) P-sites per run, shown as p-sites identified in DDA plus newly identified p-sites in PRM: 2 min PRM1, 18 + 12; 2 min PRM2, 20 + 12; 2 min PRM3, 18 + 14; 3 PRM runs combined, 19 + 17. (b, d) Overlap of p-site identifications among the three replicate runs.

Figure 4

TOC graphic