Subscriber access provided by Caltech Library Services
Article
Identification of Lysine Acetylation in Mycobacterium abscessus Using LC-MS/MS after Immunoprecipitation Jintao Guo, Changwei Wang, Yi Han, Zhiyong Liu, Tian Wu, Yan Liu, Yang Liu, Yaoju Tan, Xinshan Cai, Yuanyuan Cao, Bangxing Wang, Buchang Zhang, Chunping Liu, Shouyong Tan, and Tianyu Zhang J. Proteome Res., Just Accepted Manuscript • DOI: 10.1021/acs.jproteome.6b00116 • Publication Date (Web): 20 Jun 2016 Downloaded from http://pubs.acs.org on June 21, 2016
Just Accepted “Just Accepted” manuscripts have been peer-reviewed and accepted for publication. They are posted online prior to technical editing, formatting for publication and author proofing. The American Chemical Society provides “Just Accepted” as a free service to the research community to expedite the dissemination of scientific material as soon as possible after acceptance. “Just Accepted” manuscripts appear in full in PDF format accompanied by an HTML abstract. “Just Accepted” manuscripts have been fully peer reviewed, but should not be considered the official version of record. They are accessible to all readers and citable by the Digital Object Identifier (DOI®). “Just Accepted” is an optional service offered to authors. Therefore, the “Just Accepted” Web site may not include all articles that will be published in the journal. After a manuscript is technically edited and formatted, it will be removed from the “Just Accepted” Web site and published as an ASAP article. Note that technical editing may introduce minor changes to the manuscript text and/or graphics which could affect content, and all legal disclaimers and ethical guidelines that apply to the journal pertain. ACS cannot be held responsible for errors or consequences arising from the use of information contained in these “Just Accepted” manuscripts.
Journal of Proteome Research is published by the American Chemical Society. 1155 Sixteenth Street N.W., Washington, DC 20036 Published by American Chemical Society. Copyright © American Chemical Society. However, no copyright claim is made to original U.S. Government works, or works produced by employees of any Commonwealth realm Crown government in the course of their duties.
Page 1 of 45
Journal of Proteome Research
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 ACS Paragon Plus Environment
Journal of Proteome Research
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Page 2 of 45
Identification of Lysine Acetylation in Mycobacterium abscessus Using LC-MS/MS after Immunoprecipitation Jintao Guo a, ‡, Changwei Wang a, ‡, Yi Han a, Zhiyong Liu a, Tian Wu a, Yan Liu a, Yang Liu a, c, Yaoju Tan b, Xinshan Cai b, Yuanyuan Cao a, c, Bangxing Wang a, c, Buchang Zhang c, Chunping Liu b, Shouyong Tan b and Tianyu Zhang a, * a
State Key Laboratory of Respiratory Disease, Guangzhou Institutes of Biomedicine
and Health (GIBH), Chinese Academy of Sciences (CAS), Guangzhou, China. b State Key Laboratory of Respiratory Disease, Department of Clinical Laboratory, The Guangzhou Chest Hospital, Guangzhou, Guangzhou, China. c School of Life Sciences, University of Anhui, Hefei, China.
Running title: Lysine acetylation in M. abscessus
Keywords:
Mycobacterium
abscessus
(M.
abscessus),
lysine
acetylation,
post-translational modifications (PTMs), mass spectrometry (MS)
1 ACS Paragon Plus Environment
Page 3 of 45
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Journal of Proteome Research
ABSTRACT
Mycobacterium abscessus (MAB), which manifests in the pulmonary system, is one of the neglected causes of nontuberculous mycobacteria (NTM) infection. Treatment against MAB is difficult, characterized by its intrinsic antibiotic drug-resistance. Lysine acetylation can alter the physiochemical property of proteins in living organisms. This study aimed to determine if this protein post-translational modification (PTM) exists in a clinical isolate M. abscessus GZ002. We used the anti-acetyl-lysine immunoprecipitation to enrich the low-abundant PTM proteins, followed by the LC-MS/MS analysis. The lysine acetylome of M. abscessus GZ002 was determined. There were 459 lysine acetylation sites found in 289 acetylated proteins. Lysine acetylation occurred in 5.87 % of the M. abscessus GZ002 proteome and at least 25 % of them were growth essential. Aerobic respiration and carbohydrate metabolic pathways of M. abscessus GZ002 were enriched with lysine acetylation. Through bioinformatics analysis, we identified four major acetyl motif logos (KacY, KacF, KacH and DKac). Further comparison of the reported M. tuberculosis (MTB) acetylomes and that of MAB GZ002 revealed several common features between these two species. The lysine residues of several antibiotic-resistance, virulence and persistence-related proteins were acetylated in both MAB GZ002 and MTB. There were 51 identical acetylation sites in 37 proteins found in common between MAB GZ002 and MTB. Overall, we demonstrate a profile of lysine acetylation in MAB
2 ACS Paragon Plus Environment
Journal of Proteome Research
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Page 4 of 45
GZ002 proteome that shares similarities with MTB. Interventions that target at these conserved sections may be valuable as anti-NTM and/or anti-TB therapies.
INTRODUCTION The nontuberculous mycobacteria (NTM) can cause a wide spectrum of infections, including the manifestation in the pulmonary system. Increasing reports in the literature suggest that the NTM cause a greater number of pulmonary infections than the Mycobacterium tuberculosis (MTB) complex.
1-2
The pulmonary infection caused
by the rapidly growing Mycobacterium abscessus (MAB) is difficult to cure due to its intrinsic resistance to the standard anti-tuberculosis drugs (reviewed in 3-4). As a result, increased prevalence of NTM infections is in contrast with a reduced number of tuberculosis (TB) infections.
5-7
New agents with enhanced activity against MAB and
other NTM are needed. Study of the protein post-translational modification (PTM) is imperative in understanding the biology of NTM and MTB to reveal potential new targets for disease prevention and treatment. Though hundreds of lysine-acetylated proteins were identified in MTB H37Rv and the attenuated strain MTB H37Ra,
8-9
the lysine
acetylome has not been determined for MAB. In general, the lysine acetylation and deacetylation can act as the molecular switches that initiate or inhibit protein activities. 10-12
Acetylation of the glyceraldehyde phosphate dehydrogenase kinase/phosphatase
in Salmonella enterica could shift the enzyme’s activity from gluconeogenesis to glycolysis.
13
Alternatively, lysine acetylation neutralizes the positively charged 3 ACS Paragon Plus Environment
Page 5 of 45
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Journal of Proteome Research
ε-amino group that prevents protein from interacting or binding with substrates. 10-14 It was also evident that cross-talk between various PTMs is a part of the global protein function regulatory network since co-occurring of acetylation and phosphorylation in the interaction interfaces of multifunctional proteins were frequently found in Mycoplasma pneumoniae.
15
Lysine acetylation in histones is another well-known
mechanism employed by the living organisms to regulate transcription and DNA repair. 16 The general control of amino acid synthesis protein 5 (GCN5)-related N-acetyltransferases (GNATs) catalyze the protein lysine acetylation by transferring an acetyl group from acetyl-coenzyme A to the ε-amine group of lysine (Nε-modification) or to the α-amine of protein N-terminus (Nα-modification).
8-9
The reverse reaction
can be carried out by deacetylases. The changes in protein structures by PTM may have major impacts on protein function and compartmentalization.
17-18
At least three
putative GNATs have been identified in the MAB genome. The MAB_3168c and MAB_4395 were associated with the aminoglycosides resistance, MAB_1725c might play a role in mycobacterial virulence.
20
3, 19
whereas the
The identifications of
these acetyltransferases suggest that lysine acetylation is functional in MAB. However, apart from the general associations with drug-resistance and virulence, the actual effects of lysine acetylation on cell physiology and which acetyltransferase is responsible
for
the
target
protein
modification
remains
exclusive.
The
PTM-dependent pathways that regulate the MAB disease progression are yet to be identified. 4 ACS Paragon Plus Environment
Journal of Proteome Research
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Page 6 of 45
In the current work we examined the lysine acetylome of a MAB clinical strain GZ002. In order to identify the low abundant lysine-acetylated protein, we used immunopurification with an anti-acetyl-lysine antibody followed by liquid chromatography-tandem mass spectrometry (LC-MS/MS). The acetyl-lysines were widely distributed in the MAB GZ002 proteome. Proteins involved in the carbon metabolism pathway, tricarboxylic acid (TCA) cycle, and the aerobic respiration pathway were enriched with acetyl-lysines. We further compared the lysine acetylome of MAB GZ002 with those from the reported MTB strains, array of common acetylated proteins and identical amino acid sequences flanking the acetylation sites were identified in these two species. We hypothesized that these conserved lysine acetylation pathways might be potential targets for therapeutic avenues.
EXPERIMENTAL PROCEDURES Bacterial Strain A full description of the speciation procedures for the MAB clinical isolate GZ002 obtained form the Guangzhou Chest Hospital (P. R. China) can be found in SI. Bacteria were grown in Middlebrook 7H9 broth (Difco), supplemented with 10% Oleic acid-bovine albumin-dextrose-catalase (OADC) (Becton Dickinson), 0.05% Tween-80 and 0.5% glycerol.
9
Cultures were maintained at 37 °C. Three genes of the MAB
GZ002, including the 16S rRNA, the DNA gyrase subunit B (gyrB), and the DNA-directed RNA polymerase subunit β (rpoB)
21-22
were amplified by PCR using
primers listed in Table S-1. After gene sequencing (Beijing Genomics Institute), 5 ACS Paragon Plus Environment
Page 7 of 45
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Journal of Proteome Research
BLAST searches were performed to compare each of the MAB GZ002 gene sequence with its homologous gene in reference strain MAB ATCC 19977.
Extraction and Trypsin Digestion of Proteins Soluble proteins were extracted from mid-log phase bacteria using sonication as described.
23
The bacteria were harvested from the same batch, washed twice in cold
PBS (pH 7.5) and sonicated in 8 M urea lysis buffer, supplemented with 1% (v/v) protease inhibitor cocktail (Protease Inhibitor Cocktail SetIII; Calbiochem), 1 mM dithiothreitol (DTT), 2 mM EDTA and deacetylase inhibitor mix (30 mM nicotinamide,
50 mM sodium butyrate and 3 µM trichostatin A). After centrifugation, the 2-D Quant kit (GE Healthcare) was used to quantify the protein content in clear supernatant according to the manufacturer’s instructions. Together, 12 mg of soluble proteins were used. The proteins were precipitated with 20% (w/v) trichloroacetic acid overnight at 4 oC. Then the precipitates were washed three times with cold acetone. Peptide reduction and alkylation were carried out using 5 mM dithiothreitol (at 56 °C for 45 min) and 15 mM iodoacetamide (at room temperature for 30 min, protected from light) respectively. The reactions were stopped by adding 15 mM cysteine and kept for 30 min at room temperature. Proteins were subjected to overnight trypsin (Promega) digestion in 100 mM NH4HCO3.
24
The processed peptides received a
second trypsin digestion for 4 h to ensure complete digestion. The digested peptides were lyophilized in 8 cryotubes using a SpeedVac concentrator (Thermo Scientific) and stored at – 80 °C for further characterization. 6 ACS Paragon Plus Environment
Journal of Proteome Research
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Page 8 of 45
Immunoenrichment of Acetylated Peptides The lysine-acetylated peptides in 8 cryotubes were enriched using the anti-acetyl-lysine pan antibody-conjugated beads (PTM Biolabs) as described previously.
25
Peptides were suspended in NETN buffer (50 mM Tris-HCl at pH 8.0,
100 mM NaCl, 1 mM EDTA and 0.5% v/v Nonidet P-40), then incubated with the antibody-conjugated beads for 8 h at 4 °C with rotary shaking. After centrifugation, the beads were washed in cold PBS and distilled water. The bound peptides were eluted in 0.1% (v/v) trifluoroacetic acid and subsequently lyophilized in a SpeedVac concentrator.
Mass Spectroscopy The immunoprecipitated peptides from each cryotube were re-suspended in buffer containing 2% (v/v) acetonitrile and 0.1% (v/v) formic acid (solvent A). After centrifugation, supernatant from each cryotube was loaded onto an Acclaim PepMap 100 C18 trap column (Dionex) by the EASY nLC1000 nanoUPLC (Thermo Scientific) and resolved on an Acclaim PepMap RSLC C18 analytical column (Dionex). Peptides were eluted at a linear gradient of 5-30% solvent B (80% v/v acetonitrile and 0.1% v/v formic acid) for 34 min at constant flow rate of 300 nL/min, followed by 2 min in 40% solvent B, then 2 min to 80% solvent B, and maintaining at 80% solvent B for 4 min. After separation, the peptides were ionized by the Nanospray Ionization (NSI) Source (Thermo Scientific) and analyzed using an online nanoflow LC–MS/MS on a Q 7 ACS Paragon Plus Environment
Page 9 of 45
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Journal of Proteome Research
Exactive quadrupole Orbitrap mass spectrometer (Thermo Scientific). For the MS/MS analysis, intact peptides with a resolution of 70000 or ion fragments with a resolution of 17500 were detected by the Orbitrap mass analyzer. They were then fragmented using 25% Normalized Collision Energy (NCE) with 4% stepped NCE. The data acquisition was performed in a data dependent manner whereas the 15 most intense precursor ions from each MS survey scan were selected for fragmentation. Full MS spectra were acquired from m/z 350 to 1800. The electrospray voltage applied was 1.8 kV. Automatic
gain control (AGC) was set so that the MS/MS were generated from 2 × 105 ion counts. Together 8 runs of LC–MS/MS were performed and data were pooled for further analysis.
Data Processing The pooled raw MS/MS data from 8 runs were analyzed by MaxQuant software with integrated Andromeda search engine (v. 1.3.0.5) as described. 8-9 Data were searched against the Uniprot M. abscessus ATCC19977/DSM44196 protein database (http://www.uniprot.org/proteomes/UP000007137) that contained 4940 protein sequences, the reverse decoy database and protein sequences of common contaminants. The search parameters in this study were: a) precursor mass ranged between 350 to 1800 Da; b) minimum peak count was set to 5; c) signal to noise threshold set to 1.5; d) trypsin was used as a protease and it was allowed up to 3 missed cleavages; e) precursor mass tolerance of 10 ppm and fragment tolerance of 0.02 Da; f) oxidation of methionine, acetylation on lysine and acetylation on protein N-terminus were set as 8 ACS Paragon Plus Environment
Journal of Proteome Research
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Page 10 of 45
variable modifications and carbamidomethylation of cysteine as fixed modification; and g) 1% false discovery rate. Minimum peptide length was set at 7. The identified lysine acetylation sites with localization probability less than 0.75 or acetyl-lysine sites from reverse or contaminant protein sequences were excluded from further analysis.
Bioinformatics Analysis (i) Protein Functional Annotation The identified acetylated proteins were grouped in the Gene Ontology Annotation (UniProt-GOA) Database (www.http://www.ebi.ac.uk/GOA/) to cellular component, biological process and molecular function based on their Gene Ontology (GO) terms. If an acetyl peptide could not be annotated by the UniProt-GOA database, the peptide was assigned to a GO function using the InterProScan software
26
based on its peptide
sequence. (ii) Functional Enrichment Analysis To gain information on which pathways were enriched for acetyl-lysines, GO enrichment analysis was performed to annotate protein pathways using the Kyoto Encyclopedia of Genes and Genomes (KEGG) online tools, the KEGG Automatic Annotation Server (KAAS) database and the KEGG mapper as described.
8, 23
The
InterPro database and InterProScan software were used to annotate protein domains and the protein’s subcellular localization was predicted using Cello database (http://cello.life.nctu.edu.tw/). In the enrichment analysis, fold enrichment was calculated using the formula: (a/c) / (b/d), where “a” was the number of acetylated 9 ACS Paragon Plus Environment
Page 11 of 45
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Journal of Proteome Research
proteins in each GO term; “b” was the number of proteins quantified in each GO term; “c” was the total number of acetylated proteins; and “d” was the total number of all proteins. Fisher’s exact test was applied to test if the fold enrichment of acetylated proteins in specific annotation term was significant. A GO term was defined as significantly enriched with acetylated proteins when the fold enrichment > 1 and the p-value < 0.05. (iii) Motif Analysis The acetyl-lysine motifs were analyzed using motif-X
27
to identify enrichment or
depletion of amino acids in the acetyl-21-mers (10 amino acids upstream and downstream of the site) compared to that of the non-acetyl-21-mers in an identical protein. The flanking sequence of 21-mers was examined to provide better coverage of the acetylation motif than the 13-mers. Fisher’s exact test was applied to test if the relative abundance of an amino acid was significant in a position of the acetyl-21-mers. The relative abundances were schematically represented as -log10 of the Fisher’s exact test p-values in an intensity map. The corresponding p-value < 0.05 was considered statistically significant. (iv) Motif-based Clustering Analysis. Clusters of acetylated proteins in each motif logo were subject to GO enrichment analysis as described. For the enrichment analysis of an individual logo in a specific GO term, the Fisher’s exact test p-value was −log10 transformed and it was given a Z score. The motif logos were then clustered to specific GO term using one-way hierarchical clustering (Euclidean distance, average linkage clustering) in Genesis. The 10 ACS Paragon Plus Environment
Journal of Proteome Research
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Page 12 of 45
clusters of motif logos in cellular component, biological process and molecular function were plotted in heat maps using the “heatmap.2” function from the ggplots plotting system in the R graphic package.
RESULTS Identification of MAB GZ002 Nucleotide sequence alignment showed that the full 16S rRNA gene sequence and, partial gyrB and rpoB gene sequences of the clinical isolate MAB GZ002 were identical with the reference strain MAB ATCC 19977 (Figure S-1). We therefore confirmed that the genetic background of this clinical isolate belonged to the MAB complex.
Identification of MAB GZ002 Lysine Acetylome There were 289 lysine-acetylated proteins and 459 acetylation sites in the MAB GZ002 harvested from 7H9 broth culture supplemented with glycerol and OADC. The total protein coverage rate of lysine acetylation in MAB GZ002 was 5.87% (Table 1). Figure 1A shows that 83% of the trypsinized peptides were within the range of trypsin cleavage sizes (8-20 peptides). 28-29 The accuracy of LC-MS/MS analysis was revealed by the mass error that measured the differences between the exact mass and the integer mass of the identified peptide. Majority of the identified peptides (81%) had mass errors of ± 2 ppm, below the mass tolerance of 10 ppm (Figure 1B). In the present study, a single lysine acetylation site was detected in approximately two-thirds (68%, 11 ACS Paragon Plus Environment
Page 13 of 45
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Journal of Proteome Research
199 proteins) of the lysine acetylome in MAB GZ002 (Figure 1C). The top three highly acetylated proteins were the 60 kDa chaperonin 2 (14 acetyl sites), the 70 kDa chaperone protein (10 acetyl sites) and the 2-oxoglutarate dehydrogenase (7 acetyl sites) (Table S-2). Overall, we found that at least 55% of the MAB GZ002 lysine acetylome (161 acetylated proteins) was in common with the lysine acetylome from MTB H37Rv
8
grown with OADC supplementations (Figure 2). Total 52 identical
acetylation sites in 37 proteins were identified in MAB GZ002 and MTB H37Rv,
8
including 18 identical acetylation sites in 10 proteins were also found in common with MTB H37Ra (Table 2). 9
GO Functional Annotation of Acetylated Protein We conducted GO annotation analysis to determine the functional domains of the MAB GZ002 lysine acetylome using the UniProt-GOA Database. Given the wide distribution of lysine acetylation in eukaryotic and prokaryotic cells,
11
we first
determined the distribution of acetylated proteins in three annotation categories, including the cellular component, biological process and molecular function. It was predicted that 9.5 % of the total proteins mapped to cellular component and biological processes were acetylated, while the acetylation rate in the molecular function category was 8.3 % (Table 3). Since the lysine acetylome profile of MAB GZ002 could be affected by the availabilities of carbon sources, the lysine acetylome identified here represented both carbohydrate and fatty acid metabolisms, as glycerol, oleic acid and dextrose supplementations were included in the culture media. 12 ACS Paragon Plus Environment
Journal of Proteome Research
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Page 14 of 45
Functional Enrichment Analysis GO enrichment analysis was applied to determine if certain cell functions or cellular components (GO terms) of the individual annotation category were in favor of lysine acetylation. We defined a GO term to be significantly enriched with acetyl-lysines if its respective Fisher’s exact test p-value was less than 0.05 (-log10 (p-value) > 1.3) with fold of enrichment greater than 1 (Figure 3 and Table S-3). In the cellular component category, lysine acetylation in the large ribosomal subunit (4 of 7 proteins acetylated in this GO term) was highly enriched (fold enrichment = 6) and statistically significant (-log10 (p-value) = 1.67) (Figure 3A). Two of the biological processes in MAB GZ002 were significantly enriched with acetyl-lysines, including the tricarboxylic acid cycle (8 of 14 proteins acetylated, fold enrichment = 6, -log10 (p-value) = 3.9) and the reactive oxygen species metabolic process (5 of 7 proteins acetylated, fold enrichment = 7.49, -log10 (p-value) = 2.66) (Figure 3A). In the molecular function category, the GO term translation factor activity (nucleic acid binding) that comprised of 12 proteins in which 6 were acetylated was significantly enriched with acetyl-lysines (fold enrichment = 6, -log10 (p-value) = 2.74) (Figure 3A). In addition, several cellular pathways and protein domains were also significantly enriched with acetyl-lysine. The abundance of acetyl-lines in carbon fixation pathways, the aerobic respiration (citrate cycle), carbon/lipid metabolism pathways, and the ATP-binding fold domain (Figure 3B and 3C) suggested this modification was involved in production and utilization of energy that were critical for bacterial survival. To further shed light on 13 ACS Paragon Plus Environment
Page 15 of 45
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Journal of Proteome Research
this hypothesis, we compared the list of proteins in the lysine acetylome of MAB GZ002 with the genomic regions in MTB that were essential for growth. About 25 % 30 to 37 % 31 of acetylated proteins in the MAB GZ002 were essential depending on the classification in different studies (Table S-2).
Lysine Acetylation Motif To assess if there was significant enrichment of specific amino acids flanking the acetylation sites respect to the general amino acid composition of the entire MAB GZ002 proteome, we conducted motif analysis and generated a heat map to illustrate the relative abundance of amino acids flanking the acetylation sites. It was evident that the acetyl-lysine flanked by four amino acids exhibited high abundances that included the aspartic acid (D) at the -1 position; histidine (H) at the +1 position; tyrosine (Y) at the +1 position; and phenylalanine (F) at the -2 and +1 positions (-log10 (p-value) = 5, Figure 4A). Enrichment of arginine (R) was found in further distance from the acetyl-lysine at +5 position (-log10 (p-value) = 5, Figure 4A).
Motif of Lysine Acetylome Is Clustered to Distinct Cell Functions There were four highly conserved motif sequences (motif logos) in the lysine acetylome of MAB GZ002. The Y, F and H were most commonly found at the +1 position of the acetyl-lysine and the D was commonly found at the -1 position (Figure 4B and Table S-4). Similar patterns were also characterized in the acetyl-lysine substrates derived from the mitochondria in higher eukaryotes.
32
The preference for 14
ACS Paragon Plus Environment
Journal of Proteome Research
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Page 16 of 45
hydrophilic Y at the +1 position (motif logo 1, KacY) was enriched in pup ligase/deamidase and DNA topoisomerase (type IIA−like domain) proteins (Figure 4C). The same motif logo was also found enriched in various binding activities of glycolysis or gluconeogenesis, carbon metabolism pathways and, to a lesser extent, in amino acids biosynthesis pathway (Figure 4D). Hydrophobic F at the +1 position of acetyl-lysine (motif logo 2, KacF) was more commonly found in the lysine-acetylated proteins annotated to the metabolic pathways of propanoate, β-alanine, fatty acid, and degradation of hydrophobic amino acids (Figure 4D). The motif sequence with a positively charged amino acid H at the +1 position (motif logo 3, KacH) was enriched in the ATP-citrate lyase, the succinyl-CoA ligase or synthetase-like protein domains, and in the TCA cycle of MAB GZ002 (Figure 4C and 4D). There was also a preference for negatively charged D at the -1 position (motif logo 4, DKac). Though it was not unique to the MAB GZ002, this pattern was not reported in MTB.
8-9
The motif logo 4 (DKac) could be commonly
found in the ATP-dependent Clp protease protein domain, as well as the transketolase-like protein, pyrimidine-binding protein and translocation-enhancing protein domains (Figure 4C). This motif was also enriched in diverse cellular pathways, including the RNA degradation and RNA polymerase pathways, and the purine metabolism pathway of mycobacteria (Figure 4D). The GO enrichment analysis further revealed clusters of metabolic processes and molecular functions for individual acetyl-lysine motif. Acetylated proteins containing motif KacY were significantly enriched in binding and isomerase activities (Figure 15 ACS Paragon Plus Environment
Page 17 of 45
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Journal of Proteome Research
S-2). It was also abundant in wide range of catabolic and metabolic processes as well as glycolysis process (Figure S-3). The motif KacF was mainly enriched in structural protein functions and had weak association with biological processes (Figure S-2 and S3). Consistence with the TCA cycle enriched characteristic, the motif KacH was more commonly found in the enzyme activities mediating acid-thiol and succinate-CoA formations, coenzyme A (CoA) activities of TCA cycle (Figure S-2), and in the cellular respiration processes (Figure S-3). Lastly the motif DKac sequence was abundant in the adenyl nucleotide and ATP binding activities other than the RNA polymerase activity (Figure S-2). The DKac was enriched in protein folding process, protein metabolic process and wide range of catabolic processes (Figure S-3). Taken together, the clusters of proteins that acted on distinctive processes or pathways were associated with the individual acetyl motif. Our results suggested that specific acetyltransferase(s) that recognized the respective motif might be responsible for the acetylation in different cellular pathways of MAB GZ002.
Lysine acetylome is associated with mycobacterial virulence and antibiotic resistance Several antibiotic resistance, virulence and bacterial persistence-related factors were identified in the lysine acetylome of MTB H37Rv, thus, it was speculated that the lysine acetylation might play a role in TB disease progression.
8
Searching from the
MTB drug-resistant genes listed in the TB Drug Resistance Mutation Database, 33 we identified 11 of 40 drug-resistance-related orthologous gene products in the lysine 16 ACS Paragon Plus Environment
Journal of Proteome Research
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Page 18 of 45
acetylome of MAB GZ002 (Table 4). In this list, 9 factors were commonly found in the lysine acetylome of MTB H37Rv, 8 and 3 factors were identified in the attenuated MTB H37Ra strain (Table 4). 9 The 3 highly conserved resistance-related factors in the lysine acetylomes of MAB GZ002, MTB H37Rv and MTB H37Ra were the DNA-directed RNA polymerase subunit β and β', and the 3-oxoacyl-[acyl-carrier-protein] reductase (Table 4). However the 3-oxoacyl-[acyl-carrier-protein] synthase 1 (MAB_4608) implicated in isoniazid (INH) resistance was found in the lysine acetylome of MAB GZ002 but not in MTB (Table 4). This may be important as the MTB but not the MAB was susceptible to INH. 34 We compared the MTB virulence factors
20, 35
with the lysine acetylome of MAB
GZ002 to show that 20 orthologous virulence-related factors were identified in the lysine acetylome of MAB GZ002, of which 16 were found in lysine acetylome of MTB H37Rv and 4 were identified in attenuated strain MTB H37Ra (Table 4). The 4 commonly acetylated virulence-related substrates in MAB GZ002 and the two strains of MTB were the glutamine synthetase, the sulfurtransferase, the isocitrate lyase and the long chain fatty acid CoA (Table 4). Since the actual molecular mechanisms underlying persistence infection of mycobacteria remain unclear, some genes that are considered as persistence-related 8, 36 might also play a role in the antibiotic resistance or virulence. We identified 6 persistence-related factors in the lysine acetylome of MAB GZ002, all of which were identified in the MTB H37Rv and 3 were found in the MTB H37Ra (Table 4). The long chain fatty acid CoA FadD26, the chaperone protein DnaK and the isocitrate lyase 17 ACS Paragon Plus Environment
Page 19 of 45
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Journal of Proteome Research
were the 3 commonly acetylated persistence-related factors (Table 4). The catalase-peroxidase that was required for activation of INH was substrate for lysine acetylation in MAB GZ002 and MTB H37Rv, but not in MTB H37Ra (Table 4).
DISCUSSION One of the advantages of using fresh clinical isolate rather than the laboratory reference strains was that we could limit the functional impairments due to significant genomic variations after long-history of cultivation in vitro. 37 The use of MAB GZ002 isolate could therefore provide more clinically relevant data. Using the anti-acetyl-lysine immunoprecipitation method, we were able to identify a vast array of acetylated proteins in MAB GZ002. The numbers of lysine-acetylated proteins in the metabolic pathways of MAB GZ002 were abundant, suggesting a role of lysine acetylation in regulating energy metabolism.
13, 38
Several key factors of the aerobic
respiration pathway were commonly found in the lysine acetylomes of MAB GZ002 and MTB. 8-9 These included the two subunits of pyruvate dehydrogenase (E1 and E2) that convert pyruvate to acetyl-CoA in the glycolysis cycle, the acetyl-CoA synthetase that ligates acetate and CoA to produce acetyl-CoA and the citrate synthase catalyzes the first reaction in the TCA cycle (Table S-5). 39 Therefore, lysine acetylation may be a common mechanism for metabolic regulations in mycobacteria. More intriguingly, 52 identical acetylation sites in 37 proteins were determined in MAB GZ002 and MTB H37Rv, 8 including an identical acetylation site (_VYKacNYDPR_) of citrate synthase (Table 2). This suggests that one or several acetyltransferases that recognize the same 18 ACS Paragon Plus Environment
Journal of Proteome Research
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Page 20 of 45
acetylation sites are evolutionary conserved between MAB GZ002 and MTB H37Rv, though these sites might also be the targets for non-enzymatic acetylation. 40 The lysine acetylation profile of mycobacteria can be influenced by numerous factors including the availability of energy sources. To generate comparable data, the MAB GZOO2 used in the present study was grown in a similar way as the previous MTBs studies,
8-9
of which the immunoprecipitation of the lysine acetylated proteins
was also done in a comparable fashion. It should be emphasized that glycerol and OADC supplements were included in the culture media. As shown in Table S-5, the lysine acetylome of MAB GZ002 contained at least 36 % (14 of 39) of the proteins in the glycolytic pathway, 29 % (14 of 48) in pyruvate metabolism pathway, and 48 % (16 of 33) in the TCA cycle. Our data therfore match the wide distribution of of lysine acetylation in central metabolism of MTB strains. 8-9 The key findings that stimulated interest in relationship between lysine acetylation and the anti-tuberculosis drug-resistance were the identifications of resistance-related factors in the lysine acetylome of MTB H37Rv
8
and the involvement of MAB
acetyltransferases in the aminoglycosides resistance. 3, 19 However, comparisons of the lysine acetylomes in MAB GZ002 and the two MTB strains revealed no specific correlations between lysine acetylation and drug-resistant or pathogenic strains (Table 4). Despite that, the abundances of lysine acetylation in drug-resistance- and virulence-related proteins suggests that acetylation may play a role in regulating some of these proteins.
19 ACS Paragon Plus Environment
Page 21 of 45
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Journal of Proteome Research
Persistent mycobacteria that have limited accesses to oxygen and carbohydrates can utilize the host cholesterol to survive. 41 In order to switch from carbohydrate to fatty acid metabolism, mycobacteria up-regulate isocitrate lyase
42
to promote the
productions of glyoxylate and succinate from the fatty acid-derived acetyl-CoA, and replenish the CO2-generating steps of TCA cycle that are catalyzed by the isocitrate dehydrogenase. 43 Both isocitrate lyase and isocitrate dehydrogenase were identified in the lysine acetylomes of MAB GZ002 (Table S-5), MTB H37Rv and H37Ra.
8-9
We
therefore speculated that the lysine acetylation might also play a role in regulation of the carbon flux between the TCA and glyoxylate cycle. To further support this notion, 45% of the proteins (15 of 33) in the glyoxylate cycle were lysine-acetylated in MAB GZ002 (Table S-5). Another requirement for the mycobacteria to persist in host cell is the ability to survive under hypoxia conditions.
44
The gene products of MAB_3008,
MAB_4254c, tpiA, MAB_2779c, gpmA and MAB_2109 in the glycolysis pathway; and the three subunits of succinate dehydrogenase, MAB_4421, MAB_4422 and MAB_4423 in the TCA pathway were considered as essential factors for the survival in hypoxia that had also been identified in the lysine acetylome of MAB GZ002 (Table S-2). The attenuated MTB H37Ra 9 might have lost the ability to acetylate certain number of virulence and persistence-related proteins. For example, the catalase-peroxidase (katG) that neutralized the reactive oxygen species produced by the macrophage
45
could be found in the lysine acetylomes of MAB GZ002 and MTB H37Rv, 8 but not in the MTB H37Ra (Table 4). 9 The chaperone protein (DnaK), the membrane lipoprotein (LprG) and cytoplasmic 60 kDa chaperonin proteins that mediated the toll-like receptor 20 ACS Paragon Plus Environment
Journal of Proteome Research
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Page 22 of 45
(TLR) signaling in macrophages and/or dendritic cells 46-48 were identified in the lysine acetylomes of MAB GZ002 and MTB H37Rv, 8 but not in MTB H37Ra (Table 4).
9
Our data therefore suggest that certain aspects of the lysine acetylation network are evolutionary conserved in the MAB GZ002 and MTB H37Rv. Comparison of the lysine acetylomes in MAB GZ002, MTB H37Rv 8 and MTB H37Ra
9
revealed 52 sites in 37 proteins were identical in MAB GZ002 and MTB
H37Rv, including 18 acetylation sites in 10 proteins were also conserved in MTB H37Ra (Table 2). Motif KacY was harbored in 8 of 52 identical acetylation sites followed by the motifs KacF and DKac that were found in 4 of 52 identical sites (Table 2). The KacY and KacF motifs were seen in the 18 identical acetylation sites conserved in MAB GZ002, MTB H37Rv 8 and MTB H37Ra
9
without a clear cluster pattern
(Table 2). Despite the discovery of these conserved elements in lysine acetylation of MAB GZ002 and MTB strains, it is still largely unknown whether these identical acetylation sites are the targets of specific acetyltransferases or non-enzymatic acetylations. The preferences for F, H and Y amino acid residues at the +1 position and D at -1 position of the acetyl-lyisne (Figure 4A) may suggest a preference for linear motif binding in MAB GZ002 if the lysine acetylation is carried out by the acetyltransferases. Recognition of H and Y at the +1 position of modified lysine was also observed in the lysine acetylomes of mouse mitochondrial fractions, 32 E. coli
49
and MAB H37Rv. 8 Functional enrichment analysis of the acetyl-lysine motifs revealed enrichments of KacH motif in aerobic respiration process and KacY in various 21 ACS Paragon Plus Environment
Page 23 of 45
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Journal of Proteome Research
carbohydrate metabolic processes that were consistent with the functions of mitochondria in eukaryotic cells (Figure S-3). Our findings therefore support the hypothesis that the acetylation motifs (KacH and KacY) and the possible related acetyltransferase(s) that recognize these motifs are evolutionally conserved. Additionally, the prefer recognition of amino acids at the ±1 position of the acetyl-lysine suggests that a small subset of acetyltransferases, similar to those identified in E. coli and mitochondria,
32, 49
might be responsible for the lysine
acetylation in MAB GZ002. However, E.coli acetylation can also be catalyzed by non-enzymatic reactions depending on the availability of acetyl phosphate and the accessibility of target lysine in the protein.
40, 50
D at the -1 position was seen more
enriched in an ackA mutant that accumulated acetyl phosphate in E. coli than in the wild type bacteria.
40, 50
Therefore, at least for the D at the -1 position, the specificity of
acetylation motif in target proteins can also be due to non-enzymatic acetylation.
CONCLUSIONS
Here we demonstrate a profile of lysine acetylation in the proteome of rapid-growing strain MAB GZ002. Pathways enriched for this PTM are carbon metabolism, aerobic respiration and TCA cycle. Several drug-resistant, virulence and persistence-related factors were acetylated in MAB GZ002 and MTB strains. An array of identical acetylation sites in MAB GZ002 and MTB strains was also identified. Though the contribution and specificity of acetyltransferases and the non-enzymatic 22 ACS Paragon Plus Environment
Journal of Proteome Research
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Page 24 of 45
acetylation to the acetylation profile in MAB GZ002 is unclear, the involvement of acetylated proteins in MAB pathogenesis and drug-resistance suggests that they are good
targets
for
future
study.
A
better
understanding
of
the
lysine
acetylation-dependent mechanisms in MAB disease progression may lead to potential avenues for more effective treatments.
Figure Caption Figure 1. Profiling lysine acetylome of MAB GZ002. Acetylation covered 5.87 % of total proteins (289 over 4920 proteins) in MAB GZ002. (A) Distribution of trypsin-cleaved peptide length. (B) Distribution of peptide score and mass error of identified acetyl peptides. (C) Number of acetylation sites in lysine acetylated proteins.
Figure 2. Comparison of the lysine acetylomes of MAB GZ002 and the MTB H37Rv adopted from Xie et al. 8
Figure 3. Functional enrichment analysis of lysine acetylome. (A) Enrichment analysis of acetyl protein based on annotations of acetyl-lysine sites for cellular component, biological processes and molecular function. (B) KEGG pathway
23 ACS Paragon Plus Environment
Page 25 of 45
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Journal of Proteome Research
enrichment analysis for lysine acetylated proteins. (C) InterPro protein domain enrichment analysis for lysine acetylated proteins. Data was presented as –log10 of Fisher exact test p value. A GO term was defined as significantly enriched with acetyl-lysine when –log10 (Fisher’s test p-value) > 1.3 with fold enrichment > 1. All examples shown meet both the p-value and fold-enrichment threshold significance.
Figure 4. Bioinformational analysis of the lysine acetylation sites and functional enrichment analysis of the motif logos. (A) The intensity map showed the relative abundance of amino acids flanking the acetyl-lysine at the acetyl-21-mers (10 amino acids upstream and downstream of the site). The relative abundance was represented as the –log10 (Fisher’s test p-value) of an amino acid in a position from the acetyl-lysine. Red, amino acid was most abundant. Green, amino acid was least abundant. (B) The four most abundant acetylated peptide motif logos. The * symbol represented random amino acid. (C) Heat map representing the InterPro protein domain enrichment analysis for the motif logos. (D) Heat map representing the KEGG pathway enrichment analysis of the motif logos. For each GO term, the –log10 (Fisher’s test p-value) was calculated and the degree of enrichment was estimated according to defined criteria based on Z scores (Zs). Motif logo was significantly enriched when Zs = -1, indicated as red.
24 ACS Paragon Plus Environment
Journal of Proteome Research
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Page 26 of 45
Table 1. Lysine acetylation in Mycobacterium species. Genome
No. of
No. of
No. of
%
size
total
acetylated
acetylation
Proteome
(Mb)
proteins
proteins
sites
acetylated*
GZ002
5.09
4920
289
459
5.87
M. tuberculosis
a
4.41
4018
658
1128
16.38
M. tuberculosis
b
4034
137
226
3.40
Species M. abscessus
Strain
H37Rv H37Ra
a
4.42 8
Data adopted from Xie et al. Data adopted from Liu et al. 9 * % Proteome acetylated = (No. of acetylated proteins / No. of total protein)*100 b
Table 2. Identical lysine acetylation sites in MAB GZ002 and MTB H37Rv. Common lysine acetylated protein with MTB H37Rv 8
Identical lysine acetylatation motif #
Functional category *
DNA gyrase subunit A
_TKacTDLYR_
Information pathways
DNA-directed RNA polymerase subunit beta
_LHHLVDDKacIHAR_
Information pathways
DNA-directed RNA polymerase subunit beta'
_TLKPEKacDGLFCEK_#
Information pathways
RNA polymerase sigma factor SigA
_AVEKacFDYTK_
Information pathways
Elongation factor G
_QADKacYDVPR_
Information pathways
DNA-binding protein HU homolog (Histone-like)
_AELIDVLTQKacLGSDR_
Information pathways
Translation initiation factor IF-3
_IDDHDYETKacK_
Information pathways
50S ribosomal protein L19
_AKacLYYLR_
Information pathways
50S ribosomal protein L5
_SIAQFKacLR_
Information pathways
30S ribosomal protein S16
_ITGDWQKacFK_
Information pathways
Ribonuclease 3
_SEGDLAKacLR_
Information pathways
DNA polymerase III subunit beta
_LLDAEFPKacFR_#
Information pathways
50S ribosomal protein L7/L12
_DLVDGAPKacPLLEK_#
Information pathways
25 ACS Paragon Plus Environment
Page 27 of 45
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Journal of Proteome Research
_TTLTAAITKacVLHDK_# Elongation factor Tu
Information pathways _VSALKacALEGDA_#
30S ribosomal protein S3
_ADLEKacLTGK_
Information pathways
Ribonucleoside-diphosphate reductase
_SKacYASGEFFDK_
Information pathways
Sulfurtransferase
_DFVDQQQFSKacLLSDK_
Intermediary metabolism and respiration
_DEPHADAEKacYR_ Pup-protein ligase
_IAQLDLAYHDIKacR_
Intermediary metabolism and respiration
_TVLCKacDPFR_ Phosphoenolpyruvate carboxykinase [GTP]
_LPK VFFVNWFR_
Putative oxidoreductase /3-ketosteroid-9-alpha-hydrox ylase reductase subunit
_ELAAKacYPDR_
Adenosylhomocysteinase
_FDNKacYGTR_
Deoxycytidine triphosphate deaminase
_LEGKacSSLGR_
Succinyl-diaminopimelate desuccinylase
_AKacYGWTDVSR_
ATP-dependent Clp protease proteolytic subunit
_ESNPYNKacLFEER_
Imidazoleglycerol-phosphate dehydratase
_DPHHITEAQYKacAVAR_
Citrate synthase
_VYKacNYDPR_
ac
#
_HFFVHYKacDLEPGK_# Inorganic pyrophosphatase _NKacYEVDHETGR_# Transaldolase
_VDTEIDKacR_
Probable ATP-dependent Clp protease ATP-binding subunit
_YIEKacDAALER_
Probable mannose-6-phosphate isomerase ManA
_GGLTPKacHVDVPELLR_
Intermediary metabolism and respiration Intermediary metabolism and respiration Intermediary metabolism and respiration Intermediary metabolism and respiration Intermediary metabolism and respiration Intermediary metabolism and respiration Intermediary metabolism and respiration Intermediary metabolism and respiration Intermediary metabolism and respiration Intermediary metabolism and respiration Intermediary metabolism and respiration Intermediary metabolism and respiration
26 ACS Paragon Plus Environment
Journal of Proteome Research
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Probable phosphoglucomutase PgmA
Intermediary metabolism and respiration
_PSGTEDVYKacIYAESF_ _DKacMAMQR_
Virulence, detoxification, adaptation
Chaperone protein DnaK _NTTIPTKacR_ _AVNIKacPLEDK_# 10 kDa chaperonin
Page 28 of 45
Virulence, detoxification, adaptation
ac
_YGGTEIK YNGEEYLILS AR_# _LAKacLAGGVAVIK_# _AVEKacVTETLLK_# _GIEKacAVEK_ _IGAELVKacEVAK_#
60 kDa chaperonin _KacWGAPTITNDGVSIAK_#
Virulence, detoxification, adaptation
_WGAPTITNDGVSIAKacEIE LEDPYEK_ _NVAAGANPLGLKacR_# _NVVLEKacK_# _KacHSEIMGTINQQR_ Antigen 84
_PLTPADVHNVAFSKacPPI GK_#
Cell wall and cell processes
Probable acyl-CoA thiolase FadA (MAB_0850)
_FQKacDLNIPDEK_
Lipid metabolism
FHA domain-containing protein
_EPVDSAVLANGDEVQIG KacFR_#
Uncharacterized
, Identical acetylation site also in lysine acetylome of MTB H37Ra 9 *, Classifications were based on the Tuberculist database (http://tuberculist.epfl.ch/) #
Table 3. Percentage of lysine acetylation of MAB GZ002 in 3 gene ontology categories. No. of total proteins
No. of acetylated
mapped per
proteins per
category
category
Cell component
880
84
9.5
Biological processes
1836
175
9.5
% Acetylation per category
27 ACS Paragon Plus Environment
Page 29 of 45
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Journal of Proteome Research
2808
Molecular function
233
8.3
Table 4. Comparison of the antibiotic resistance, virulence and persistence-related factors identified in this and previous studies. Lysine acetylation Protein description
Antibiotic resistance
MAB GZ002 isolate*
MTB H37Ra**
MTB H37Rv***
3-oxoacyl-[acyl-carrier-protein] synthase 1
YES
-
-
Enoyl-[acyl-carrier-protein] reductase [NADH]
YES
-
YES
Catalase-peroxidase
YES
-
YES
3-oxoacyl-[acyl-carrier-protein] reductase
YES
YES
YES
DNA gyrase subunit A
YES
-
YES
DNA gyrase subunit B
YES
-
YES
DNA-directed RNA polymerase subunit beta
YES
YES
YES
DNA-directed RNA polymerase subunit beta'
YES
YES
YES
Enhanced intracellular survival protein
YES
-
-
Polyribonucleotide nucleotidyltransferase
YES
-
YES
Putative aminoglycoside phosphotransferase
YES
-
YES
Superoxide dismutase
YES
-
YES
Protein translocase subunit
YES
-
YES
Iron-dependent repressor
YES
-
YES
Glutamine synthetase
YES
YES
YES
Sulfurtransferase
YES
YES
YES
RNA polymerase sigma factor sigA
YES
-
YES
Virulence
28 ACS Paragon Plus Environment
Journal of Proteome Research
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Persistence
Page 30 of 45
Enhanced intracellular survival protein
YES
-
-
Putative cytochrome P450, steroid C27-monooxygenase
YES
-
YES
Nucleoside diphosphate kinase
YES
-
-
Possible Mg2+ transport P-type ATPase C
YES
-
-
Isocitrate lyase
YES
YES
YES
O-succinylbenzoate-CoA ligase
YES
-
YES
Long chain fatty acid CoA FadD26
YES
YES
YES
Lipoprotein lprG
YES
-
YES
Catalase-peroxidase
YES
-
YES
Pup-protein ligase
YES
-
YES
Iron-dependent repressor IdeR
YES
-
YES
Phosphoenolpyruvate carboxykinase [GTP]
YES
-
YES
60 kDa chaperonin
YES
-
YES
Superoxide dismutase [Cu-Zn]
YES
-
-
Long chain fatty acid CoA FadD26
YES
YES
YES
Chaperone protein DnaK
YES
YES
YES
Isocitrate lyase
YES
YES
YES
Catalase-peroxidase
YES
-
YES
3-ketosteroid-9-alpha-hydroxyla se reductase subunit
YES
-
YES
-
YES
ABC transporter ATP-binding YES protein *, MAB GZ002 acetylome dataset from this study **, MTB H37Ra acetylome dataset from Liu et al. 2014. 9 ***, MTB H37Rv acetylome dataset from Xie et al. 2015. 8
ASSOCIATED CONTENT
29 ACS Paragon Plus Environment
Page 31 of 45
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Journal of Proteome Research
Supporting Information Supporting Method; Figure S-1. Nucleotide sequence alignment of partial sequences of the (A) 16S rRNA, (B) gryB and (C) rpoB genes; Figure S-2. Functional enrichment analysis of the motif logos based on annotations of acetyl-lysine sites for molecular function; Figure S-3. Functional enrichment analysis of the motif logos based on annotations of acetyl-lysine sites for biological process; Table S-1. Primers used for PCR; Table S-2. Acetylated protein annotation summary; Table S-3. Acetyl protein functional enrichment analysis; Table S-4. Acetyl-lysine motif functional enrichment analysis; Table S-5. Lysine acetylated proteins in metabolic processes; MS-MS spectra of lysine acetylated peptides in MAB GZ002. All the supporting methods, figures and tables are available free of charge via the Internet at http://pubs.acs.org.
Raw
MS-MS
spectra
are
available
publicly
via
the
ProteomeXchange Datasets with dataset identifier number PXD004038.
AUTHOR INFORMATION
Corresponding Author *Tianyu Zhang: Room A132, 190 Kaiyuan Road, Science Park, Guangzhou, Guangdong,
China
510530;
Tel:
(+86)-20-32015270;
Email:
[email protected] Author Contributions 30 ACS Paragon Plus Environment
Journal of Proteome Research
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
‡
Page 32 of 45
Jintao Guo and Changwei Wang contributed equally to this work.
ACKNOWLEDGMENT This work was supported by One Hundred Talents Program of the Chinese Academy of Sciences (Category A, to T.Z.), by the Open Project Grant (2014SKLRD-O06) and the Key Project Grant (SKLRD2016ZJ003) from the State Key Lab of Respiratory Disease, Guangzhou Institute of Respiratory Disease, First Affiliated Hospital of Guangzhou Medical University, by the PhD Start-up Fund of Natural Science Foundation of Guangdong Province, China (2016A030310123 to J.G.), by the Chinese Academy of Sciences-Commonwealth Scientific and Industrial Research Organization Joint Grant (154144KYSB20150045) and partially supported by Guangzhou Municipal Industry and Research Collaborative Innovation Program (201508020248) and Guangzhou Municipal Clinical Medical Center Program (155700012).
ABBREVIATIONS USED GNATs,
general
control
of
amino
acid
synthesis
protein
5
-related
N-acetyltransferases; GO, gene ontology; INH, isoniazid; InhA, enoyl-acyl carrier protein
reductase;
KatG,
catalase-peroxidase;
LC-MS/MS,
liquid
chromatography-tandem mass spectrometry; LprG, membrane lipoprotein; MAB, Mycobacterium abscessus; MTB, Mycobacterium tuberculosis; NTM, nontuberculous 31 ACS Paragon Plus Environment
Page 33 of 45
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Journal of Proteome Research
mycobacteria;
OADC,
Oleic
acid-bovine
albumin-dextrose-catalase;
PTMs,
post-translational modifications; rpoB, DNA-directed RNA polymerase subunit β; TB, tuberculosis; TCA, tricarboxylic acid.
REFERENCES
(1). Daley, C. L.; Griffith, D. E. Pulmonary non-tuberculous mycobacterial infections. Int. J. Tuberc. Lung Dis. 2010, 14 (6), 665-671. (2). Benwill, J. L.; Wallace, R. J. Mycobacterium abscessus: challenges in diagnosis and treatment. Curr. Opin. Infect. Dis. 2014, 27 (6), 506-510. (3). Nessar, R.; Cambau, E.; Reyrat, J. M.; Murray, A.; Gicquel, B. Mycobacterium abscessus: a new antibiotic nightmare. J. Antimicrob. Chemother. 2012, 67 (4), 810-818. (4). Griffith, D. E.; Aksamit, T.; Brown-Elliott, B. A.; Catanzaro, A.; Daley, C.; Gordin, F.; Holland, S. M.; Horsburgh, R.; Huitt, G.; Iademarco, MF.; Iseman, M.; Olivier, K.; Ruoss, S.; von Reyn, C. F.; Wallace, R. J. Jr.; Winthrop, K. Subcommittee, ATSMD. An official ATS/IDSA statement: Diagnosis, treatment, and prevention of nontuberculous mycobacterial diseases. Am. J. Respir. Crit. Care Med. 2007, 175 (4), 367-416. (5). Adjemian, J.; Olivier, K. N.; Seitz, A. E.; Holland, S. M.; Prevots, D. R. 32 ACS Paragon Plus Environment
Journal of Proteome Research
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Page 34 of 45
Prevalence of nontuberculous mycobacterial lung disease in US medicare beneficiaries. Am. J. Respir. Crit. Care Med. 2012, 185 (8), 881-886. (6). Shao, Y.; Chen, C.; Song, H.; Li, G.; Liu, Q.; Li, Y.; Zhu, L.; Martinez, L.; Lu, W. The epidemiology and geographic distribution of nontuberculous mycobacteria clinical isolates from sputum samples in the eastern region of China. PLoS Negl. Trop. Dis. 2015, 9 (3), e0003623. (7). Jain, S.; Sankar, M. M.; Sharma, N.; Singh, S.; Chugh, T. D. High prevalence of non-tuberculous mycobacterial disease among non-HIV infected individuals in a TB endemic country - experience from a tertiary center in Delhi, India. Pathog. Glob. Health. 2014, 108 (2), 118-122. (8). Xie, L.; Wang, X.; Zeng, J.; Zhou, M.; Duan, X.; Li, Q.; Zhang, Z.; Luo, H.; Pang, L.; Li, W.; Liao, G.; Yu, X.; Li, Y.; Huang, H.; Xie, J. Proteome-wide lysine acetylation profiling of the human pathogen Mycobacterium tuberculosis. Int. J. Biochem. Cell Biol. 2015, 59, 193-202. (9). Liu, F. Y.; Yang, M. K.; Wang, X. D.; Yang, S. S.; Gu, J.; Zhou, J.; Zhang, X. E.; Deng, J. Y.; Ge, F. Acetylome analysis reveals diverse functions of lysine acetylation in Mycobacterium tuberculosis. Mol. Cell. Proteomics. 2014, 13 (12), 3352-3366. (10). van Els, C. A. C. M.; Corbiere, V.; Smits, K.; van Gaans-van den Brink, J. A. M.; Poelen, M. C. M.; Mascart, F.; Meiring, H. D.; Locht, C. Toward understanding the essence of post-translational modifications for the Mycobacterium tuberculosis immunoproteome. Front. Immunol. 2014, 5, 361. (11). Glozak, M. A.; Sengupta, N.; Zhang, X. H.; Seto, E. Acetylation and 33 ACS Paragon Plus Environment
Page 35 of 45
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Journal of Proteome Research
deacetylation of non-histone proteins. Gene. 2005, 363, 15-23. (12). Venne, A. S.; Kollipara, L.; Zahedi, R. P. The next level of complexity: crosstalk of posttranslational modifications. Proteomics. 2014, 14 (4-5), 513-524. (13). Wang, Q. J.; Zhang, Y. K.; Yang, C.; Xiong, H.; Lin, Y.; Yao, J.; Li, H.; Xie, L.; Zhao, W.; Yao, Y. F.; Ning, Z. B.; Zeng, R.; Xiong, Y.; Guan, K. L.; Zhao, S. M.; Zhao, G. P. Acetylation of metabolic enzymes coordinates carbon source utilization and metabolic flux. Science. 2010, 327 (5968), 1004-1007. (14). Shaw, B. F.; Arthanari, H.; Narovlyansky, M.; Durazo, A.; Frueh, D. P.; Pollastri, M. P.; Lee, A.; Bilgicer, B.; Gygi, S. P.; Wagner, G.; Whitesides, G. M. Neutralizing positive charges at the surface of a protein lowers its rate of amide hydrogen exchange without altering its structure or increasing its thermostability. J. Am. Chem. Soc. 2010, 132 (49), 17411-17425. (15). van Noort, V.; Seebacher, J.; Bader, S.; Mohammed, S.; Vonkova, I.; Betts, M. J.; Kuhner, S.; Kumar, R.; Maier, T.; O'Flaherty, M.; Rybin, V.; Schmeisky, A.; Yus, E.; Stulke, J.; Serrano, L.; Russell, R. B.; Heck, A. J. R.; Bork, P.; Gavin, A. C. Cross-talk between phosphorylation and lysine acetylation in a genome-reduced bacterium. Mol. Syst. Biol. 2012, 8, 571. (16). Kouzarides, T. Chromatin modifications and their function. Cell. 2007, 128 (4), 693-705. (17). Smith, G. T.; Sweredoski, M. J.; Hess, S. O-linked glycosylation sites profiling in Mycobacterium tuberculosis culture filtrate proteins. J. Proteomics. 2014, 97, 296-306. 34 ACS Paragon Plus Environment
Journal of Proteome Research
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Page 36 of 45
(18). Fortuin, S.; Tomazella, G. G.; Nagaraj, N.; Sampson, S. L.; van Pittius, N. C. G.; Soares, N. C.; Wiker, H. G.; de Souza, G. A.; Warren, R. M. Phosphoproteomics analysis of a clinical Mycobacterium tuberculosis Beijing isolate: expanding the mycobacterial phosphoproteome catalog. Front. Microbiol. 2015, 6, 6. (19). Tsai, S-H.; Shen, G-H.; Lin, C-H.; Liau, J-R.; Lai, H-C.; Hu, S-T. Mab_3168c, a putative
acetyltransferase,
enhances
adherence,
intracellular
survival
and
antimicrobial resistance of Mycobacterium abscessus. PLoS ONE. 2013, 8 (6), e67563. (20). Ripoll, F.; Pasek, S.; Schenowitz, C.; Dossat, C.; Barbe, V.; Rottman, M.; Macheras, E.; Heym, B.; Herrmann, J. L.; Daffe, M.; Brosch, R.; Risler, J. L.; Gaillard, J. L. Non mycobacterial virulence genes in the genome of the emerging pathogen Mycobacterium abscessus. PLoS ONE. 2009, 4 (6), e5660. (21). Leao, S. C.; Tortoli, E.; Viana-Niero, C.; Mizuka Ueki, S. Y.; Batista Lima, K. V.; Lopes, M. L.; Yubero, J.; Carmen Menendez, M.; Jesus Garcia, M. Characterization of mycobacteria from a major Brazilian outbreak suggests that revision of the taxonomic status of members of the Mycobacterium chelonae-M. abscessus group is needed. J. Clin. Microbiol. 2009, 47 (9), 2691-2698. (22). Shallom, S. J.; Gardina, P. J.; Myers, T. G.; Sebastian, Y.; Conville, P.; Calhoun, L. B.; Tettelin, H.; Olivier, K. N.; Uzel, G.; Sampaio, E. P.; Holland, S. M.; Zelazny, A. M. New rapid scheme for distinguishing the subspecies of the Mycobacterium abscessus group and identifying Mycobacterium massiliense isolates with inducible clarithromycin resistance. J. Clin. Microbiol. 2013, 51 (9), 2943-2949. 35 ACS Paragon Plus Environment
Page 37 of 45
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Journal of Proteome Research
(23). Xie, L. X.; Liu, W.; Li, Q. M.; Chen, S. D.; Xu, M. M.; Huang, Q. Q.; Zeng, J.; Zhou, M. L.; Xie, J. P. First succinyl-proteome profiling of extensively drug-resistant Mycobacterium tuberculosis revealed involvement of succinylation in cellular physiology. J. Proteome Res. 2015, 14 (1), 107-119. (24). Prasad, T. S. K.; Verma, R.; Kumar, S.; Nirujogi, R. S.; Sathe, G. J.; Madugundu, A. K.; Sharma, J.; Puttamallesh, V. N.; Ganjiwale, A.; Myneedu, V. P.; Chatterjee, A.; Pandey, A.; Harsha, H.; Narayana, J. Proteomic analysis of purified protein derivative of Mycobacterium tuberculosis. Clin. Proteomics. 2013, 10 (1), 8. (25). Zhang, K.; Zheng, S.; Yang, J. S.; Chen, Y.; Cheng, Z. Comprehensive profiling of protein lysine acetylation in Escherichia coli. J. Proteome Res. 2013, 12 (2), 844-851. (26). Quevillon, E.; Silventoinen, V.; Pillai, S.; Harte, N.; Mulder, N.; Apweiler, R.; Lopez, R. InterProScan: protein domains identifier. Nucleic Acids Res. 2005, 33, W116-W120. (27). Schwartz, D.; Gygi, S. P. An iterative statistical approach to the identification of protein phosphorylation motifs from large-scale data sets. Nat. Biotechnol. 2005, 23 (11), 1391-1398. (28). Burkhart, J. M.; Schumbrutzki, C.; Wortelkamp, S.; Sickmann, A.; Zahedi, R. P. Systematic and quantitative comparison of digest efficiency and specificity reveals the impact of trypsin quality on MS-based proteomics. J. Proteomics. 2012, 75 (4), 1454-1462. (29). Swaney, D. L.; Wenger, C. D.; Coon, J. J. Value of using multiple proteases for 36 ACS Paragon Plus Environment
Journal of Proteome Research
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Page 38 of 45
large-scale mass spectrometry-based proteomics. J. Proteome Res. 2010, 9 (3), 1323-1329. (30). Zhang, Y. J. J.; Ioerger, T. R.; Huttenhower, C.; Long, J. E.; Sassetti, C. M.; Sacchettini, J. C.; Rubin, E. J. Global assessment of genomic regions required for growth in Mycobacterium tuberculosis. PLoS Pathog. 2012, 8 (9), e1002946. (31). Griffin, J. E.; Gawronski, J. D.; DeJesus, M. A.; Ioerger, T. R.; Akerley, B. J.; Sassetti, C. M. High-resolution phenotypic profiling defines genes essential for mycobacterial growth and cholesterol catabolism. PLoS Pathog. 2011, 7 (9), e1002251. (32). Kim, S. C.; Sprung, R.; Chen, Y.; Xu, Y. D.; Ball, H.; Pei, J. M.; Cheng, T. L.; Kho, Y.; Xiao, H.; Xiao, L.; Grishin, N. V.; White, M.; Yang, X. J.; Zhao, Y. M. Substrate and functional diversity of lysine acetylation revealed by a proteomics survey. Mol. Cell. 2006, 23 (4), 607-618. (33). Sandgren, A.; Strong, M.; Muthukrishnan, P.; Weiner, B. K.; Church, G. M.; Murray, M. B. Tuberculosis drug resistance mutation database. PLoS Med. 2009, 6 (2), 132-136. (34). Brown-Elliott, B. A.; Nash, K. A.; Wallace, R. J. Antimicrobial susceptibility testing, drug resistance mechanisms, and therapy of infections with nontuberculous mycobacteria. Clin. Microbiol. Rev. 2012, 25 (3), 545-582. (35). Forrellad, M. A.; Klepp, L. I.; Gioffre, A.; Garcia, J. S. Y.; Morbidoni, H. R.; Santangelo, M. D.; Cataldi, A. A.; Bigi, F. Virulence factors of the Mycobacterium tuberculosis complex. Virulence. 2013, 4 (1), 3-66. 37 ACS Paragon Plus Environment
Page 39 of 45
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Journal of Proteome Research
(36). Wang, X. Z.; Wang, H. H.; Xie, J. P. Genes and regulatory networks involved in persistence of Mycobacterium tuberculosis. Sci. China-Life Sci. 2011, 54 (4), 300-310. (37). Perna, N. T.; Plunkett, G.; Burland, V.; Mau, B.; Glasner, J. D.; Rose, D. J.; Mayhew, G. F.; Evans, P. S.; Gregor, J.; Kirkpatrick, H. A.; Posfai, G.; Hackett, J.; Klink, S.; Boutin, A.; Shao, Y.; Miller, L.; Grotbeck, E. J.; Davis, N. W.; Limk, A.; Dimalanta, E. T.; Potamousis, K. D.; Apodaca, J.; Anantharaman, T. S.; Lin, J. Y.; Yen, G.; Schwartz, D. C.; Welch, R. A.; Blattner, F. R.; Genome sequence of enterohaemorrhagic Escherichia coli O157 : H7. Nature. 2001, 409 (6819), 529-533. (38). Zhao, S. M.; Xu, W.; Jiang, W. Q.; Yu, W.; Lin, Y.; Zhang, T. F.; Yao, J.; Zhou, L.; Zeng, Y. X.; Li, H.; Li, Y. X.; Shi, J.; An, W. L.; Hancock, S. M.; He, F. C.; Qin, L. X.; Chin, J.; Yang, P. Y.; Chen, X.; Lei, Q. Y.; Xiong, Y.; Guan, K. L. Regulation of cellular metabolism by protein lysine acetylation. Science. 2010, 327 (5968), 1000-1004. (39). Remington, S. J. Structure and mechanism of citrate synthase. Curr. Top. Cell. Regul. 1992, 33, 209-229. (40). Weinert, B. T.; Iesmantavicius, V.; Wagner, S. A.; Scholz, C.; Gummesson, B.; Beli, P.; Nystrom, T.; Choudhary, C. Acetyl-phosphate is a critical determinant of lysine acetylation in E. coli. Mol. Cell. 2013, 51 (2), 265-272. (41). Pandey, A. K.; Sassetti, C. M. Mycobacterial persistance requires the utilization of host cholesterol. Proc. Natl. Acad. Sci. U. S. A. 2008, 105 (11), 4376-4380. (42). Munoz-Elias, E. J.; McKinney, J. D. Mycobacterium tuberculosis isocitrate 38 ACS Paragon Plus Environment
Journal of Proteome Research
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Page 40 of 45
lyases 1 and 2 are jointly required for in vivo growth and virulence. Nat. Med. 2005, 11 (6), 638-644. (43). Ehrt, S.; Schnappinger, D. Mycobacterium tuberculosis virulence: lipids inside and out. Nat. Med. 2007, 13 (3), 284-285. (44). Fang, X.; Wallqvist, A.; Reifman, J. Modeling phenotypic metabolic adaptations of Mycobacterium tuberculosis H37Rv under hypoxia. PLoS Comput. Biol. 2012, 8 (9), e1002688. (45). Ng, V. H.; Cox, J. S.; Sousa, A. O.; MacMicking, J. D.; McKinney, J. D. Role of KatG catalase-peroxidase in mycobacterial pathogenesis: countering the phagocyte oxidative burst. Mol. Microbiol. 2004, 52 (5), 1291-1302. (46). Bulut, Y.; Michelsen, K. S.; Hayrapetian, L.; Naiki, Y.; Spallek, R.; Singh, M.; Arditi, M. Mycobacterium tuberculosis heat shock proteins use diverse toll-like receptor pathways to activate pro-inflammatory signals. J. Biol. Chem. 2005, 280 (22), 20961-20967. (47). Bartow, R. A.; McMurray, D. N. Cellular and humoral immune responses to mycobacterial stress proteins in experimental pulmonary tuberculosis. Tuber. Lung Dis. 1997, 78 (3-4), 185-193. (48). Drage, M. G.; Tsai, H-C.; Pecora, N. D.; Cheng, T-Y.; Arida, A. R.; Shukla, S.; Rojas, R. E.; Seshadri, C.; Moody, D. B.; Boom, W. H.; Sacchettini, J. C.; Harding, C. V. Mycobacterium tuberculosis lipoprotein LprG (Rv1411c) binds triacylated glycolipid agonists of Toll-like receptor 2. Nat. Struct. Mol. Biol. 2010, 17 (9), 1088-1095. 39 ACS Paragon Plus Environment
Page 41 of 45
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Journal of Proteome Research
(49). Zhang, J. M.; Sprung, R.; Pei, J. M.; Tan, X. H.; Kim, S.; Zhu, H.; Liu, C. F.; Grishin, N. V.; Zhao, Y. M. Lysine acetylation is a highly abundant and evolutionarily conserved modification in Escherichia coli. Mol. Cell. Proteomics. 2009, 8 (2), 215-225. (50). Kuhn, M. L.; Zemaitaitis, B.; Hu, L. I.; Sahu, A.; Sorensen, D.; Minasov, G.; Lima, B. P.; Scholle, M.; Mrksich, M.; Anderson, W. F.; Gibson, B. W.; Schilling, B.; Wolfe, A. J. Structural, kinetic and proteomic characterization of acetyl phosphate-dependent bacterial protein acetylation. PLoS ONE. 2014, 9 (4), e94816.
40 ACS Paragon Plus Environment
Journal of Proteome Research
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
for TOC only Carbon metabolism
M. abscessus Protein extraction
TCA cycle Trypsin diges,on An,-acetyl-lysine precipita,on LC-MS/MS
Protein Lysine Acetylation
Aerobic respiration Antibiotic resistance Virulence
Bioinforma,cs analysis
Persistence
ACS Paragon Plus Environment
Page 42 of 45
Page 43 of 45
Figure 1 A
60
B
83%
400
81%
Peptide Score
No. of peptides
50 40 30 20
300
200
100
10 0
5
10
15
20
25
30
35
40
45
C
-6
-4
-2
0
2
4
200 150 100 50 10 5 0 1
2
3
4
5
6
7
8
6
Mass error (ppm)
Peptide length
No. of proteins
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Journal of Proteome Research
9 10 10+
No. of acetylation sites
ACS Paragon Plus Environment
8
10
Journal of Proteome Research
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Figure 2 Lysine acetylated proteins
M. tuberculosis H37Rv
497
161
128 M. abscessus GZ002
ACS Paragon Plus Environment
Page 44 of 45
Page 45 of 45
A
Cellular component
Figure 3 Cytoplasmic part
[5.41]
Non-membrane-bounded organelle
[4.79] [4.32] [4.42] [2.17] [1.67] [1.97] [2.59]
Ribonucleoprotein complex Ribosome Ribosomal subunit Large ribosomal subunit Hexose catabolic process
Biological processes
DNA replication Protein folding Monosaccharide metabolic process Hexose metabolic process Glucose metabolic process NADP metabolic process Aerobic respiration Tricarboxylic acid cycle Reactive oxygen species metabolic process Purine nucleotide binding
Molecular function
Ligase activity RNA binding Ligase activity (carbon-nitrogen bonds) Oxidoreductase activity (CH-OH donors) rRNA binding Oxidoreductase activity (NAD or NADP acceptor) Structural molecule activity Structural constituent of ribosome Translation factor activity, nucleic acid binding
[1.76] [3.46] [3.60] [3.28] [1.55] [2.94] [3.93] [2.66] [3.39] [3.77] [3.42] [2.12] [3.19] [2.44] [3.02] [4.91] [5.01] [2.74]
0 2 4 6 8 Fold of enrichment and the [-log10 (p-value)]
Pathway enrichment
B
Pyrimidine metabolism
[2.07]
Propanoate metabolism
[2.76]
Pyruvate metabolism
[2.69]
Glyoxylate and dicarboxylate metabolism
[2.10]
Carbon metabolism
[8.89]
Pentose phosphate pathway
[1.57]
Citrate cycle (TCA cycle)
[3.63]
Carbon fixation pathways in prokaryotes
[3.95]
RNA degradation
[1.53] [1.53]
PPAR signaling pathway
0 2 4 6 8 Fold of enrichment and the [-log10 (p-value)]
C
Domain enrichment
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Journal of Proteome Research
ClpP/crotonase-like domain
[1.43]
Rossmann-like alpha/beta/alpha sandwich fold
[1.95]
Pre-ATP-grasp domain
[1.39]
Ribosomal protein S5 domain 2-type fold
[1.95]
ATP-grasp fold, subdomain 2
[2.07]
Aldehyde dehydrogenase, cysteine active site
[1.61]
ATP-grasp fold, subdomain 1
[2.21]
ATP-grasp fold
[2.21]
0 2 4 6 8 Fold of enrichment and the [-log10 (p-value)]
ACS Paragon Plus Environment
Journal of Proteome Research
Figure 4 B
A
Motif Logo 4 Motif Logo 4
Motif Logo 2
Motif Logo 3 Motif Logo 3
D
Motif Logo 2
C
Motif Logo 1
-10 -9 -8 -7 -6 -5 -4 -3 -2 -1 +1+2+3 +4+5 +6 +7+8+9+10 A C D E F G H I K L M N P Q R S T V W Y
Motif Logo 1
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Page 46 of 45
Motif
1
**********KY*********
16
3.98
2
**********KF*********
12
3.60
3
**********KH*********
8
3.50
4
*********DK**********
6
2.28
Z-score (log10 [Fisher p-value]) 1
0
-1
Z-score (log10 [Fisher p-value]) 1
0
Motif Fold score increase
Logo
-1
ACS Paragon Plus Environment