RNA-binding proteomics reveals MATR3 ... - ACS Publications

Dec 5, 2018 - However, the interacting RBPs and biological functions associated with SNHG1 in neuroblastoma remains unknown. In this study, we identif...
2 downloads 0 Views 8MB Size
Subscriber access provided by Stockholm University Library

Article

RNA-binding proteomics reveals MATR3 interacting with lncRNA SNHG1 to enhance neuroblastoma progression Tz-Wen Yang, Divya Sahu, Yi-Wen Chang, Chia-Lang Hsu, Chiao-Hui Hsieh, Hsuan-Cheng Huang, and Hsueh-Fen Juan J. Proteome Res., Just Accepted Manuscript • DOI: 10.1021/acs.jproteome.8b00693 • Publication Date (Web): 05 Dec 2018 Downloaded from http://pubs.acs.org on December 6, 2018

Just Accepted “Just Accepted” manuscripts have been peer-reviewed and accepted for publication. They are posted online prior to technical editing, formatting for publication and author proofing. The American Chemical Society provides “Just Accepted” as a service to the research community to expedite the dissemination of scientific material as soon as possible after acceptance. “Just Accepted” manuscripts appear in full in PDF format accompanied by an HTML abstract. “Just Accepted” manuscripts have been fully peer reviewed, but should not be considered the official version of record. They are citable by the Digital Object Identifier (DOI®). “Just Accepted” is an optional service offered to authors. Therefore, the “Just Accepted” Web site may not include all articles that will be published in the journal. After a manuscript is technically edited and formatted, it will be removed from the “Just Accepted” Web site and published as an ASAP article. Note that technical editing may introduce minor changes to the manuscript text and/or graphics which could affect content, and all legal disclaimers and ethical guidelines that apply to the journal pertain. ACS cannot be held responsible for errors or consequences arising from the use of information contained in these “Just Accepted” manuscripts.

is published by the American Chemical Society. 1155 Sixteenth Street N.W., Washington, DC 20036 Published by American Chemical Society. Copyright © American Chemical Society. However, no copyright claim is made to original U.S. Government works, or works produced by employees of any Commonwealth realm Crown government in the course of their duties.

Page 1 of 36 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Journal of Proteome Research

RNA-binding proteomics reveals MATR3 interacting with lncRNA SNHG1 to enhance neuroblastoma progression Tz-Wen Yang1#, Divya Sahu2#, Yi-Wen Chang3#, Chia-Lang Hsu3,4, Chiao-Hui Hsieh1, HsuanCheng Huang2*, Hsueh-Fen Juan1,3* 1

Institute of Molecular and Cellular Biology, National Taiwan University, Taipei 106, Taiwan

2

Institute of Biomedical Informatics, National Yang-Ming University, Taipei 112, Taiwan

3

Department of Life Science, National Taiwan University, Taipei 106, Taiwan

4

Department of Medical Research, National Taiwan University Hospital, Taipei 100, Taiwan

#These authors contributed equally to this work

Correspondence to: Hsuan-Cheng Huang, Ph.D. Institute of Biomedical Informatics, National Yang-Ming University, No. 155, Sec. 2, Linong St., 112 Taipei, Taiwan. Tel: +886-2-28267357; E-mail: [email protected]

Hsueh-Fen Juan, Ph.D. Department of Life Science, Institute of Molecular and Cellular Biology, National Taiwan University, No. 1, Sec. 4, Roosevelt Rd., 106 Taipei, Taiwan. Tel: +886-2-3366-4536; Fax: +886-2-23673374; E-mail: [email protected]

1 ACS Paragon Plus Environment

Journal of Proteome Research 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Abstract The interaction of long noncoding RNAs (lncRNAs) with one or more RNA-binding proteins (RBPs) are important to a plethora of cellular and physiological processes. The lncRNA SNHG1 was reported to be aberrantly expressed and associated with poor patient prognosis in several cancers, including neuroblastoma. However, the interacting RBPs and biological functions associated with SNHG1 in neuroblastoma remains unknown. In this study, we identified 283, 31, and 164 SNHG1-interacting proteins in SK-N-BE(2)C, SK-N-DZ and SK-N-AS neuroblastoma cells, respectively, using a RNA-protein pull-down assay coupled with liquid chromatographytandem mass spectrometry (LC-MS/MS). Twenty-four SNHG1-interacting RBPs were identified in common from these three neuroblastoma cell lines. RBPs MATR3, YBX1 and HNRNPL have the binding sites for SNHG1 predicted by DeepBind motif analysis. Furthermore, the direct binding of MATR3 with SNHG1 was validated by western blot and confirmed by RNA immunoprecipitation assay (RIP). Co-expression analysis revealed that the expression of SNHG1 is positively correlated with MATR3 (P = 3.402E-13). The high expression of MATR3 is associated with poor event-free survival (P = 0.00711) and overall survival (P = 0.00064). Biological functions such as ribonucleoprotein complex biogenesis, RNA processing, and RNA splicing are significantly enriched and in common between SNHG1 and MATR3. In conclusion, we identified MATR3 as binding to SNHG1 and the interaction might be involved in splicing events that enhance neuroblastoma progression. Keywords RNA-binding proteomics, Neuroblastoma, lncRNA-SNHG1, RNA-protein pull down assay, LCMS/MS, RNA immunoprecipitation, MATR3

2 ACS Paragon Plus Environment

Page 2 of 36

Page 3 of 36 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Journal of Proteome Research

1. Introduction Neuroblastoma is the most common childhood malignant cancer of the developing sympathetic nervous system (1, 2). It accounts for approximately 6% of all childhood cancers (3) and more than 90% of the cases are diagnosed by five year of age (4). The tumor displays wide range of enigmatic behavior. It regresses spontaneously in infants but undergo persistent proliferation in children older than 1 year of age (1, 2). Amplification of v-myc avian myelocytomatosis viral oncogene neuroblastoma derived homolog (MYCN) gene serves as potent oncogenic driver of neuroblastoma (5). It is located in short arm of chromosome 2 (2p24) (6, 7), contributes to approximately 25% of the neuroblastoma cases, and associated with poor patient outcome (4, 8). Risk factors including patient age at diagnosis, MYCN amplification status, tumor stage, chromosomal aberrations, and tumor histology are used in clinical practice for evaluation of biological behavior of the tumor, risk assessment of the disease and determination of appropriate treatments (9-11). Yet, five-year survival chance for neuroblastoma patient after the disease relapse remains disappointingly low (12). Long noncoding RNAs (lncRNAs) are pivotal regulatory RNAs of the mammalian noncoding transcriptome (13, 14). They are predominantly characterized by having length greater than 200 nucleotides, transcribed by RNA polymerase II, polyadenylated and typically located in either nucleus or cytoplasm (15, 16). LncRNAs are involved in a plethora of cellular and physiological processes. They are associated in posttranscriptional regulation, splicing of the multiple exons into mature transcripts, differentiation, chromatin modification, dosage compensation and proteins synthesis (17-21). Most of these biological functions require interaction with one or more protein partners known as RNA-binding proteins (RBPs) (22). They bind with target RNA in sequence specific manner through their unique RNA binding domain (23). 3 ACS Paragon Plus Environment

Journal of Proteome Research 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

LncRNAs are significantly dysregulated in various types of cancers, including neuroblastoma (24-30). For instance, the widely studied lncRNA SNHG1 was significantly up-regulated in nonsmall cell lung cancer, both in cell lines and tissues (31). It promotes tumor cell proliferation via interaction with miR-101-3p regulating expression of SOX9 activating Wnt/β-catenin signaling pathway (31). SNHG1 was significantly up-regulated in intervertebral disk degeneration tissues leading to nucleus pulposus cell proliferation via inhibiting miR-326 expression and promoting CCND1 expression (32). SNHG1 was significantly up-regulated in neuroblastoma and associated with poor patient survival (33). However, biological functions of lncRNA SNHG1 in neuroblastoma remains unknown. Herein, using a RNA-protein pull down assay coupled with LCMS/MS, we identified 283, 31 and 164 SNHG1-interacting proteins in three neuroblastoma cell lines, SK-N-BE(2)C, SK-N-DZ and SK-N-AS, respectively. We further found the RBP Matrin 3 (MATR3) to be associated with SNHG1, confirmed by western blot and RNA immunoprecipitation (RIP). The subsequent gene set enrichment analysis of SNHG1 and MATR3 revealed ribonucleoprotein complex biogenesis, RNA processing, and RNA splicing to be significantly enriched in common as potential prospective biological functions.

2. Experimental Section 2.1. Cell lines and cell culture Human neuroblastoma cell lines SK-N-DZ and SK-N-BE(2)C were obtained from American Type Tissue Collection (ATCC, USA). SK-N-AS cell lines were kindly provided by Dr. Yung-Feng Liao (Institute of Cellular and Organismic Biology, Academia Sinica, Taiwan). All neuroblastoma cell lines were cultured in Dulbecco’s Modified Eagle’s Medium (DMEM) (Gibco, USA)

4 ACS Paragon Plus Environment

Page 4 of 36

Page 5 of 36 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Journal of Proteome Research

supplemented with 10% fetal bovine serum (FBS) (Gibco, USA), and incubated at 37oC in an atmosphere with 5% CO2. 2.2. Construction and in vitro transcription assay The SP6 promoter region was first introduced onto 5’-end of SNHG1 by primers as follows: SNHG1-forward (cloning)-XhoI: 5’-CCG-CTC-GAG-ATT-TAG-GTG-ACA-CTA-TAG-AAGTTC-TCA-TTT-TTC-TAC-TGC-TCG-3’, SNHG1-reverse (cloning)-NotI: 5’-ATA-GTT-TAGCGG-CCG-CTT-TTT-TTT-TTT-TTT-TTA-TGT-AAT-CAA-TCA-TTT-TAT-3’.

Next,

the

amplified PCR product was further constructed into the pBlueScript SK (+) vector at XhoI and NotI sites. After resultant SNHG1 plasmid was linearized by NotI, in vitro transcription was then implemented using MEGAscript® SP6 Transcription Kit (Thermo Scientific, USA) following the manufacturer’s protocol. RNase inhibitors provided by MEGAscript® SP6 Transcription Kit were used through all procedures to avoid RNA degradation. Finally, DNA template was removed by TURBO DNaseI and synthetic SNHG1 was precipitated with LiCl and chilled on -20°C for 1 hour. RNA pellet was dissolved with RNase-free water and the concentration was determined by NanoDrop ND-1000 (NanoDrop Technologies, USA). 2.3. RNA-protein pull down assay Pull-down assay was carried out with Pierce Magnetic RNA-Protein Pull-Down Kit (Thermo Fisher Scientific, USA). First, the 3’-end of in vitro transcripted SNHG1 was biotinylated using Pierce RNA 3’ End Desthiobiotinylation Kit (Thermo Fisher Scientific, USA) following manufacturer’s protocol and then incubated with Streptavidin magnetic beads at room temperature for 30 minutes. Meanwhile, cell lysate was harvested by Pierce IP Lysis Buffer (Pierce, USA) with protease inhibitor cocktail (Bioshop, Canada), tyrosine phosphatase inhibitor cocktail and serine/threonine phosphatase inhibitor cocktail (Bionovas, USA) at 4°C. Next, the biotin-labeled 5 ACS Paragon Plus Environment

Journal of Proteome Research 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

SNHG1-beads complex were further mixed well with cell lysate at 4°C with rotation for 60 minutes. After beads collection and washing, eluted protein concentration was measured with Pierce BCA Protein Assay Kit (Thermo Fisher Scientific, USA). The precipitated SNHG1interacting proteins were then analyzed by LC-MS/MS or western blot. Streptavidin magnetic beads were served as negative controls. 2.4. Sample preparation for proteome Whole SNHG1-interacting proteins were dissolved with 50 mM triethylammonium bicarbonate (TEABC, Sigma-Aldrich) (pH 8.5). Then, proteins were reduced by 5 mM Tris (2-carboxyethyl) phosphine hydrochloride (TCEP) for 30 minutes at 37oC and were alkylated by using 2 mM SMethyl methanethiosulfonate (MMTS) for 30 minutes at room temperature in the dark. We performed gel-assisted protein digestion to obtain peptides. Acrylamide/bisacrylamide (40%, v/v, 37.5:1), 10% APS (w/v) and TEMED were mixed with the protein solution by vortex for few seconds and spin down soon (the ratio of protein solution: acrylamide/bisacrylamide: APS: TEMED = 14:5:0.3:0.3, v/v) until the solution polymerized into a gel. The gels were excised into small pieces and washed continuously with 25 mM TEABC and 25 mM TEABC/50% (v/v) acetonitrile (ACN, Thermo Fisher Scientific) until no bubbles were visible. The gel pieces were further dehydrated with 100% ACN dried completely with a centrifugal evaporator (miVac Duo Concentrator; UK; Eppendorf concentrator 5301). 25 mM TEABC was added to rehydrate the gel and proteins were further digested by trypsin (protein: trypsin = 10:1, w/w) (Thermo Fisher Scientific, USA) at 37oC overnight. After in-gel digestion, peptides were extracted from the gel with 0.1% (v/v) Trifluoroacetic acid (TFA), 50% (v/v) ACN /0.1% (v/v) TFA, and 100% ACN sequentially. The extracted peptide solution was dried with a sample concentrator (miVac Duo

6 ACS Paragon Plus Environment

Page 6 of 36

Page 7 of 36 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Journal of Proteome Research

Concentrator; Genevac, Ipswich, UK; Eppendorf concentrator 5301) and then subjected to desalting by using ZipTip® Pipette Tips (Millipore, USA). 2.5. Peptide desalting by ZipTip pipette tips Dried peptides were resolved by 20 μl of 0.1% Pierce™ Trifluoroacetic Acid (TFA) (Thermo Fisher Scientific, Waltham, MA, USA) and added 10 % TFA to adjust the pH to pH 2-3. The ZipTip was wetted by 50% (v/v) acetonitrile (ACN) / 0.1% TFA for 5 cycles first and followed by equilibrated in 0.1% TFA with 10 cycles. Aspirated and dispensed the sample peptides with ZipTip for 20 cycles by pipetting slowly to let peptides bind to ZipTip. We washed the ZipTip with 0.1% TFA for 10 cycles. At the final stage, peptides were eluted with 20 μl of 50% (v/v) acetonitrile (ACN)/0.1% TFA for 10 cycles by aspirating and dispensing for 10 cycles to new tubes. The eluted solution was dried with a centrifugal evaporator (CVE-2000, Eyela, Tokyo, Japan). 2.6. NanoLC-MS/MS analysis NanoLC-MS/MS analysis was performed on a nanoACQUITY UPLC system (Waters, Milford, MA) connected to an LTQ-Orbitrap XL hybrid mass spectrometer (Thermo Electron, Bremen, Germany) equipped with a nanospray interface (Proxeon, Odense, Denmark). Peptide samples were loaded onto a 2 cm × 180  µm capillary trap column and then separated in a 75 µm × 25 cm nanoACQUITY 1.7 µm BEH C18 column (Waters, Milford, MA) at a flow rate of 300 nL/minute. Mobile phase A consisted of 0.1% formic acid, and B consisted of 0.1% formic acid and 80% ACN. A linear gradient of 10% to 40% B in 90 minutes and 40% to 85% B in 10 minutes was employed throughout this study. Mass spectra from survey full scans were acquired on the Orbitrap (m/z 350–1500). The resolution was set to 60,000 at m/z 400 and the automatic gain control (AGC) was set to 1 × 106 ions. The m/z values triggering MS/MS were put on an exclusion list for 90

7 ACS Paragon Plus Environment

Journal of Proteome Research 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

seconds. The top ten most-intense precursor ions were selected from the MS scan for subsequent collision-induced dissociation MS/MS scan by ion trap (AGC target at 7000). 2.7. Proteome data analysis Raw MS spectral data was processed for peak detection and identification by using MaxQuant software version 1.5.2.8 (34). Peptide identification was performed by using the Andromeda search engine (35) against the Reviewed Swiss-Prot human database version canonical (Published February, 2017) (36). Search criteria used in this study were trypsin specificity, fixed modification of carbamidomethyl (C), variable modifications of carbamidomethyl (C), oxidation (M), carbamyl (N-term) and deamidation (NQ), and allowed for up to two missed cleavages. The following parameters were chose: the minimal peptide length is 7; the precursor mass tolerance is 3 ppm; and the fragment ion tolerance is 0.5 Da. By using a decoy database strategy, peptide identification was accepted based on the posterior error probability with a false discovery rate of 1%. The leading proteins were selected for further bioinformatics analysis. 2.8. Bioinformatics analysis The comparison of proteins list among biological duplicates and negative control were performed by Venny (37). The annotations of cellular location of each protein were obtained from UniProt (36). We considered five major cellular compartments, including nucleus, cytoplasm, mitochondrion, endoplasmic reticulum and golgi apparatus. The proteins not located in these cellular compartment were assigned as “other”. The over-representation analysis was performed by ConsensusPathDB (38) with gene sets of GO molecular function. We only considered the GO molecular function terms at level 4 and 5 and q-value of 0.05 as the cutoff to determine the overrepresented GO terms. For motif analysis, DeepBind tool (39) were used to scan the whole SNHG1 sequence with the window size of 20 nucleotides. 8 ACS Paragon Plus Environment

Page 8 of 36

Page 9 of 36 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Journal of Proteome Research

2.9. Western blot The protein samples were denatured at 95 °C for 10 minutes, and separated by 10% SDS-PAGE. The linear protein in SDS-PAGE gel were transferred onto a polyvinylidene fluoride (PVDF) membrane (Millipore, USA) and blocked the membrane in 5% skim milk/TBST at room temperature for 1 hour. The membrane was incubated overnight with the primary antibodies of interest in 5% skim milk/TBST at 4°C. After washing three times in 10 minutes, the membrane was incubated with HRP-conjugated secondary antibody at room temperature for two hours. Finally, the bands of membrane were detected by FluorChem M (proteinsimple, San Jose, California). All of the primary antibodies, anti-MATR3 (GTX115291), anti-DDX5 (GTX100234) and anti-YBX1 (GTX131630), were purchased from Genetex (GeneTex International Corporation, Hsinchu City, Taiwan). The secondary antibodies conjugated with HRP were purchase from Sigma-Aldrich (Taipei, Taiwan). 2.10. RNA immunoprecipitation The procedure was followed by the manuscript protocol of Magna RIPTM RNA-Binding Protein Immunoprecipitation kit (Millipore, Bedford, MA, USA). The antibody of MATR3 for immunoprecipitation was obtained from Abcam (Cambridge, MA, USA). Briefly, the precipitated RNAs were reverse transcribed using RevertAid H Minus First Strand cDNA Synthase Kit (Thermo Fisher Scientific, Waltham, MA, USA) and RT-PCR analyses were further carried out to detect SNHG1 by using primers as follows: SNHG1-F: 5’-ACAGCAGTTGAGGGTTTGCT-3’ and SNHG1-R: 5’-ACAGTGCCTGAGTTTGGGTT-3’. The PCR products were resolved by 1.7 % agarose gel electrophoresis for 110V/30 minutes. 2.11. Survival analysis

9 ACS Paragon Plus Environment

Journal of Proteome Research 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

The Kaplan-Meier event-free survival (EFS) and overall survival (OS) analysis were performed for the neuroblastoma patients with Gene Expression Omnibus accession number GSE62564 (40) using the survival R package (41). The patients were divided into two groups, classified on the basis of median gene expression value. The significance of the survival curve was evaluated using log-rank (Mantel-Haenszel) test. 2.12. Gene Set Enrichment Analysis We calculated Spearman correlation coefficients (SCC) between the protein-coding gene (PCG) MATR3 and whole-genome protein-coding genes (19,199 PCGs) from the neuroblastoma RNAseq cohort. Gene Set Enrichment Analysis (GSEA v2.2.3, Broad Institute) (42) was performed by the stand-alone tool using MSigDB C5.bp.v6.1.symbols.gmt gene set collection, ranked list as MATR3-correlated PCGs and their corresponding SCC, maximum gene set size of 5000, minimum gene set size of 15, 1000 permutations, and weighted enrichment statistics. Overrepresented gene sets with false discovery rate (FDR) q value 0.05 were filtered. Similarly, GSEA was also performed for lncRNA SNHG1.

3. Results 3.1. Identification of the SNHG1 interacting proteins from human neuroblastoma cells LncRNAs play important regulatory functions via interacting with their protein partners. In order to specifically identify SNHG1-interacting proteins in neuroblastoma, we performed lncRNAprotein interactomic profiling (Figure 1A). First, we cloned SNHG1 sequence into pBlueScript SK+ vector containing SP6 RNA polymerase binding site (Figure 1B). Next, the circular plasmid was linearized using NotI restriction enzyme and followed by in vitro transcription. In vitro 10 ACS Paragon Plus Environment

Page 10 of 36

Page 11 of 36 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Journal of Proteome Research

synthetic SNHG1 transcripts were further labeled by biotin at 3’-end and incubated with whole cell lysate freshly extracted from two MYCN-amplified (SK-N-BE(2)C and SK-N-DZ) and one MYCN-nonamplified (SK-N-AS) neuroblastoma cell lines. Biotin labeled SNHG1-protein complex were precipitated by using streptavidin agarose beads. After RNA pull-down assay, potential SNHG1-interacting proteins were identified by LC-MS/MS and subsequently analyzed by MaxQuant software. To avoid non-specific interacting proteins, beads alone were performed as negative control. 3.2. SNHG1 interacting proteins partners are localized in nucleus and cytoplasm To obtain the true positive SNHG1-interacting proteins, we only considered the proteins which were commonly identified in two biological replicates but not in negative control samples. A total of 283, 31 and 164 SNHG1-interacting proteins in SK-N-BE(2)C, SK-N-DZ and SK-N-AS respectively, were identified (Table S1-3 and Figure 2A). Most of SNHG1-interacting proteins are located in nucleus and cytoplasm (Figure 2B), consistent with the previous report that SNHG1 localizes not only nucleus but also cytoplasm (43). Moreover, GO enrichment analysis revealed that the terms related to nucleotide binding were statistically significantly enriched in all the neuroblastoma cell lines (Figure 2C). These evidence further confirm that the proteins identified by lncRNA pull down combined with proteomics are potential SNHG1-interacting partners. 3.3. SNHG1 involved in various biological pathways via interaction with diverse regulators Among the SNHG1-interacting proteins, 24 were shared in the three neuroblastoma cell lines, SKN-BE(2)C, SK-N-DZ and SK-N-AS (Figure 3A). These 24 SNHG1-interacting proteins were divided into four groups by their biological functions, including ribosome-related, hnRNP-related, splicing-related and others (Figure 3B). 60S ribosomal protein L22 (RPL22), 60S ribosomal

11 ACS Paragon Plus Environment

Journal of Proteome Research 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

protein L6 (RPL6), 60S ribosomal protein L3 (RPL3) and 60S ribosomal protein L19 (RPL19) are involved in ribosome-related functions. Heterogeneous nuclear ribonucleoprotein M (HNRNPM), Heterogeneous nuclear ribonucleoprotein L (HNRNPL), heterogeneous nuclear ribonucleoprotein A0 (HNRNPA0), heterogeneous nuclear ribonucleoprotein A/B (HNRNPAB), heterogeneous nuclear ribonucleoprotein D-like (HNRPNDL) and heterogeneous nuclear ribonucleoprotein Q (SYNCRIP) belong to hnRNP-related functions. Probable ATP-dependent RNA helicase DDX5 (DDX5), nuclease-sensitive element-binding protein 1 (YBX1), non-POU domain containing octamer binding protein (NONO), small nuclear ribonucleoprotein Sm D1 (SNRPD1), small nuclear ribonucleoprotein Sm D3 (SNRPD3) and Aly/REF export factor 2 (ALYREF) are important players in splicing. Matrin 3 (MATR3), Ras GTPase-activating protein-binding protein 1 (G3BP1), interferon-induced protein with tetratricopeptide repeats 5 (IFIT5), histone H1x (H1FX), leucine-rich PPR motif-containing protein (LRPPRC), interleukin enhancer-binding factor 3 (ILF3), ribonuclease inhibitor (RNH1) and KH domain-containing, RNA-binding, signal transduction-associated protein 1 (KHDRBS1) have the other functions not described above “ribosome-related, hnRNP-related, splicing-related”. 3.4. SNHG1 recognized by regulators through binding motif To elucidate the SNHG1 directly interacting proteins, we searched predicted binding motif from the DeepBind tool for all the 24 proteins. Three proteins, nuclease-sensitive element-binding protein 1 (YBX1), heterogeneous nuclear ribonucleoprotein L (HNRNPL) and Matrin 3 (MATR3) were recognized (Figure 4). These three proteins were recognized as the highest score of the twenty nucleotides on SNHG1. We also checked the MS/MS spectrum peaks and coverage percentage for each of these three proteins (Figure S1) and selected them for further experimental validation. 3.5. Direct binding of MATR3 with SNHG1 12 ACS Paragon Plus Environment

Page 12 of 36

Page 13 of 36 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Journal of Proteome Research

To validate the binding association of YBX1, HNRNPL, and MATR3 with SNHG1, the SNHG1pull down proteins were further detected by western blot. The results revealed only MATR3 to be associated with SNHG1 in all the three neuroblastoma cell lines (Figure 5A). To cross validate this direct binding, we performed RNA immunoprecipitation assay (RIP). The results depicted a high affinity binding of MATR3 with SNHG1 in MYCN amplified neuroblastoma cell lines (Figure 5B). Moreover, we also found a statistical significant positive correlation between SNHG1 and MATR3 from a RNA-seq neuroblastoma cohort (n = 493) (Figure 5C). Interestingly, the protein level of MATR3 was upregulated in SK-N-AS, which expressed less SNHG1 transcript, compared to SK-N-DZ and SK-N-BE(2)C (Figure 5A, left panel). These results indicate that SNHG1 is able to directly interact with MATR3 and possibly regulates MATR3 protein stability. 3.6. MATR3 is up-regulated in high-risk neuroblastoma MATR3 is found to be significantly up-regulated in MYCN amplified (Figure 6A) and high-risk (Figure 6B) neuroblastoma patients. To address the possibility that expression of MATR3 is associated with patient survival status, we performed Kaplan-Meier survival analysis on the expression value of MATR3 from the RNA-seq cohort. First, we classified the patients into lowexpression (n = 247) and high-expression (n = 246) groups using the median expression of MATR3. We observed patients in high-expression group had poorer event-free survival (P = 0.00711) and overall survival (P = 0.00064) than those in the low-expression group (Figures 6C and 6D). Biological functions regulated by SNHG1 and MATR3 in neuroblastoma LncRNAs are devoid of protein-coding capacity. Thus, we applied guilt-by-association strategy (44) and performed gene set enrichment analysis (GSEA) to explore the biological functions of lncRNA SNHG1 (Table S4). Moreover, we also performed GSEA for MATR3 (Table S5). We

13 ACS Paragon Plus Environment

Journal of Proteome Research 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

found biological functions such as ribonucleoprotein complex biogenesis, ncRNA processing, translation initiation, protein localization, and RNA splicing to be significantly enriched and in common between SNHG1 and MATR3 (Figure 7). Taken together, these results indicate that MATR3 might associate with SNHG1 and enhances neuroblastoma progression.

Discussion RBPs control the fate of various transcripts including lncRNAs at the posttranscriptional level in the cell (22, 45). By the use of SNHG1-pull down assay followed by LC-MS/MS spectrometry, we were able to identify 24 potential RBPs binding with SNHG1. Study reported that SNHG1 might associate with spliceosomes or ribosomal proteins that facilitate in its transcription or modification (46). As expected, we found the potential SNHG1-intereacting proteins were classified either into ribosome-related, spliceosome-related or hnRNP-related. Among these 24 proteins, Probable ATP-dependent RNA helicase DDX5 (DDX5), was reported to form complex with ROR t, a ligand receptor that controls the differentiation of T helper 17 lymphocytes. DDX5ROR t complex coordinates with lncRNA Rmrp and regulates tissue-specific transcription in T helper 17 lymphocytes (47). To validate the interaction of DDX5 and SNHG1, we analyzed the MS/MS spectrum, coverage percentage of DDX5 and performed western blot (Figure S1, Figure 1C and Figure 5A). The result depicted that DDX5 might interact indirectly or bind other proteins with SNHG1. DeepBind motif analysis identified only three proteins YBX1, HNRNPL, and MATR3 to have the predicted binding motifs for SNHG1. The protein, YBX1 is a highly conserved protein with cold-shock domain and involved in cellular processes such as DNA repair, proliferation,

14 ACS Paragon Plus Environment

Page 14 of 36

Page 15 of 36 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Journal of Proteome Research

transcription (48). Studies indicated that YBX1 participated in drug-resistance in human cancer cells (49), and acted as a transcription factor that regulates embryogenesis (50). YBX1 has also four RNA recognized binding motifs in arginine-rich (51). These features suggest that YBX1 might interacts with RNA through RNA binding motifs. DeepBind motif analysis identified YBX1 recognized SNHG1 through binding motif. However, western blot analysis shows no association between YBX1 and SNHG1. Thus, we hypothesize that there might be some other proteins which indirectly helps in association of YBX1 with SNHG1. The heterogeneous nuclear ribonucleoproteins (hnRNPs) comprises of RNA binding proteins, which helps in processing of heterogeneous nuclear RNAs to mature mRNAs (52). The protein, hnRNPL interacts with linc1992, and regulates TNF α expression (53). Study shows that during fasting, hnRNPL interacted with lncRNA lncLGR and suppressed the expression of hepatic glucokinase (54). Moreover, hnRNPL interacts with Sam68 and forms a SLM/Sam nuclear bodies (55). Among all the 24 candidate proteins, only MATR3 found to be associated with SNHG1, confirmed by western blot and RIP. MATR3 is a RNA and DNA binding proteins with its mutation associated to amyotrophic lateral sclerosis (ALS) (56). It is primarily located in nucleus (57) and participates in RNA splicing (58). From the Kaplan-Meier survival analysis, MATR3 is found to be associated with neuroblastoma patient survival status. The patients with high MATR3 expression had poorer event-free and overall survival than those in the low expression group. This finding suggests that MATR3 could be as a prognostic biomarker for neuroblastoma progression. Molecular functions of lncRNAs are critically based on their subcellular localization (59). We found majority of the SNHG1-binding proteins to be located in nucleus suggesting ribonucleoprotein complex biogenesis, RNA processing, and RNA splicing as its potential prospective functions. 15 ACS Paragon Plus Environment

Journal of Proteome Research 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Conclusion In conclusion, our integrative analysis revealed RBP MATR3 has the binding association with lncRNA SNHG1, confirmed by RNA pull-down and RIP assay. MATR3 expression was significantly up-regulated in MYCN amplified and high-risk neuroblastoma patients. High expression of MATR3 was associated with poor patient survival. MATR3 after binding with SNHG1 might be involved in RNA splicing and cell cycle that enhances neuroblastoma progression.

SUPPORTING INFORMATION: The following supporting information is available free of charge at ACS website http://pubs.acs.org. Figure S1. The mass spectrum of candidate proteins. Table S1. The list of identified proteins interacting with SNHG1 in SK-N-BE(2)C cells. Protein identification was performed by MaxQuant Version 1.5.2.8. Table S2. The list of identified proteins interacting with SNHG1 in SK-N-DZ cells. Protein identification was performed by MaxQuant Version 1.5.2.8. Table S3. The list of identified proteins interacting with SNHG1 in SK-N-AS cells. Protein identification was performed by MaxQuant Version 1.5.2.8. Table S4. GSEA report for SNHG1 positive correlated genes. Table S5. GSEA report for MATR3 positive correlated genes. 16 ACS Paragon Plus Environment

Page 16 of 36

Page 17 of 36 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Journal of Proteome Research

Acknowledgements This work was supported by the Ministry of Science and Technology (MOST 105-2320-B-002057-MY3 and MOST 106-2320-B-002-053-MY3) and the National Health Research Institutes (NHRI-EX107-10530PI and NHRI-EX107-10709BI) in Taiwan. We thank Dr. Yung-Feng Liao (Academia Sinica, Taiwan) for kindly providing the neuroblastoma cell lines, Dr. Jin-Der Wen (Institute of Molecular and Cellular Biology, National Taiwan University, Taiwan) for the help in SNHG1 RNA construction, and Technology Commons at the College of Life Science, National Taiwan University for technical assistance.

Author Contributions T.-W. Y., Y.-W. C. and C.-H. H. executed the experiments; D. S., Y.-W. C. and C.-L. H. analyzed the data; H.-C. H. and H.-F. J. designed and supervised all experiments; T.-W. Y., D. S., Y.-W. C., C.-L. H., H.-C. H. and H.-F. J. wrote the manuscript. All authors approved the final manuscript.

Declaration of Interests The authors declare no competing financial interests.

Abbreviations

17 ACS Paragon Plus Environment

Journal of Proteome Research 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

EFS: Event-free survival; FDR: False discovery rate; GO: Gene Ontology; GSEA: Gene set enrichment analysis; HNRNPL: Heterogeneous Nuclear Ribonucleoprotein L; LC-MS/MS: liquid chromatography-tandem mass spectrometry; LncRNAs: long noncoding RNAs; MATR3: Matrin 3; MYCN: v-myc avian myelocytomatosis viral oncogene neuroblastoma derived homolog; OS: Overall survival; PCGs: Protein-coding genes; RBPs: RNA-binding proteins; RIP: RNA immunoprecipitation; RNA-seq: RNA sequencing; SNHG1: small nucleolar RNA host gene 1; YBX1: Y-box binding protein 1

18 ACS Paragon Plus Environment

Page 18 of 36

Page 19 of 36 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Journal of Proteome Research

REFERENCES 1.

Brodeur, G. M., Neuroblastoma: biological insights into a clinical enigma. Nat Rev Cancer

2003, 3, (3), 203-16. 2.

Maris, J. M.; Hogarty, M. D.; Bagatell, R.; Cohn, S. L., Neuroblastoma. Lancet 2007, 369,

(9579), 2106-20. 3.

American Cancer Society. Cancer Facts & Figures 2018. In American Cancer Society.

4.

Huang, M.; Weiss, W. A., Neuroblastoma and MYCN. Cold Spring Harb Perspect Med

2013, 3, (10), a014415. 5.

Seeger, R. C.; Brodeur, G. M.; Sather, H.; Dalton, A.; Siegel, S. E.; Wong, K. Y.;

Hammond, D., Association of multiple copies of the N-myc oncogene with rapid progression of neuroblastomas. N Engl J Med 1985, 313, (18), 1111-6. 6.

Schwab, M.; Varmus, H. E.; Bishop, J. M.; Grzeschik, K.-H.; Naylor, S. L.; Sakaguchi, A.

Y.; Brodeur, G.; Trent, J., Chromosome localization in normal human cells and neuroblastomas of a gene related to c-myc. Nature 1984, 308, 288. 7.

Corvi, R.; Amler, L. C.; Savelyeva, L.; Gehring, M.; Schwab, M., MYCN is retained in

single copy at chromosome 2 band p23-24 during amplification in human neuroblastoma cells. Proc Natl Acad Sci U S A 1994, 91, (12), 5523-7. 8.

Brodeur, G. M.; Seeger, R. C.; Schwab, M.; Varmus, H. E.; Bishop, J. M., Amplification

of N-myc in untreated human neuroblastomas correlates with advanced disease stage. Science 1984, 224, (4653), 1121-4. 9.

Brodeur, G. M.; Pritchard, J.; Berthold, F.; Carlsen, N. L.; Castel, V.; Castelberry, R. P.;

De Bernardi, B.; Evans, A. E.; Favrot, M.; Hedborg, F.; et al., Revisions of the international criteria for neuroblastoma diagnosis, staging, and response to treatment. J Clin Oncol 1993, 11, (8), 146677. 10.

Maris, J. M.; Matthay, K. K., Molecular biology of neuroblastoma. J Clin Oncol 1999, 17,

(7), 2264-79.

19 ACS Paragon Plus Environment

Journal of Proteome Research 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

11.

Brodeur, G. M.; Maris, J. M., Neuroblastoma. In Principles and practice of pediatric

oncology, Pizzo, P.; Poplack, D., Eds. Lippincott Williams & Wilkins: 2002; pp 895-937. 12.

Shao, J.; Lu, Z.; Huang, W.; Lv, Z.; Jiang, H., A single center clinical analysis of children

with neuroblastoma. Oncol Lett 2015, 10, (4), 2311-8. 13.

Mercer, T. R.; Dinger, M. E.; Mattick, J. S., Long non-coding RNAs: insights into

functions. Nat Rev Genet 2009, 10, (3), 155-9. 14.

Iyer, M. K.; Niknafs, Y. S.; Malik, R.; Singhal, U.; Sahu, A.; Hosono, Y.; Barrette, T. R.;

Prensner, J. R.; Evans, J. R.; Zhao, S.; Poliakov, A.; Cao, X.; Dhanasekaran, S. M.; Wu, Y. M.; Robinson, D. R.; Beer, D. G.; Feng, F. Y.; Iyer, H. K.; Chinnaiyan, A. M., The landscape of long noncoding RNAs in the human transcriptome. Nat Genet 2015, 47, (3), 199-208. 15.

Kapranov, P.; Cheng, J.; Dike, S.; Nix, D. A.; Duttagupta, R.; Willingham, A. T.; Stadler,

P. F.; Hertel, J.; Hackermuller, J.; Hofacker, I. L.; Bell, I.; Cheung, E.; Drenkow, J.; Dumais, E.; Patel, S.; Helt, G.; Ganesh, M.; Ghosh, S.; Piccolboni, A.; Sementchenko, V.; Tammana, H.; Gingeras, T. R., RNA maps reveal new RNA classes and a possible function for pervasive transcription. Science 2007, 316, (5830), 1484-8. 16.

Prensner, J. R.; Chinnaiyan, A. M., The emergence of lncRNAs in cancer biology. Cancer

Discov 2011, 1, (5), 391-407. 17.

Romero-Barrios, N.; Legascue, M. F.; Benhamed, M.; Ariel, F.; Crespi, M., Splicing

regulation by long noncoding RNAs. Nucleic Acids Res 2018, 46, (5), 2169-84. 18.

Gong, C.; Li, Z.; Ramanujan, K.; Clay, I.; Zhang, Y.; Lemire-Brachat, S.; Glass, D. J., A

long non-coding RNA, LncMyoD, regulates skeletal muscle differentiation by blocking IMP2mediated mRNA translation. Dev Cell 2015, 34, (2), 181-91. 19.

Rinn, J. L.; Kertesz, M.; Wang, J. K.; Squazzo, S. L.; Xu, X.; Brugmann, S. A.; Goodnough,

L. H.; Helms, J. A.; Farnham, P. J.; Segal, E.; Chang, H. Y., Functional demarcation of active and silent chromatin domains in human HOX loci by noncoding RNAs. Cell 2007, 129, (7), 1311-23. 20.

Brown, C. J.; Ballabio, A.; Rupert, J. L.; Lafreniere, R. G.; Grompe, M.; Tonlorenzi, R.;

Willard, H. F., A gene from the region of the human X inactivation centre is expressed exclusively from the inactive X chromosome. Nature 1991, 349, 38. 20 ACS Paragon Plus Environment

Page 20 of 36

Page 21 of 36 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Journal of Proteome Research

21.

Song, Y.-x.; Sun, J.-x.; Zhao, J.-h.; Yang, Y.-c.; Shi, J.-x.; Wu, Z.-h.; Chen, X.-w.; Gao,

P.; Miao, Z.-f.; Wang, Z.-n., Non-coding RNAs participate in the regulatory network of CLDN4 via ceRNA mediated miRNA evasion. Nature Communications 2017, 8, (1), 289. 22.

Ferre, F.; Colantoni, A.; Helmer-Citterich, M., Revealing protein-lncRNA interaction.

Brief Bioinform 2016, 17, (1), 106-16. 23.

Hong, S., RNA Binding Protein as an Emerging Therapeutic Target for Cancer Prevention

and Treatment. J Cancer Prev 2017, 22, (4), 203-10. 24.

Gupta, R. A.; Shah, N.; Wang, K. C.; Kim, J.; Horlings, H. M.; Wong, D. J.; Tsai, M.-C.;

Hung, T.; Argani, P.; Rinn, J. L.; Wang, Y.; Brzoska, P.; Kong, B.; Li, R.; West, R. B.; van de Vijver, M. J.; Sukumar, S.; Chang, H. Y., Long non-coding RNA HOTAIR reprograms chromatin state to promote cancer metastasis. Nature 2010, 464, (7291), 1071-6. 25.

Kotake, Y.; Nakagawa, T.; Kitagawa, K.; Suzuki, S.; Liu, N.; Kitagawa, M.; Xiong, Y.,

Long non-coding RNA ANRIL is required for the PRC2 recruitment to and silencing of p15(INK4B) tumor suppressor gene. Oncogene 2011, 30, (16), 1956-62. 26.

Gutschner, T.; Hammerle, M.; Eissmann, M.; Hsu, J.; Kim, Y.; Hung, G.; Revenko, A.;

Arun, G.; Stentrup, M.; Gross, M.; Zornig, M.; MacLeod, A. R.; Spector, D. L.; Diederichs, S., The noncoding RNA MALAT1 is a critical regulator of the metastasis phenotype of lung cancer cells. Cancer Res 2013, 73, (3), 1180-9. 27.

Taniue, K.; Kurimoto, A.; Sugimasa, H.; Nasu, E.; Takeda, Y.; Iwasaki, K.; Nagashima,

T.; Okada-Hatakeyama, M.; Oyama, M.; Kozuka-Hata, H.; Hiyoshi, M.; Kitayama, J.; Negishi, L.; Kawasaki, Y.; Akiyama, T., Long noncoding RNA UPAT promotes colon tumorigenesis by inhibiting degradation of UHRF1. Proc Natl Acad Sci U S A 2016, 113, (5), 1273-8. 28.

Pandey, G. K.; Kanduri, C., Long noncoding RNAs and neuroblastoma. Oncotarget 2015,

6, (21), 18265-75. 29.

Pandey, G. K.; Mitra, S.; Subhash, S.; Hertwig, F.; Kanduri, M.; Mishra, K.; Fransson, S.;

Ganeshram, A.; Mondal, T.; Bandaru, S.; Ostensson, M.; Akyurek, L. M.; Abrahamsson, J.; Pfeifer, S.; Larsson, E.; Shi, L.; Peng, Z.; Fischer, M.; Martinsson, T.; Hedborg, F.; Kogner, P.; Kanduri,

21 ACS Paragon Plus Environment

Journal of Proteome Research 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

C., The risk-associated long noncoding RNA NBAT-1 controls neuroblastoma progression by regulating cell proliferation and neuronal differentiation. Cancer Cell 2014, 26, (5), 722-37. 30.

Liu, P. Y.; Erriquez, D.; Marshall, G. M.; Tee, A. E.; Polly, P.; Wong, M.; Liu, B.; Bell, J.

L.; Zhang, X. D.; Milazzo, G.; Cheung, B. B.; Fox, A.; Swarbrick, A.; Huttelmaier, S.; Kavallaris, M.; Perini, G.; Mattick, J. S.; Dinger, M. E.; Liu, T., Effects of a novel long noncoding RNA, lncUSMycN, on N-Myc expression and neuroblastoma progression. J Natl Cancer Inst 2014, 106, (7). 31.

Cui, Y.; Zhang, F.; Zhu, C.; Geng, L.; Tian, T.; Liu, H., Upregulated lncRNA SNHG1

contributes to progression of non-small cell lung cancer through inhibition of miR-101-3p and activation of Wnt/beta-catenin signaling pathway. Oncotarget 2017, 8, (11), 17785-94. 32.

Tan, H.; Zhao, L.; Song, R.; Liu, Y.; Wang, L., The long noncoding RNA SNHG1

promotes nucleus pulposus cell proliferation through regulating miR-326 and CCND1. Am J Physiol Cell Physiol 2018, 315, (1), C21-7. 33.

Sahu, D.; Hsu, C. L.; Lin, C. C.; Yang, T. W.; Hsu, W. M.; Ho, S. Y.; Juan, H. F.; Huang,

H. C., Co-expression analysis identifies long noncoding RNA SNHG1 as a novel predictor for event-free survival in neuroblastoma. Oncotarget 2016, 7, (36), 58022-37. 34.

Cox, J.; Mann, M., MaxQuant enables high peptide identification rates, individualized

p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat Biotechnol 2008, 26, (12), 1367-72. 35.

Cox, J.; Neuhauser, N.; Michalski, A.; Scheltema, R. A.; Olsen, J. V.; Mann, M.,

Andromeda: a peptide search engine integrated into the MaxQuant environment. J Proteome Res 2011, 10, (4), 1794-805. 36.

The UniProt Consortium, UniProt: the universal protein knowledgebase. Nucleic Acids

Research 2017, 45, (D1), D158-69. 37.

Oliveros, J.C. (2007-2015) Venny. An interactive tool for comparing lists with Venn's

diagrams. http://bioinfogp.cnb.csic.es/tools/venny/index.html.

22 ACS Paragon Plus Environment

Page 22 of 36

Page 23 of 36 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Journal of Proteome Research

38.

Kamburov, A.; Wierling, C.; Lehrach, H.; Herwig, R., ConsensusPathDB—a database for

integrating human functional interaction networks. Nucleic Acids Research 2009, 37, (Database issue), D623-8. 39.

Alipanahi, B.; Delong, A.; Weirauch, M. T.; Frey, B. J., Predicting the sequence

specificities of DNA- and RNA-binding proteins by deep learning. Nature Biotechnology 2015, 33, 831. 40.

Su, Z.; Fang, H.; Hong, H.; Shi, L.; Zhang, W.; Zhang, W.; Zhang, Y.; Dong, Z.; Lancashire,

L. J.; Bessarabova, M.; Yang, X.; Ning, B.; Gong, B.; Meehan, J.; Xu, J.; Ge, W.; Perkins, R.; Fischer, M.; Tong, W., An investigation of biomarkers derived from legacy microarray data for their utility in the RNA-seq era. Genome Biol 2014, 15, (12), 523. 41.

Therneau, T., A Package for Survival Analysis in S. version 2.38, https://CRAN.R-

project.org/package=survival. In 2015. 42.

Subramanian, A.; Tamayo, P.; Mootha, V. K.; Mukherjee, S.; Ebert, B. L.; Gillette, M. A.;

Paulovich, A.; Pomeroy, S. L.; Golub, T. R.; Lander, E. S.; Mesirov, J. P., Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A 2005, 102, (43), 15545-50. 43.

Carlevaro-Fita, J.; Rahim, A.; Guigó, R.; Vardy, L. A.; Johnson, R., Cytoplasmic long

noncoding RNAs are frequently bound to and degraded at ribosomes in human cells. RNA 2016, 22, (6), 867-82. 44.

Rinn, J. L.; Chang, H. Y., Genome regulation by long noncoding RNAs. Annu Rev Biochem

2012, 81, 145-66. 45.

Turner, M.; Díaz-Muñoz, M. D., RNA-binding proteins control gene expression and cell

fate in the immune system. Nature Immunology 2018, 19, (2), 120-29. 46.

Shen, Y.; Liu, S.; Fan, J.; Jin, Y.; Tian, B.; Zheng, X.; Fu, H., Nuclear retention of the

lncRNA SNHG1 by doxorubicin attenuates hnRNPC-p53 protein interactions. EMBO Rep 2017, 18, (4), 536-48. 47.

Huang, W.; Thomas, B.; Flynn, R. A.; Gavzy, S. J.; Wu, L.; Kim, S. V.; Hall, J. A.; Miraldi,

E. R.; Ng, C. P.; Rigo, F.; Meadows, S.; Montoya, N. R.; Herrera, N. G.; Domingos, A. I.; 23 ACS Paragon Plus Environment

Journal of Proteome Research 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Rastinejad, F.; Myers, R. M.; Fuller-Pace, F. V.; Bonneau, R.; Chang, H. Y.; Acuto, O.; Littman, D. R., DDX5 and its associated lncRNA Rmrp modulate TH17 cell effector functions. Nature 2015, 528, 517. 48.

Kohno, K.; Izumi, H.; Uchiumi, T.; Ashizuka, M.; Kuwano, M., The pleiotropic functions

of the Y-box-binding protein, YB-1. Bioessays 2003, 25, (7), 691-8. 49.

Basaki, Y.; Taguchi, K.; Izumi, H.; Murakami, Y.; Kubo, T.; Hosoi, F.; Watari, K.; Nakano,

K.; Kawaguchi, H.; Ohno, S.; Kohno, K.; Ono, M.; Kuwano, M., Y-box binding protein-1 (YB-1) promotes cell cycle progression through CDC6-dependent pathway in human cancer cells. Eur J Cancer 2010, 46, (5), 954-65. 50.

Lu, Z. H.; Books, J. T.; Ley, T. J., YB-1 is important for late-stage embryonic development,

optimal cellular stress responses, and the prevention of premature senescence. Mol Cell Biol 2005, 25, (11), 4625-37. 51.

Burd, C. G.; Dreyfuss, G., Conserved structures and diversity of functions of RNA-binding

proteins. Science 1994, 265, (5172), 615-21. 52.

Geuens, T.; Bouhy, D.; Timmerman, V., The hnRNP family: insights into their role in

health and disease. Hum Genet 2016, 135, (8), 851-67. 53.

Li, Z.; Chao, T. C.; Chang, K. Y.; Lin, N.; Patil, V. S.; Shimizu, C.; Head, S. R.; Burns, J.

C.; Rana, T. M., The long noncoding RNA THRIL regulates TNFalpha expression through its interaction with hnRNPL. Proc Natl Acad Sci U S A 2014, 111, (3), 1002-7. 54.

Ruan, X.; Li, P.; Cangelosi, A.; Yang, L.; Cao, H., A Long Non-coding RNA, lncLGR,

Regulates Hepatic Glucokinase Expression and Glycogen Storage during Fasting. Cell Rep 2016, 14, (8), 1867-75. 55.

Rajan, P.; Dalgliesh, C.; Bourgeois, C. F.; Heiner, M.; Emami, K.; Clark, E. L.; Bindereif,

A.; Stevenin, J.; Robson, C. N.; Leung, H. Y.; Elliott, D. J., Proteomic identification of heterogeneous nuclear ribonucleoprotein L as a novel component of SLM/Sam68 Nuclear Bodies. BMC Cell Biol 2009, 10, 82. 56.

Johnson, J. O.; Pioro, E. P.; Boehringer, A.; Chia, R.; Feit, H.; Renton, A. E.; Pliner, H. A.;

Abramzon, Y.; Marangi, G.; Winborn, B. J.; Gibbs, J. R.; Nalls, M. A.; Morgan, S.; Shoai, M.; 24 ACS Paragon Plus Environment

Page 24 of 36

Page 25 of 36 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Journal of Proteome Research

Hardy, J.; Pittman, A.; Orrell, R. W.; Malaspina, A.; Sidle, K. C.; Fratta, P.; Harms, M. B.; Baloh, R. H.; Pestronk, A.; Weihl, C. C.; Rogaeva, E.; Zinman, L.; Drory, V. E.; Borghero, G.; Mora, G.; Calvo, A.; Rothstein, J. D.; Drepper, C.; Sendtner, M.; Singleton, A. B.; Taylor, J. P.; Cookson, M. R.; Restagno, G.; Sabatelli, M.; Bowser, R.; Chio, A.; Traynor, B. J., Mutations in the Matrin 3 gene cause familial amyotrophic lateral sclerosis. Nat Neurosci 2014, 17, (5), 664-6. 57.

Gallego-Iradi, M. C.; Clare, A. M.; Brown, H. H.; Janus, C.; Lewis, J.; Borchelt, D. R.,

Subcellular Localization of Matrin 3 Containing Mutations Associated with ALS and Distal Myopathy. PLoS One 2015, 10, (11), e0142144. 58.

Coelho, M. B.; Attig, J.; Bellora, N.; Konig, J.; Hallegger, M.; Kayikci, M.; Eyras, E.; Ule,

J.; Smith, C. W., Nuclear matrix protein Matrin3 regulates alternative splicing and forms overlapping regulatory networks with PTB. Embo j 2015, 34, (5), 653-68. 59.

Chen, L. L., Linking Long Noncoding RNA Localization and Function. Trends Biochem

Sci 2016, 41, (9), 761-72.

25 ACS Paragon Plus Environment

Journal of Proteome Research 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Figure Legends Figure 1: Identification of lncRNA SNHG1 interacting proteins by RNA-protein pull down assay and proteomics approaches. (A) Work flow of the SNHG1-protein interactomic profiling. (B) The map of constructed plasmid for in vitro transcription, which containing full length of SNHG1 sequence along with SP6 RNA polymerase binding site. (C) Schematic diagram of whole experiment procedures. First, the constructed SNHG1-plasmid was digested to linear form for in vitro transcription. Then, biotin was labeled onto SNHG1, and proteins interacting with this biotinlabeled-SNHG1 were precipitated by RNA pull-down assay. LC-MS/MS and MaxQuant were performed to identify SNHG1-interacting proteins. Experiments were performed in biological duplicates. Figure 2: SNHG1-interacting protein in neuroblastoma cell lines. (A) Venn-diagram shows the number of interacted proteins with SNHG1 in three neuroblastoma cell lines (MYCN amplified: SK-N-BE(2)C and SK-N-DZ and MYCN nonamplified: SK-N-AS). SNHG1 was RNA pull-down with biotin-labeled SNHG1 and Empty was beads alone severed as negative control. Rep1 and Rep2 represented the first and second biological replicates. (B) Pie chart shows cellular distribution of the SNHG1-interacting proteins in the neuroblastoma cell lines, respectively. (C) The point plot shows molecular functions of the SNHG1-interacting proteins in three neuroblastoma cell lines. The size and color of each of the circle denotes ratio of matched genes to the query gene set and P-value of the molecular function, respectively. Figure 3: Comparison of potential SNHG1-interacting proteins in three neuroblastoma cell lines. (A) Venn diagram shows overlapping of SNHG1-interacting proteins in three neuroblastoma cell lines, SK-N-BE(2)C and SK-N-DZ and SK-N-AS. (B) Classification of the 24 common

26 ACS Paragon Plus Environment

Page 26 of 36

Page 27 of 36 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Journal of Proteome Research

SNHG1-interacting proteins into 4 groups based on their biological processes. The protein information is described in main text. Figure 4: The predicted binding motif of interacting proteins on SNHG1 sequence. (A-D) The highest peak of each candidate protein shows predicted binding motif on SNHG1 sequence. The highest score of each peak comprises twenty nucleotides of SNHG1. The IDs of the three proteins MATR3 (D00119.001), YBX1 (D00163.001 and D00163.002) and HNRNPL (D00110.001) were obtained from DeepBind tool. Figure 5: MATR3 interacts with SNHG1. (A) Interaction between MATR3 and SNHG1 is confirmed using SNHG1 pull-down combining western blotting. Input mean cell lysate without RNA precipitation process. Each cell line had beads only as negative control of RNA pull-down assay. Experiments were performed in biological triplicates. (B) The binding of MATR3 with SNHG1 is validated using RNA immunoprecipitation in two neuroblastoma cell lines. PC was RTPCR positive control which used pBlueScript SK (+)-SNHG1 plasmid as the template. Experiments were performed in biological duplicates. (C) Scatter plot shows the positive correlation between SNHG1 and MATR3 expression levels in neuroblastoma patients from RNAseq cohort (n = 493). Figure 6: MATR3 predicts poor clinical outcome in neuroblastoma patients. (A-B) Boxplot shows the normalized log2RPM expression value of MATR3 in risk status (A) and MYCN status (B) in a RNA-seq neuroblastoma cohort (n = 493). (C-D) Kaplan-Meier plots shows event-free survival (C) and overall survival (D) of low expression and high expression groups based on median expression value of MATR3 in neuroblastoma patient (n = 493). The P-values were obtained from log-rank (Mantel-Haenszel) test.

27 ACS Paragon Plus Environment

Journal of Proteome Research 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Figure 7: Gene set enrichment analysis of MATR3 and SNHG1. (A) Bar chart shows top overrepresented gene sets common between MATR3 and SNHG1 analyzed by GSEA. Colors represent normalized enrichment score (NES) of MATR3 and SNHG1. (B) Enrichment plot for the gene set of ribonucleoprotein complex biogenesis, ncRNA processing and RNA splicing. These biological processes enriched for MATR3 and SNHG1 correlated genes.

28 ACS Paragon Plus Environment

Page 28 of 36

Page 29 of 36 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Journal of Proteome Research

Figure 1

ACS Paragon Plus Environment

Journal of Proteome Research 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 30 of 36

Figure 2 a

SK-N-BE(2)C

SK-N-DZ

SK-N-AS

b

SK-N-BE(2)C

SK-N-DZ

SK-N-AS

c

ACS Paragon Plus Environment

Page 31 of 36 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Journal of Proteome Research

Figure 3 a

b SK-N-AS

Ribosome-related

SK-N-BE(2)C RPL22 RPL6 RPL3 RPL19

MATR3 G3BP1 IFIT5 H1FX

SK-N-DZ

LRPPRC ILF3 RNH1 KHDRBS1

Others

hnRNP-related HNRNPM HNRNPL HNRNPA0 HNRNPAB HNRNPDL SYNCRIP

DDX5 YBX1 NONO

SNRPD1 SNRPD3 ALYREF

Splicing-related

ACS Paragon Plus Environment

Journal of Proteome Research 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Figure 4

ACS Paragon Plus Environment

Page 32 of 36

Page 33 of 36 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Journal of Proteome Research

Figure 5

ACS Paragon Plus Environment

Journal of Proteome Research 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Figure 6

ACS Paragon Plus Environment

Page 34 of 36

Page 35 of 36 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Journal of Proteome Research

Figure 7

ACS Paragon Plus Environment

Journal of Proteome Research 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

For TOC Only

ACS Paragon Plus Environment

Page 36 of 36