
Protein Methyltransferases: A Distinct, Diverse, and Dynamic Family of Enzymes


Current Topic/Perspective

Protein methyltransferases: A distinct, diverse and dynamic family of enzymes P. Ann Boriack-Sjodin, and Kerren K Swinger Biochemistry, Just Accepted Manuscript • DOI: 10.1021/acs.biochem.5b01129 • Publication Date (Web): 10 Dec 2015 Downloaded from http://pubs.acs.org on December 15, 2015


Biochemistry is published by the American Chemical Society. 1155 Sixteenth Street N.W., Washington, DC 20036 Published by American Chemical Society. Copyright © American Chemical Society. However, no copyright claim is made to original U.S. Government works, or works produced by employees of any Commonwealth realm Crown government in the course of their duties.


Biochemistry

Protein methyltransferases: A distinct, diverse and dynamic family of enzymes
P. Ann Boriack-Sjodin and Kerren K. Swinger
Epizyme, Inc., 400 Technology Square, Cambridge, MA 02139
Address correspondence to P. Ann Boriack-Sjodin. Email: [email protected]

Abstract Methyltransferase proteins are a superfamily of enzymes that add one or more methyl groups to substrates that include protein, DNA, RNA and small molecules. The subset of proteins that act upon arginine and lysine side chains are characterized as epigenetic targets due to their activity on histone molecules and their ability to affect transcriptional regulation. However, it is now clear that these enzymes target other protein substrates as well, greatly expanding their potential impact on normal and disease biology. Protein methyltransferases are well characterized structurally. In addition to revealing the overall architecture of the subfamilies of enzymes, structures of complexes with substrates and ligands have enabled detailed analysis of biochemical mechanism, substrate recognition, and design of potent and selective inhibitors. This review focuses on how knowledge gained from structural studies has impacted the understanding of this large class of epigenetic enzymes.

Introduction Since their discovery less than two decades ago, methyltransferases have developed into one of the largest classes of epigenetic enzymes and continue to be an active area of research for industry and academia alike. More than half of the known epigenetic enzymes add or remove methyl groups from protein, DNA, or RNA substrates, resulting in changes in transcriptional


regulation (1). The largest enzyme class is the writers that act upon protein substrates, the protein methyltransferases (PMTs). Although these enzymes were initially labeled as histone methyltransferases, there is a growing understanding that histone molecules are only one of many potential protein targets within the cell (2). Ironically, this knowledge greatly expands, yet also complicates, the biology of protein methylation. It is known that dysregulation of epigenetic targets can lead to pathological changes that result in human disease (3); hence, protein methyltransferases are active targets for therapeutic intervention. Inhibitors for two PMTs, DOT1L and EZH2, have reached the clinic, and efforts by pharmaceutical and biotechnology companies are reported for several other family members. Therefore, understanding the structure and function of these enzymes may have implications for drug discovery efforts as well as expand knowledge of the biology of this enzyme class. Protein methyltransferases can be broadly differentiated based on the target amino acid of the methylation reaction. This review focuses on the lysine and arginine methyltransferases. Phylogenic trees for these PMTs have been constructed (4-6), dividing the methyltransferases into smaller subfamilies based on chemogenic analysis, and this review will feature several of these subfamilies in detail (Figure 1). Although the catalytic domains of lysine or arginine PMTs share common sequence and structural features, there is significant diversity in sequence and quaternary structure among the enzymes. Some protein methyltransferases exist as globular proteins, such as the SMYD (SET and MYND domain containing) family (7), while other enzymes contain multiple, distinct domains. 
For example, in addition to the methyltransferase catalytic domain, NSD (nuclear receptor SET domain-containing) family member WHSC1L1 contains PHD (plant homeodomain) fingers, PWWP (Pro-Trp-Trp-Pro motif) domains, and an NSD specific C5HCH domain (a cysteine-rich domain) (8), and structures have been solved for


the catalytic domain (Tempel et al; PDB ID 4YZ8), a PWWP domain (Qin I., PDB ID 4RXJ) and the tandem PHD5-C5HCH domains (9). Although many proteins are active as individual full-length or truncated enzymes, PMTs are often found in multi-protein complexes and a few, such as EZH2 (10), are only functional within these larger units. Knowledge of the environment in which a PMT is found in vivo is important when analyzing structural and functional data obtained in vitro, particularly for truncated enzymes. Structural biology efforts have been extremely productive for this class of enzymes, as structures of PMT catalytic domains are known for more than half of PMT family members (6). A subset of enzymes has been crystallized in ternary complexes containing both peptide and nucleotide substrates or analogs, providing a basis for understanding substrate specificity. Additionally, there is active research to develop tool compounds to understand the complex biology of PMTs (11, 12), and structural biology is playing a key role in developing many of these potent and selective reagents. In this review, structural biology efforts for several subfamilies of PMTs will be examined detailing the diversity within the enzyme family, highlighting the role protein dynamics plays for many of these targets, and specifying the continuing role structure will play in understanding the biochemistry and biology of these complex enzymes.

Domain structure of PMTs With one exception, lysine methyltransferases (PKMTs) contain a catalytic SET (Su(var), E(z) and Trithorax) domain defined by specific amino acid motifs (ELxF/YDY and RFINHxCxPN where x is any amino acid) and a pseudo-knot structure (13) (Figure 2A). Although the SET domain contains all residues important for catalysis, I-SET (Immunoglobulin-SET) and post-SET domains are also found in all SET-containing protein methyltransferases. These additional domains contribute to the peptide binding pocket and in many cases to the S-adenosylmethionine (SAM) binding site as well (13). DOT1L is a structurally unique lysine methyltransferase (Figure 2B), as described below. Arginine methyltransferases (PRMTs) are structurally distinct from SET domain-containing PKMTs. The catalytic core of these proteins contains three domains (Figure 2C). The first is an MTase domain (methyltransferase domain) that is similar to the Rossmann fold and includes all the residues that contribute to the SAM binding site. This domain is structurally conserved (14). The second domain is a β-barrel unique to PRMT enzymes, and the third is a dimerization domain (14). Substrate peptides bind in a pocket located at the interface between the MTase domain and the β-barrel (15-18). Although PRMTs share sequence homology with other methyltransferase enzymes (5), only arginine methyltransferase structures will be discussed.
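As an illustration of the two SET domain sequence motifs quoted above, the following is a minimal sketch of how a sequence might be scanned for them. It assumes that "x" stands for any single residue and that "F/Y" means either phenylalanine or tyrosine at that position; the function name, the dictionary, and the toy sequence are hypothetical and are not taken from the cited work. Real SET domains may deviate from the strict motifs, so a plain pattern match of this kind is only a rough first filter.

```python
import re

# Motifs quoted in the text. ASSUMPTION: "x" is any single residue and
# "F/Y" means either F or Y at that position.
SET_MOTIFS = {
    "ELxF/YDY": re.compile(r"EL.[FY]DY"),
    "RFINHxCxPN": re.compile(r"RFINH.C.PN"),
}


def find_set_motifs(sequence):
    """Return {motif name: [0-based start positions]} for one protein sequence."""
    seq = sequence.upper()
    return {
        name: [match.start() for match in pattern.finditer(seq)]
        for name, pattern in SET_MOTIFS.items()
    }


# Hypothetical toy sequence for illustration only, not a real protein.
toy = "MKTRFINHSCNPNAAGGELAFDYGK"
hits = find_set_motifs(toy)
print(hits)
```

A sequence with no match returns an empty list for each motif, which makes it easy to separate candidates carrying both motifs from those carrying one or none.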

PMT mechanism and specificity To appreciate PMT structure one must also understand the mechanism of these enzymes. All PMTs require two substrates for enzyme catalysis, SAM and the targeted residue of methylation. SAM binds to the protein and donates a methyl group to the lysine or arginine side chain of the protein substrate using an SN2-based transfer reaction, producing S-adenosylhomocysteine (SAH) in the process. PKMT enzymes bind lysine residues in a narrow hydrophobic channel within the SET domain and orient the terminal ε-amine using carbon-oxygen hydrogen bonding (19). Water molecules play a key role in the deprotonation event for PKMT enzymes (20). PRMT-substrate enzyme complexes (15-18) do not have a conserved


water molecule for deprotonation of the arginine side chain. The higher pKa of arginine may necessitate a different proton transfer mechanism (21). It is also possible that deprotonation of the guanidino moiety is not essential prior to transfer of the methyl group from SAM to substrate (22). Lysine can exist in four possible methylation states: unmodified or having one, two, or three methyl groups covalently bound (Figure 3A). Arginine can also exist in several methylation states including unmodified, monomethylated or dimethylated; dimethylated arginine residues can occur in either symmetric (1 methyl group per terminal nitrogen atom) or asymmetric (2 methyl groups on a single terminal nitrogen atom) geometry (Figure 3B). The addition of more than one methyl group to protein residues can be performed by a single enzyme or may be performed by different enzymes. For example, wild type EZH2 can produce mono-, di- and tri-methylated lysine at position 27 of histone H3 (H3K27) with differing catalytic efficiencies for each substrate, and active site mutations can alter these kinetic profiles (23). In contrast, full methylation of H3K36 requires two different sets of enzymes: mono- and dimethylation occur through the NSD family (24), while trimethylation is performed by SETD2 (25). Some methylation sites have unique methyltransferase writers, such as H3K79 methylation by DOT1L (26), while H3K9 is a substrate for no fewer than eight human methyltransferases (4), and p53 is a target for at least five lysine methyltransferases (27). Conversely, individual PMTs can have multiple protein substrates. SETD7 has at least 11 non-histone substrates characterized by in vitro or in vivo methods (27). The interplay between enzymes and substrates creates opportunities for structural biology and biochemistry efforts to develop and test hypotheses regarding substrate specificities and kinetic mechanisms.


SETD7: A model methyltransferase enzyme SETD7 was the first mammalian PKMT solved by protein crystallography (28, 29) and together with the structures of Neurospora crassa DIM-5 (30), Schizosaccharomyces pombe Clr4 (31), and garden pea Rubisco LSMT (32) defined the three-dimensional SET domain architecture and demonstrated that the SET methyltransferase fold was evolutionarily conserved across a variety of species. To date, nearly 30 structures of wild type and mutant SETD7 proteins have been deposited in public databases, more than any other individual PMT. In addition to structures of SETD7 bound to SAM and SAH, complexes with various substrate peptides or inhibitors occupying either SAM or peptide binding sites have been solved. Many of the structures solved for SETD7 were generated because the enzyme has been used as a model system to illuminate structural aspects of methyltransferase biochemistry. For example, active site mutants of SETD7 are known that alter product specificity from a monomethylase to an enzyme that can catalyze di- and trimethylation (33). Structural characterization of these mutants in complex with peptides differing in methylation states highlighted the important role active site water molecules perform in orienting the lysine ε nitrogen with increasing numbers of methyl groups prior to enzyme catalysis (34). Additional structural and biochemical studies of active site mutants of SETD7 showed that CH---O hydrogen bonds constrict the motion of the SAM methyl group and are critical to high affinity SAM binding and transition-state stabilization during catalysis (35). As stated previously, SETD7 methylates multiple substrate proteins, and several structures of SETD7-peptide complexes are known (histone H3, TAF10, p53, ERα and DNMT1). Analysis of the binding interactions led to the hypothesis of a consensus motif (27). However, more recent analysis of SETD7 substrate sequences reveals that sequences outside the consensus sequence are also substrates (36). This result


may indicate that interactions beyond the SET domain play an important role in substrate specificity (27) and provides another opportunity for structural biology efforts to contribute to the understanding of substrate recognition for this model enzyme.

Suv39/EHMT: A model methyltransferase family The Suv39/EHMT family of PKMTs includes EHMT1/2 (GLP/G9a), SUV39H1/2, SETDB1/2, and SETMAR. With the exception of SETDB1/2, the sequence of which is unique due to a large insertion domain within the SET domain, all family members have been structurally characterized (6). This group was the first PMT subfamily to have apo, peptide-bound and inhibitor-bound structures, making these enzymes an early model for understanding substrate and inhibitor specificity and enabling structure-guided design of probe molecules. H3K9 is a substrate of several PMT enzymes, including all members of the Suv39/EHMT family except SETMAR (4). Crystal structures of human EHMT1, EHMT2 and Suv39H2 were compared to each other and to other H3K9 enzymes including human PRDM2 to determine similarities between protein structures for enzymes with the same substrate (4). The comparisons showed conservation in I-SET conformation and variability in the post-SET domain dependent on the presence or absence of the substrate peptide, indicating a pre-formed platform may be necessary for substrate binding but that protein dynamics were also important to binding and recognition (4). However, sequence differences in the substrate ablated binding for one methyltransferase while only mildly affecting another, indicating selectivity mechanisms may not be conserved even for enzymes with the same substrate (4). Structural biology has played a pivotal role in the development of tool compounds for the EHMT enzymes. BIX-01294 (37) was the first inhibitor of a PMT molecule whose structure was


documented in the literature (38). Comparison of the GLP-BIX01294 structure to that of GLP bound to an H3K9 peptide (4) showed the compound bound in the peptide binding site and revealed plasticity in the side chain conformation of Arg1214 dependent on the identity of the ligand. The structure also revealed the importance of the αZ helix to the specificity of the compound for EHMT1 and EHMT2 over other PMT enzymes tested. Interestingly, BIX-01294 did not engage the lysine channel, and subsequent work within this scaffold showed targeting this pocket, in addition to other interactions, resulted in dramatic increases in potency (39, 40). Additional efforts, often guided by structure-based design, have resulted in numerous scaffolds suitable as in vitro and in vivo probe molecules (39, 41-44) and enabled testing of biological hypotheses for these enzymes.

NSD and related sub-family members: A dynamic class of enzymes The NSD family of enzymes contains three members: NSD1, WHSC1 (NSD2/MMSET) and WHSC1L1 (NSD3). Ranging in size from 1365 amino acids (WHSC1) to 2696 amino acids (NSD1), all family members contain a SET domain near the C-terminus followed by a PHD domain, as well as three additional PHD domains and two PWWP domains (25). Each of these proteins may play an important role in select cancers. WHSC1 has been shown to be a driver in multiple myeloma carrying the t(4;14) chromosomal translocation, while NUP98-NSD1 and NUP98-WHSC1L1 fusion proteins have been described in acute myeloid leukemias (45). Overexpression of these proteins has also been seen in different cancer cell lines. Therefore, there is much interest in both academia and industry in understanding the structure and catalytic mechanism of this family of enzymes.


To date, NSD1 (Figure 4A) and WHSC1L1 (Figure 4B) each have only a single SAM-bound structure of the SET domain in the public domain, while the WHSC1 SET domain remains structurally uncharacterized. However, ASH1L and SETD2 are closely related proteins on the phylogenic trees (Figure 1), and additional structural and biochemical studies on these proteins have been recently documented. Examination of these proteins provides key insights into the role protein dynamics may have in enzymatic turnover within the NSD family. The structures of NSD1 (46), SETD2 (47), and ASH1L (48) reveal that the post-SET loops of these enzymes occupy an auto-inhibitory conformation which sterically blocks substrate from accessing the lysine channel (Figure 4C). However, additional structures of SETD2 and ASH1L reveal this same loop can adopt a number of conformations under different conditions. A structure of SETD2 bound to SFG-Pr, a nucleotide analog with an extended group in the lysine pocket, shows a flipped conformation of the loop and an open conformation of the protein that may be accessible to peptide substrates (46) (Figure 4C). Select point mutants in the auto-inhibitory loop of ASH1L had varying effects on catalytic activity of the enzyme, and several different conformations of this loop and the I-SET loop were documented (49). Both studies postulated that the conformation of the loop is important for substrate recognition and acts as a regulatory feature beyond a loop-open or loop-closed binary gating mechanism. Taken together, these structures reveal the auto-inhibitory loop to be a conformationally dynamic region of this family of proteins that is critical to the catalytic mechanism. This flexibility may be of importance for those interested in drug discovery efforts. Interestingly, the recent structure of the SET domain of WHSC1L1 (Tempel et al; PDB ID 4YZ8) does not include electron density for residues 1264-1268 (Figure 4A), presumably because these residues exist in multiple conformations or are disordered. As this is the only available structure for the SET domain of


WHSC1L1, additional studies will be needed to determine whether this flexibility is an inherent feature of the protein or a consequence of the truncated protein construct that was characterized crystallographically.

SMYD family: PMTs with deep pockets The SMYD proteins comprise a family of five soluble enzymes that are known to target histone and non-histone substrates (7). All SMYD family members contain a MYND domain, a zinc finger motif known to mediate protein-protein interactions with proline-rich sequences (50). Crystal structures of the full-length proteins are known for three of the five family members: SMYD1 (51), SMYD2 (52-54), and SMYD3 (55-57) (Lam et al, PDB ID 3MEK). These molecules share a bilobal architecture; the N-terminal lobe contains the SET, I-SET, post-SET and MYND domains, while the α-helical C-terminal domain shows similarities to TPR domains despite disparate sequences (7). The most structurally dissimilar molecules in the family are as yet unsolved. SMYD4 contains additional TPR domains at the N-terminus and is nearly twice the size of the other SMYD molecules, while SMYD5 has a unique C-terminal sequence that is unrelated to the known C-terminal domains (7). The interface of the N- and C-terminal lobes forms large, deep binding sites for protein substrates. Structural diversity between the molecules is defined by both the sequence and orientation of the C-terminal lobes, resulting in different surface topologies for the SMYD1, 2, and 3 peptide binding sites (Figure 5A-C). In all structures, the nucleotide is bound in the N-terminal domain, with conserved residues forming a narrow channel for the lysine substrate before opening up to large, solvent-filled regions (7). Structures of p53 (52, 54) and ERα (58) peptides have been solved in complex with SMYD2. Both peptides follow a similar U-shaped


conformation and are superimposable by the substrate lysine and surrounding residues, but diverge in position and interactions at residues further from the site of methylation. Comparison of the similarities and differences in recognition motifs for different substrates will provide a greater understanding of substrate specificity of the SMYD enzymes. Therefore, additional structures of SMYD2-peptide complexes and substrate complexes for other SMYD enzymes are desired. SMYD2 and SMYD3 methylate targets known to be important in cancer biology, potentially affecting signal transduction pathways. SMYD3 methylates MAP3K2 and has been shown to promote Ras-mediated cancer progression (59). The tumor suppressor p53 is a substrate for SMYD2, and methylation by SMYD2 was shown to repress its function (60). Additionally, overexpression of SMYD2 and 3 has been documented in several cancers (reviewed in (7)). Therefore, there is significant interest in the development of compounds to specifically inhibit each SMYD enzyme. Although the peptide binding sites are relatively large, potent small molecule inhibitors have been found for both SMYD2 and SMYD3. Two distinct chemical series, defined by the chemical moiety found in the lysine channel, have been published for SMYD2: a benzooxazinone series (AZ505 (52), A-893 (61)), and a pyrrolidine series (LLY507 (62)). Despite differences in chemical structure, both series bind to SMYD2 in a similar fashion, engaging the lysine channel and clasping the surface of the protein while engaging two hydrophobic pockets. In contrast, the potent SMYD3 inhibitor, EPZ030456, exits the lysine channel and extends directly into the solvent-filled cavity of the enzyme (63). The structures show that at least two distinct binding modes can lead to potent and selective inhibition of the SMYD family (Figure 5D).


PRDMs: Evolutionary cousins of SET-domain containing enzymes The PRDM family is defined by the presence of an N-terminal PR (PRDI-BF1 and RIZ1 homology) domain. Despite low (20-30%) sequence identity, PR domains are structurally related to SET domains and may be evolutionarily derived from these enzymes (64). Interestingly, although PR domains lack one of the key sequence motifs (NHxC) that define the SET domain (65) and are important to catalytic function (66), select members of the family (PRDM2/RIZ1, PRDM8, PRDM9) have nevertheless been shown to methylate histones (67-71). Studies with other PRDM enzymes have been more enigmatic. PRDM3 and PRDM16 were reported to have methyltransferase activity (68), but subsequent work with a PR domain mutant of PRDM16 also showed methyltransferase activity, potentially due to association of PRDM16 with EHMT1 (72). The reasons why some PRDM family members are not active methyltransferases are currently not known, but catalytically inactive family members can contribute to remodeling of chromatin structure through the recruitment of modifying enzymes or complexes, including SET domain containing PMTs (65, 73). The PR domain of PRDM2 was the first to be structurally characterized and was solved by both NMR and X-ray crystallography (4, 74). The crystal structures, solved in the absence of peptide or nucleotide substrates, confirmed that the PR domain retained the predicted SET domain fold (Figure 6A), while the NMR structure showed high mobility in the post-SET domain, a sequence involved in substrate binding. Additionally, high dissociation constants were found for the H3 peptide (0.7 mM) and product SAH (>10 mM) for this catalytically active PRDM (74). To date, several crystal structures of PR domains of PRDM proteins have been solved, but only one, murine PRDM9, has been solved with substrates bound (71). Despite the lack of conserved sequences, the SAH molecule is bound to the enzyme in a similar way in


PRDM9 as in SET domain proteins. However, sequence differences between PRDM molecules with catalytic activity indicate the binding mode of PRDM9 with nucleotide may not be conserved within the PRDM family (71). Comparison of substrate-bound and apo forms of PRDM9 also indicates that the post-SET domain undergoes conformational changes in the absence of substrates and adopts an auto-inhibited conformation (Figure 6B). Thus, while PRDM enzymes exhibit many of the same characteristics seen in their SET domain containing relatives, additional structural studies of catalytically active PRDM molecules with substrates bound may be beneficial in understanding the structure-activity relationships within this enigmatic subfamily of PMT enzymes.

DOT1L: A unique lysine methyltransferase As previously noted, DOT1L is a structurally unique enzyme within the PKMT superfamily. Although it is active against H3K79 and is therefore a lysine methyltransferase, early sequence analysis showed characteristics similar to those of the arginine methyltransferases (75) rather than SET domain-containing molecules. The structure of the catalytic domain was first solved in 2003. The structure revealed an elongated molecule with two domains distinct from SET-containing proteins and confirmed similarity to previously solved non-SET domain containing methyltransferases (76). The SAM binding site of DOT1L is contained in the C-terminal α/β domain (Figure 3B, 7A). In subsequent comparisons, the conformation of SAM in DOT1L overlaid well with that in PRMTs and was dissimilar to SAM conformations in SET domain containing proteins (5). Recent structural analyses have revealed that DOT1L is a dynamic enzyme that accesses a number of conformational states. When bound to iodotubercidin, a small nucleotide analog


with µM potency, the activation loop adopts an open conformation such that the adenine ring is more solvent exposed than it is in the SAM-bound conformation (77) (Figure 7B). Crystal structures of DOT1L bound to several SAM-competitive inhibitors (77-81) together with hydrogen/deuterium exchange experiments (77) indicate that loops in and near the SAM binding site adopt several different discrete conformations or can be structurally disordered when bound to different chemical classes. Some potent and selective inhibitors of DOT1L induce a pocket within the enzyme that is not present in the SAM, SAH or iodotubercidin structures (Figure 7CD). Significant conformational changes to the substrate-binding loop and movements of side chains occur when these inhibitors bind, burying a hydrophobic portion of the compounds within the protein core (77-79). One of these compounds, pinometostat (EPZ-5676), has been extensively characterized biochemically and structurally. Pinometostat has a Ki of 80 pM and a residence time on DOT1L protein of more than 24 hours (79). The conformational changes that are required for this inhibitor to bind likely contribute to its potency, selectivity, and residence time (82). It is possible that the protein dynamics seen in DOT1L structures are important for substrate recognition and binding or enzyme turnover. For example, DOT1L is only active in vitro on nucleosomes and ubiquitinylation of H2B has been demonstrated to stimulate activity (83). Characterization of DOT1L substrate complexes would help to elucidate the structural underpinnings of substrate-based effects on activity. This question will require additional experimental efforts, as no structure of a DOT1L-substrate complex is currently available.
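The pinometostat values quoted above (a Ki of 80 pM and a residence time of more than 24 hours) can be turned into rough bounds on the underlying rate constants. The sketch below is illustrative arithmetic only. It uses the conventional definition of residence time as 1/k_off, and it assumes simple one-step binding in which Ki approximates Kd = k_off/k_on. Because the text indicates that conformational changes are required for this inhibitor to bind, the one-step model is a simplification and the apparent k_on should be read as an order-of-magnitude estimate; the variable names are hypothetical.

```python
# Values quoted in the text for pinometostat binding to DOT1L.
KI_MOLAR = 80e-12              # Ki of 80 pM
RESIDENCE_TIME_S = 24 * 3600   # "more than 24 hours", used as a lower bound

# Residence time is conventionally defined as 1 / k_off, so a lower bound on
# residence time gives an upper bound on the dissociation rate constant.
k_off_max = 1.0 / RESIDENCE_TIME_S   # units: s^-1

# ASSUMPTION: simple one-step binding with Ki ~ Kd = k_off / k_on.
# Under that assumption the apparent association rate constant is bounded by:
k_on_max = k_off_max / KI_MOLAR      # units: M^-1 s^-1

print(f"k_off < {k_off_max:.2e} s^-1")
print(f"apparent k_on < {k_on_max:.2e} M^-1 s^-1")
```

Under these assumptions the dissociation rate constant is below roughly 1.2 x 10^-5 s^-1 and the apparent association rate constant is below roughly 1.4 x 10^5 M^-1 s^-1, which is slow relative to diffusion-limited binding and is in line with the suggestion that a conformational change accompanies binding.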

Protein arginine methyltransferases (PRMTs): A subfamily with similarity and diversity


Mammalian PRMT enzymes are divided into sub-classes based on the type of methylation performed on the terminal nitrogen atoms of the arginine residue: monomethylation, symmetric dimethylation or asymmetric dimethylation (84). Type I enzymes (PRMT1, 2, 3, 6, 8 and 4 (also known as CARM1)) produce asymmetrically dimethylated arginine after first producing the monomethylated product. Type II enzymes (PRMT5, 7) produce symmetrically dimethylated arginine residues after production of the monomethylated state (Figure 2B). The first structure of SAH-bound PRMT3, solved in 2000 (85), documented the domain structure common to PRMT enzymes (Figure 3C), identified the arginine binding site, and enabled postulation of a catalytic mechanism for the PRMT family. To date, several structures of mammalian PRMTs have been solved (Figure 8), and the knowledge gained from these structures offers insights into this subfamily of PMTs beyond the gross architecture of the catalytic core. CARM1 structures solved with and without nucleotide show that structural rearrangement occurs upon binding of the substrate (86, 87) and highlight the role protein dynamics likely play in substrate turnover. In addition, domains external to the catalytic domain can contribute to the various protein-protein interactions important for activity and biology. Two examples are the structure of PRMT5 bound to MEP50 (16, 88) and the structure of an isolated construct of the N-terminus of CARM1, which revealed a PH domain architecture (87). The structures of C. elegans (89) and murine (90) PRMT7 revealed a tandem repeat of the PRMT core, likely achieved by gene duplication, although only one of the domains had catalytic activity. Interestingly, the structure of PRMT7 from a more primitive organism, Trypanosoma brucei, did not show this gene duplication (17).

The structure of PRMT1 was the first arginine methyltransferase structure to provide evidence of the substrate binding mode. Although structural rearrangement was observed in the PRMT1 binding site, an arginine residue was seen engaging in the substrate pocket (15). In addition, evidence of multiple binding sites for the protein substrate external to the arginine channel was visible, though multiple binding modes obscured any analysis of substrate recognition motifs. The structure of PRMT5-MEP50 complexed with a histone H4-derived peptide first detailed the specific interactions made between the peptide substrate and the PRMT enzyme beyond the arginine pocket (16). More recent structures of human CARM1 (18) and of Trypanosoma brucei PRMT7 (17) with peptide substrates have underscored the engagement of backbone rather than side chain interactions beyond the arginine residue, which may be thematic for enzymes with multiple protein substrates. Additionally, these structures show the absence of large conformational changes upon peptide binding, in contrast to select PKMT enzymes.

As was seen in SET domain-containing enzymes, structure has played a significant role in the development of inhibitors for the PRMT enzymes. Type I enzymes are structurally similar, and inhibitors that target multiple PRMTs have been documented (91, 92). Even so, selectivity is possible within the family, as was shown with both CARM1-selective (Figure 8B) and PRMT6-selective compounds (Figure 8D) (91, 93). A recent potent and selective in vivo tool compound for PRMT5 (Figure 8C) was published, revealing a π-cation interaction between the tetrahydroisoquinoline headgroup and the SAM substrate that is unique among PRMT inhibitors to date (94). Selective inhibitors for PRMT3 have been achieved through an allosteric mechanism (95, 96). In this case, the compounds bind the dimer interface (Figure 8E), changing the conformation of helix αY and causing disorder in helix αX, reminiscent of the disorder seen in holo CARM1 structures (95). Recent structure-based optimization of this inhibitor class has resulted in a potent and selective probe compound suitable for in vivo experiments (97). The ability to achieve potent and selective inhibitors through different mechanisms of inhibition is exciting for the epigenetics field and provides needed tool compounds to elucidate the complex biology of these enzymes.

Structurally challenging PMTs: EZH2 and SETDB1

Despite marked success by structural biologists with many PMT families, some of the most interesting PMTs are not structurally enabled. As noted earlier, SETDB1, a member of the SUV39/EHMT subfamily, is expected to be a structurally distinct enzyme within the PKMT class. It contains a unique 347 amino acid sequence located within the SET domain, known as the bifurcation domain (98). Modeling of the position of the domain based on family member SETMAR indicates this domain may engage with the region of the protein involved in SAM binding (Boriack-Sjodin and Swinger, unpublished results), resulting in a SET domain architecture that could be unique among PMTs. Knowledge of how the bifurcation domain affects the structure of the SET domain and of the SAM or peptide binding sites may provide insight into this unique SET domain structure and its impact on enzyme activity and substrate specificity.

EZH2 is a PKMT that is only active within a multiprotein complex. In vivo, EZH2 is found within the four-component Polycomb Repressive Complex 2 (PRC2; ~230 kDa), although only three proteins, EZH2, SUZ12 and EED, are required for catalytic activity (99). In 2012, a structure of PRC2 containing EZH2, SUZ12, EED and RbAp48 was solved by electron microscopy and known structures of the individual components were placed into the reconstruction (100), but the resolution limited detailed analysis of the structure of EZH2.

Crystal structures of truncated constructs of EZH2 have also been solved; however, the nucleotide active site conformations were incompatible with SAM binding and do not represent an active protein conformation (101, 102). Recent crystal structures of a minimal, catalytically active 170 kDa PRC2 from the thermophilic yeast Chaetomium thermophilum, with and without histone peptide bound, have provided the field with a detailed look at the extensive interactions between EZH2, SUZ12 and EED within the complex (103). The structures have also revealed the importance of the SET activation loop (SAL) and its interactions with other EZH2 domains, as well as with SUZ12 and EED, in maintaining the EZH2 SET domain in an active conformation. These interactions may explain the lack of activity from the truncated constructs. Analysis of the structure has provided potential insights into direct and allosteric regulation of EZH2 activity and has provided a scaffold on which to base future structural and mechanistic studies (103).

Hypermethylation of the EZH2 substrate, H3K27, has been linked to cancer (104), making EZH2 a target for drug discovery efforts (105-109). Although potent, SAM-competitive inhibitors of EZH2 have entered the clinic, outstanding mechanistic questions remain to be understood from a structural perspective. For example, EZH2 inhibitors show SAM-competitive kinetic profiles, but recent data showed that mutations outside the SET domain confer resistance to EZH2 inhibitors (110). The sequence of Chaetomium thermophilum EZH2 is quite divergent from that of human EZH2, with only 28% similarity and several sequence insertions and deletions (103); it is therefore not known whether the Chaetomium thermophilum complex can be used to investigate the mechanism of inhibition of these compounds. If the compounds are not cross-reactive with Chaetomium thermophilum PRC2, a structure of a potent EZH2 inhibitor bound to human or other mammalian EZH2 protein or the PRC2 complex would be required for elucidation of the inhibitory mechanism. This example highlights the potential utility of structural biology at every stage of target validation.

Other methyltransferase enzymes

Thus far, only enzymes that transfer methyl groups to protein residues have been discussed in detail. However, structural biology efforts are also active for other methyltransferase enzymes. DNA methyltransferases (DNMTs) are a small family of enzymes that methylate CpG islands. Hypermethylation of these groups has been shown to reduce tumor suppression (111), and nucleoside inhibitors that target DNMTs, azacitidine and decitabine, are in clinical use. Structures of the methyltransferase domains of four of the five DNMTs are known (6), including complexes with DNA (112, 113) and regulatory enzymes (114), providing a wealth of structural data for this family. In contrast, RNA methyltransferases belong to a much larger enzyme family with 57 members, of which fewer than a third have been structurally characterized (6). RNA methyltransferases therefore provide new opportunities and challenges for structural biology researchers. Additional methyltransferase enzymes methylate small molecules, including catechol and nicotinamide; structural biology efforts have been fruitful for these enzymes as well. For example, more than 40 structures of catechol-O-methyltransferase are available in the public domain. These have provided valuable insights into the catalytic mechanisms of these enzymes and have contributed to the design of potent and selective inhibitors (115).

Summary

Structural biology has had tremendous impact on the study of protein methyltransferases. It has highlighted the similarity and diversity of the PMT enzyme families, provided insights on enzyme mechanism and substrate specificity, and guided efforts towards the design of selective and potent tool compounds to probe biology in normal cells and disease models. These efforts will continue, particularly for methyltransferases implicated in human disease. Going forward, structural characterization of methyltransferase enzymes with domains external to the catalytic domain, or in multiprotein complexes with and without substrates bound, will provide even more insight into the role of these additional domains in substrate recognition, binding, and regulation. Structural studies of larger proteins and multiprotein complexes will also increase our understanding of the environment of these catalytic domains in cellular settings. These larger complexes will likely be challenging and may require engagement of multiple structural methods in concert. Once coupled with information from biochemistry and cell biology, these more complicated structural studies will provide information critical to understanding these distinct, diverse and dynamic enzymes and their impact on human biology.

Acknowledgements

The authors wish to thank R.A. Copeland, S. Ribich, A. Finley and W. Janzen for their critical reading of the manuscript and colleagues at Epizyme for helpful discussions.

Funding Sources

All efforts were funded by Epizyme.

References

1. Kouzarides, T. (2007) Chromatin modifications and their function, Cell 128, 693-705.
2. Huang, J., and Berger, S. L. (2008) The emerging field of dynamic lysine methylation of non-histone proteins, Curr Opin Genet Dev 18, 152-158.
3. Arrowsmith, C. H., Bountra, C., Fish, P. V., Lee, K., and Schapira, M. (2012) Epigenetic protein families: a new frontier for drug discovery, Nat Rev Drug Discov 11, 384-400.

4. Wu, H., Min, J., Lunin, V. V., Antoshenko, T., Dombrovski, L., Zeng, H., Allali-Hassani, A., Campagna-Slater, V., Vedadi, M., Arrowsmith, C. H., Plotnikov, A. N., and Schapira, M. (2010) Structural biology of human H3K9 methyltransferases, PLoS One 5, e8570.
5. Richon, V. M., Johnston, D., Sneeringer, C. J., Jin, L., Majer, C. R., Elliston, K., Jerva, L. F., Scott, M. P., and Copeland, R. A. (2011) Chemogenetic analysis of human protein methyltransferases, Chem Biol Drug Des 78, 199-210.
6. Liu, L., Zhen, X. T., Denton, E., Marsden, B. D., and Schapira, M. (2012) ChromoHub: a data hub for navigators of chromatin-mediated signalling, Bioinformatics 28, 2205-2206.
7. Spellmon, N., Holcomb, J., Trescott, L., Sirinupong, N., and Yang, Z. (2015) Structure and function of SET and MYND domain-containing proteins, Int J Mol Sci 16, 1406-1428.
8. Angrand, P. O., Apiou, F., Stewart, A. F., Dutrillaux, B., Losson, R., and Chambon, P. (2001) NSD3, a new SET domain-containing gene, maps to 8p12 and is amplified in human breast cancer cell lines, Genomics 74, 79-88.
9. He, C., Li, F., Zhang, J., Wu, J., and Shi, Y. (2013) The methyltransferase NSD3 has chromatin-binding motifs, PHD5-C5HCH, that are distinct from other NSD (nuclear receptor SET domain) family members in their histone H3 recognition, The Journal of biological chemistry 288, 4692-4703.
10. Kuzmichev, A., Nishioka, K., Erdjument-Bromage, H., Tempst, P., and Reinberg, D. (2002) Histone methyltransferase activity associated with a human multiprotein complex containing the Enhancer of Zeste protein, Genes Dev 16, 2893-2905.
11. Finley, A., and Copeland, R. A. (2014) Small molecule control of chromatin remodeling, Chemistry & biology 21, 1196-1210.
12. Kaniskan, H. U., and Jin, J. (2015) Chemical probes of histone lysine methyltransferases, ACS chemical biology 10, 40-50.
13. Schapira, M. (2011) Structural Chemistry of Human SET Domain Protein Methyltransferases, Curr Chem Genomics 5, 85-94.
14. Cheng, X., Collins, R. E., and Zhang, X. (2005) Structural and sequence motifs of protein (histone) methylation enzymes, Annu Rev Biophys Biomol Struct 34, 267-294.
15. Zhang, X., and Cheng, X. (2003) Structure of the predominant protein arginine methyltransferase PRMT1 and analysis of its binding to substrate peptides, Structure 11, 509-520.
16. Antonysamy, S., Bonday, Z., Campbell, R. M., Doyle, B., Druzina, Z., Gheyi, T., Han, B., Jungheim, L. N., Qian, Y., Rauch, C., Russell, M., Sauder, J. M., Wasserman, S. R., Weichert, K., Willard, F. S., Zhang, A., and Emtage, S. (2012) Crystal structure of the human PRMT5:MEP50 complex, Proc Natl Acad Sci U S A 109, 17960-17965.
17. Wang, C., Zhu, Y., Caceres, T. B., Liu, L., Peng, J., Wang, J., Chen, J., Chen, X., Zhang, Z., Zuo, X., Gong, Q., Teng, M., Hevel, J. M., Wu, J., and Shi, Y. (2014) Structural determinants for the strict monomethylation activity by Trypanosoma brucei protein arginine methyltransferase 7, Structure 22, 756-768.
18. Boriack-Sjodin, P. A., Jin, L., Jacques, S. L., Drew, A., Sneeringer, C., Scott, M. P., Moyer, M. P., Ribich, S., Moradei, O., and Copeland, R. A. (2015) Structural Insights into Ternary Complex Formation of Human CARM1 with Various Substrates, ACS Chemical Biology, DOI: 10.1021/acschembio.5b00773.

19. Couture, J. F., Hauk, G., Thompson, M. J., Blackburn, G. M., and Trievel, R. C. (2006) Catalytic roles for carbon-oxygen hydrogen bonding in SET domain lysine methyltransferases, The Journal of biological chemistry 281, 19280-19287.
20. Zhang, X., and Bruice, T. C. (2008) Enzymatic mechanism and product specificity of SET-domain protein lysine methyltransferases, Proceedings of the National Academy of Sciences of the United States of America 105, 5728-5732.
21. Zhang, R., Li, X., Liang, Z., Zhu, K., Lu, J., Kong, X., Ouyang, S., Li, L., Zheng, Y. G., and Luo, C. (2013) Theoretical insights into catalytic mechanism of protein arginine methyltransferase 1, PLoS One 8, e72424.
22. Rust, H. L., Zurita-Lopez, C. I., Clarke, S., and Thompson, P. R. (2011) Mechanistic studies on transcriptional coactivator protein arginine methyltransferase 1, Biochemistry 50, 3332-3345.
23. Sneeringer, C. J., Scott, M. P., Kuntz, K. W., Knutson, S. K., Pollock, R. M., Richon, V. M., and Copeland, R. A. (2010) Coordinated activities of wild-type plus mutant EZH2 drive tumor-associated hypertrimethylation of lysine 27 on histone H3 (H3K27) in human B-cell lymphomas, Proc Natl Acad Sci U S A 107, 20980-20985.
24. Kuo, A. J., Cheung, P., Chen, K., Zee, B. M., Kioi, M., Lauring, J., Xi, Y., Park, B. H., Shi, X., Garcia, B. A., Li, W., and Gozani, O. (2011) NSD2 links dimethylation of histone H3 at lysine 36 to oncogenic programming, Molecular cell 44, 609-620.
25. Wagner, E. J., and Carpenter, P. B. (2012) Understanding the language of Lys36 methylation at histone H3, Nature reviews. Molecular cell biology 13, 115-126.
26. Nguyen, A. T., and Zhang, Y. (2011) The diverse functions of Dot1 and H3K79 methylation, Genes & development 25, 1345-1358.
27. Del Rizzo, P. A., and Trievel, R. C. (2011) Substrate and product specificities of SET domain methyltransferases, Epigenetics 6, 1059-1067.
28. Wilson, J. R., Jing, C., Walker, P. A., Martin, S. R., Howell, S. A., Blackburn, G. M., Gamblin, S. J., and Xiao, B. (2002) Crystal structure and functional analysis of the histone methyltransferase SET7/9, Cell 111, 105-115.
29. Xiao, B., Jing, C., Wilson, J. R., Walker, P. A., Vasisht, N., Kelly, G., Howell, S., Taylor, I. A., Blackburn, G. M., and Gamblin, S. J. (2003) Structure and catalytic mechanism of the human histone methyltransferase SET7/9, Nature 421, 652-656.
30. Zhang, X., Tamaru, H., Khan, S. I., Horton, J. R., Keefe, L. J., Selker, E. U., and Cheng, X. (2002) Structure of the Neurospora SET domain protein DIM-5, a histone H3 lysine methyltransferase, Cell 111, 117-127.
31. Min, J., Zhang, X., Cheng, X., Grewal, S. I., and Xu, R. M. (2002) Structure of the SET domain histone lysine methyltransferase Clr4, Nature structural biology 9, 828-832.
32. Trievel, R. C., Beach, B. M., Dirk, L. M., Houtz, R. L., and Hurley, J. H. (2002) Structure and catalytic mechanism of a SET domain protein methyltransferase, Cell 111, 91-103.
33. Collins, R. E., Tachibana, M., Tamaru, H., Smith, K. M., Jia, D., Zhang, X., Selker, E. U., Shinkai, Y., and Cheng, X. (2005) In vitro and in vivo analyses of a Phe/Tyr switch controlling product specificity of histone lysine methyltransferases, The Journal of biological chemistry 280, 5563-5570.
34. Del Rizzo, P. A., Couture, J. F., Dirk, L. M., Strunk, B. S., Roiko, M. S., Brunzelle, J. S., Houtz, R. L., and Trievel, R. C. (2010) SET7/9 catalytic mutants reveal the role of active site water molecules in lysine multiple methylation, The Journal of biological chemistry 285, 31849-31858.

35. Horowitz, S., Dirk, L. M., Yesselman, J. D., Nimtz, J. S., Adhikari, U., Mehl, R. A., Scheiner, S., Houtz, R. L., Al-Hashimi, H. M., and Trievel, R. C. (2013) Conservation and functional importance of carbon-oxygen hydrogen bonding in AdoMet-dependent methyltransferases, Journal of the American Chemical Society 135, 15536-15548.
36. Dhayalan, A., Kudithipudi, S., Rathert, P., and Jeltsch, A. (2011) Specificity analysis-based identification of new methylation targets of the SET7/9 protein lysine methyltransferase, Chemistry & biology 18, 111-120.
37. Kubicek, S., O'Sullivan, R. J., August, E. M., Hickey, E. R., Zhang, Q., Teodoro, M. L., Rea, S., Mechtler, K., Kowalski, J. A., Homon, C. A., Kelly, T. A., and Jenuwein, T. (2007) Reversal of H3K9me2 by a small-molecule inhibitor for the G9a histone methyltransferase, Molecular cell 25, 473-481.
38. Chang, Y., Zhang, X., Horton, J. R., Upadhyay, A. K., Spannhoff, A., Liu, J., Snyder, J. P., Bedford, M. T., and Cheng, X. (2009) Structural basis for G9a-like protein lysine methyltransferase inhibition by BIX-01294, Nat Struct Mol Biol 16, 312-317.
39. Liu, F., Chen, X., Allali-Hassani, A., Quinn, A. M., Wigle, T. J., Wasney, G. A., Dong, A., Senisterra, G., Chau, I., Siarheyeva, A., Norris, J. L., Kireev, D. B., Jadhav, A., Herold, J. M., Janzen, W. P., Arrowsmith, C. H., Frye, S. V., Brown, P. J., Simeonov, A., Vedadi, M., and Jin, J. (2010) Protein lysine methyltransferase G9a inhibitors: design, synthesis, and structure activity relationships of 2,4-diamino-7-aminoalkoxyquinazolines, J Med Chem 53, 5844-5857.
40. Chang, Y., Ganesh, T., Horton, J. R., Spannhoff, A., Liu, J., Sun, A., Zhang, X., Bedford, M. T., Shinkai, Y., Snyder, J. P., and Cheng, X. (2010) Adding a lysine mimic in the design of potent inhibitors of histone lysine methyltransferases, Journal of molecular biology 400, 1-7.
41. Liu, F., Chen, X., Allali-Hassani, A., Quinn, A. M., Wasney, G. A., Dong, A., Barsyte, D., Kozieradzki, I., Senisterra, G., Chau, I., Siarheyeva, A., Kireev, D. B., Jadhav, A., Herold, J. M., Frye, S. V., Arrowsmith, C. H., Brown, P. J., Simeonov, A., Vedadi, M., and Jin, J. (2009) Discovery of a 2,4-diamino-7-aminoalkoxyquinazoline as a potent and selective inhibitor of histone lysine methyltransferase G9a, J Med Chem 52, 7950-7953.
42. Vedadi, M., Barsyte-Lovejoy, D., Liu, F., Rival-Gervier, S., Allali-Hassani, A., Labrie, V., Wigle, T. J., Dimaggio, P. A., Wasney, G. A., Siarheyeva, A., Dong, A., Tempel, W., Wang, S. C., Chen, X., Chau, I., Mangano, T. J., Huang, X. P., Simpson, C. D., Pattenden, S. G., Norris, J. L., Kireev, D. B., Tripathy, A., Edwards, A., Roth, B. L., Janzen, W. P., Garcia, B. A., Petronis, A., Ellis, J., Brown, P. J., Frye, S. V., Arrowsmith, C. H., and Jin, J. (2011) A chemical probe selectively inhibits G9a and GLP methyltransferase activity in cells, Nat Chem Biol.
43. Liu, F., Barsyte-Lovejoy, D., Li, F., Xiong, Y., Korboukh, V., Huang, X. P., Allali-Hassani, A., Janzen, W. P., Roth, B. L., Frye, S. V., Arrowsmith, C. H., Brown, P. J., Vedadi, M., and Jin, J. (2013) Discovery of an in vivo chemical probe of the lysine methyltransferases G9a and GLP, J Med Chem 56, 8931-8942.
44. Sweis, R. F., Wang, Z., Algire, M., Arrowsmith, C. H., Brown, P. J., Chiang, G. G., Guo, J., Jakob, C. G., Kennedy, S., Li, F., Maag, D., Shaw, B., Soni, N. B., Vedadi, M., and Pappano, W. N. (2015) Discovery of A-893, A New Cell-Active Benzoxazinone Inhibitor of Lysine Methyltransferase SMYD2, ACS Medicinal Chemistry Letters 6, 695-700.
45. Morishita, M., and di Luccio, E. (2011) Cancers and the NSD family of histone lysine methyltransferases, Biochimica et biophysica acta 1816, 158-163.

46. Qiao, Q., Li, Y., Chen, Z., Wang, M., Reinberg, D., and Xu, R. M. (2011) The structure of NSD1 reveals an autoregulatory mechanism underlying histone H3K36 methylation, J Biol Chem 286, 8361-8368.
47. Zheng, W., Ibanez, G., Wu, H., Blum, G., Zeng, H., Dong, A., Li, F., Hajian, T., Allali-Hassani, A., Amaya, M. F., Siarheyeva, A., Yu, W., Brown, P. J., Schapira, M., Vedadi, M., Min, J., and Luo, M. (2012) Sinefungin derivatives as inhibitors and structure probes of protein lysine methyltransferase SETD2, J Am Chem Soc 134, 18004-18014.
48. An, S., Yeo, K. J., Jeon, Y. H., and Song, J. J. (2011) Crystal structure of the human histone methyltransferase ASH1L catalytic domain and its implications for the regulatory mechanism, The Journal of biological chemistry 286, 8369-8374.
49. Rogawski, D. S., Ndoj, J., Cho, H. J., Maillard, I., Grembecka, J., and Cierpicki, T. (2015) Two Loops Undergoing Concerted Dynamics Regulate the Activity of the ASH1L Histone Methyltransferase, Biochemistry 54, 5401-5413.
50. Spadaccini, R., Perrin, H., Bottomley, M. J., Ansieau, S., and Sattler, M. (2006) Structure and functional analysis of the MYND domain, Journal of molecular biology 358, 498-508.
51. Sirinupong, N., Brunzelle, J., Ye, J., Pirzada, A., Nico, L., and Yang, Z. (2010) Crystal structure of cardiac-specific histone methyltransferase SmyD1 reveals unusual active site architecture, The Journal of biological chemistry 285, 40635-40644.
52. Ferguson, A. D., Larsen, N. A., Howard, T., Pollard, H., Green, I., Grande, C., Cheung, T., Garcia-Arenas, R., Cowen, S., Wu, J., Godin, R., Chen, H., and Keen, N. (2011) Structural basis of substrate methylation and inhibition of SMYD2, Structure 19, 1262-1273.
53. Wang, L., Li, L., Zhang, H., Luo, X., Dai, J., Zhou, S., Gu, J., Zhu, J., Atadja, P., Lu, C., Li, E., and Zhao, K. (2011) Structure of human SMYD2 protein reveals the basis of p53 tumor suppressor methylation, The Journal of biological chemistry 286, 38725-38737.
54. Jiang, Y., Sirinupong, N., Brunzelle, J., and Yang, Z. (2011) Crystal structures of histone and p53 methyltransferase SmyD2 reveal a conformational flexibility of the autoinhibitory C-terminal domain, PLoS One 6, e21640.
55. Sirinupong, N., Brunzelle, J., Doko, E., and Yang, Z. (2011) Structural insights into the autoinhibition and posttranslational activation of histone methyltransferase SmyD3, Journal of molecular biology 406, 149-159.
56. Xu, S., Wu, J., Sun, B., Zhong, C., and Ding, J. (2011) Structural and biochemical studies of human lysine methyltransferase Smyd3 reveal the important functional roles of its post-SET and TPR domains and the regulation of its activity by DNA binding, Nucleic acids research 39, 4438-4449.
57. Foreman, K. W., Brown, M., Park, F., Emtage, S., Harriss, J., Das, C., Zhu, L., Crew, A., Arnold, L., Shaaban, S., and Tucker, P. (2011) Structural and functional profiling of the human histone methyltransferase SMYD3, PLoS One 6, e22290.
58. Jiang, Y., Trescott, L., Holcomb, J., Zhang, X., Brunzelle, J., Sirinupong, N., Shi, X., and Yang, Z. (2014) Structural insights into estrogen receptor alpha methylation by histone methyltransferase SMYD2, a cellular event implicated in estrogen signaling regulation, Journal of molecular biology 426, 3413-3425.
59. Mazur, P. K., Reynoird, N., Khatri, P., Jansen, P. W., Wilkinson, A. W., Liu, S., Barbash, O., Van Aller, G. S., Huddleston, M., Dhanak, D., Tummino, P. J., Kruger, R. G., Garcia, B. A., Butte, A. J., Vermeulen, M., Sage, J., and Gozani, O. (2014) SMYD3 links lysine methylation of MAP3K2 to Ras-driven cancer, Nature 510, 283-287.
60. Huang, J., Perez-Burgos, L., Placek, B. J., Sengupta, R., Richter, M., Dorsey, J. A., Kubicek, S., Opravil, S., Jenuwein, T., and Berger, S. L. (2006) Repression of p53 activity by Smyd2-mediated methylation, Nature 444, 629-632.
61. Sweis, R. F., Pliushchev, M., Brown, P. J., Guo, J., Li, F., Maag, D., Petros, A. M., Soni, N. B., Tse, C., Vedadi, M., Michaelides, M. R., Chiang, G. G., and Pappano, W. N. (2014) Discovery and Development of Potent and Selective Inhibitors of Histone Methyltransferase G9a, ACS Medicinal Chemistry Letters 5, 205-209.
62. Nguyen, H., Allali-Hassani, A., Antonysamy, S., Chang, S., Chen, L. H., Curtis, C., Emtage, S., Fan, L., Gheyi, T., Li, F., Liu, S., Martin, J. R., Mendel, D., Olsen, J. B., Pelletier, L., Shatseva, T., Wu, S., Zhang, F. F., Arrowsmith, C. H., Brown, P. J., Campbell, R. M., Garcia, B. A., Barsyte-Lovejoy, D., Mader, M., and Vedadi, M. (2015) LLY-507, a Cell-active, Potent, and Selective Inhibitor of Protein-lysine Methyltransferase SMYD2, The Journal of biological chemistry 290, 13641-13653.
63. Mitchell, L. H., Boriack-Sjodin, P. A., Smith, S., Thomenius, M., Rioux, N., Munchhof, M. J., Mills, J. E., Klaus, C. R., Totman, J., Riera, T. R., Raimondi, A., Jacques, S. L., West, K. A., Foley, M., Waters, N. J., Kuntz, K. W., Wigle, T. J., Porter Scott, M., Copeland, R. A., Smith, J. J., and Chesworth, R. (2015) Novel Oxindole Sulfonamides and Sulfamides: EPZ031686, the First Orally Bioavailable Small Molecule SMYD3 Inhibitor, ACS Med Chem Lett.
64. Huang, S. (2002) Histone methyltransferases, diet nutrients and tumour suppressors, Nat Rev Cancer 2, 469-476.
65. Hohenauer, T., and Moore, A. W. (2012) The Prdm family: expanding roles in stem cells and development, Development 139, 2267-2282.
66. Rea, S., Eisenhaber, F., O'Carroll, D., Strahl, B. D., Sun, Z. W., Schmid, M., Opravil, S., Mechtler, K., Ponting, C. P., Allis, C. D., and Jenuwein, T. (2000) Regulation of chromatin structure by site-specific histone H3 methyltransferases, Nature 406, 593-599.
67. Derunes, C., Briknarova, K., Geng, L., Li, S., Gessner, C. R., Hewitt, K., Wu, S., Huang, S., Woods, V. I., Jr., and Ely, K. R. (2005) Characterization of the PR domain of RIZ1 histone methyltransferase, Biochemical and biophysical research communications 333, 925-934.
68. Pinheiro, I., Margueron, R., Shukeir, N., Eisold, M., Fritzsch, C., Richter, F. M., Mittler, G., Genoud, C., Goyama, S., Kurokawa, M., Son, J., Reinberg, D., Lachner, M., and Jenuwein, T. (2012) Prdm3 and Prdm16 are H3K9me1 methyltransferases required for mammalian heterochromatin integrity, Cell 150, 948-960.
69. Eom, G. H., Kim, K., Kim, S. M., Kee, H. J., Kim, J. Y., Jin, H. M., Kim, J. R., Kim, J. H., Choe, N., Kim, K. B., Lee, J., Kook, H., Kim, N., and Seo, S. B. (2009) Histone methyltransferase PRDM8 regulates mouse testis steroidogenesis, Biochemical and biophysical research communications 388, 131-136.
70. Hayashi, K., Yoshida, K., and Matsui, Y. (2005) A histone H3 methyltransferase controls epigenetic events required for meiotic prophase, Nature 438, 374-378.
71. Wu, H., Mathioudakis, N., Diagouraga, B., Dong, A., Dombrovski, L., Baudat, F., Cusack, S., de Massy, B., and Kadlec, J. (2013) Molecular basis for the regulation of the H3K4 methyltransferase activity of PRDM9, Cell Rep 5, 13-20.

72. Ohno, H., Shinoda, K., Ohyama, K., Sharp, L. Z., and Kajimura, S. (2013) EHMT1 controls brown adipose cell fate and thermogenesis through the PRDM16 complex, Nature 504, 163-167.
73. Di Zazzo, E., De Rosa, C., Abbondanza, C., and Moncharmont, B. (2013) PRDM Proteins: Molecular Mechanisms in Signal Transduction and Transcriptional Regulation, Biology (Basel) 2, 107-141.
74. Briknarova, K., Zhou, X., Satterthwait, A., Hoyt, D. W., Ely, K. R., and Huang, S. (2008) Structural studies of the SET domain from RIZ1 tumor suppressor, Biochemical and biophysical research communications 366, 807-813.
75. Dlakic, M. (2001) Chromatin silencing protein and pachytene checkpoint regulator Dot1p has a methyltransferase fold, Trends in biochemical sciences 26, 405-407.
76. Min, J., Feng, Q., Li, Z., Zhang, Y., and Xu, R. M. (2003) Structure of the catalytic domain of human DOT1L, a non-SET domain nucleosomal histone methyltransferase, Cell 112, 711-723.
77. Yu, W., Chory, E. J., Wernimont, A. K., Tempel, W., Scopton, A., Federation, A., Marineau, J. J., Qi, J., Barsyte-Lovejoy, D., Yi, J., Marcellus, R., Iacob, R. E., Engen, J. R., Griffin, C., Aman, A., Wienholds, E., Li, F., Pineda, J., Estiu, G., Shatseva, T., Hajian, T., Al-Awar, R., Dick, J. E., Vedadi, M., Brown, P. J., Arrowsmith, C. H., Bradner, J. E., and Schapira, M. (2012) Catalytic site remodelling of the DOT1L methyltransferase by selective inhibitors, Nat Commun 3, 1288.
78. Basavapathruni, A., Jin, L., Daigle, S. R., Majer, C. R., Therkelsen, C. A., Wigle, T. J., Kuntz, K. W., Chesworth, R., Pollock, R. M., Scott, M. P., Moyer, M. P., Richon, V. M., Copeland, R. A., and Olhava, E. J. (2012) Conformational adaptation drives potent, selective and durable inhibition of the human protein methyltransferase DOT1L, Chem Biol Drug Des 80, 971-980.
79. Daigle, S. R., Olhava, E. J., Therkelsen, C. A., Basavapathruni, A., Jin, L., Boriack-Sjodin, P. A., Allain, C. J., Klaus, C. R., Raimondi, A., Scott, M. P., Waters, N. J., Chesworth, R., Moyer, M. P., Copeland, R. A., Richon, V. M., and Pollock, R. M. (2013) Potent inhibition of DOT1L as treatment of MLL-fusion leukemia, Blood 122, 1017-1025.
80. Yu, W., Smil, D., Li, F., Tempel, W., Fedorov, O., Nguyen, K. T., Bolshan, Y., Al-Awar, R., Knapp, S., Arrowsmith, C. H., Vedadi, M., Brown, P. J., and Schapira, M. (2013) Bromo-deaza-SAH: a potent and selective DOT1L inhibitor, Bioorg Med Chem 21, 1787-1794.
81. Yao, Y., Chen, P., Diao, J., Cheng, G., Deng, L., Anglin, J. L., Prasad, B. V., and Song, Y. (2011) Selective inhibitors of histone methyltransferase DOT1L: design, synthesis, and crystallographic studies, J Am Chem Soc 133, 16746-16749.
82. Lu, H., and Tonge, P. J. (2010) Drug-target residence time: critical information for lead optimization, Current opinion in chemical biology 14, 467-474.
83. Chandrasekharan, M. B., Huang, F., and Sun, Z. (2010) Histone H2B ubiquitination and beyond: Regulation of nucleosome stability, chromatin dynamics and the trans-histone H3 methylation, Epigenetics 5, 9.
84. Di Lorenzo, A., and Bedford, M. T. (2011) Histone arginine methylation, FEBS Lett 585, 2024-2031.
85. Zhang, X., Zhou, L., and Cheng, X. (2000) Crystal structure of the conserved core of protein arginine methyltransferase PRMT3, EMBO J 19, 3509-3519.

86. Yue, W. W., Hassler, M., Roe, S. M., Thompson-Vale, V., and Pearl, L. H. (2007) Insights into histone code syntax from structural and biochemical studies of CARM1 methyltransferase, EMBO J 26, 4402-4412.
87. Troffer-Charlier, N., Cura, V., Hassenboehler, P., Moras, D., and Cavarelli, J. (2007) Functional insights from structures of coactivator-associated arginine methyltransferase 1 domains, EMBO J 26, 4391-4401.
88. Ho, M. C., Wilczek, C., Bonanno, J. B., Xing, L., Seznec, J., Matsui, T., Carter, L. G., Onikubo, T., Kumar, P. R., Chan, M. K., Brenowitz, M., Cheng, R. H., Reimer, U., Almo, S. C., and Shechter, D. (2013) Structure of the arginine methyltransferase PRMT5-MEP50 reveals a mechanism for substrate specificity, PLoS One 8, e57008.
89. Hasegawa, M., Toma-Fukai, S., Kim, J. D., Fukamizu, A., and Shimizu, T. (2014) Protein arginine methyltransferase 7 has a novel homodimer-like structure formed by tandem repeats, FEBS letters 588, 1942-1948.
90. Cura, V., Troffer-Charlier, N., Wurtz, J. M., Bonnefond, L., and Cavarelli, J. (2014) Structural insight into arginine methylation by the mouse protein arginine methyltransferase 7: a zinc finger freezes the mimic of the dimeric state into a single active site, Acta crystallographica. Section D, Biological crystallography 70, 2401-2412.
91. Mitchell, L. H., Drew, A. E., Ribich, S. A., Rioux, N., Swinger, K. K., Jacques, S. L., Lingaraj, T., Boriack-Sjodin, P. A., Waters, N. J., Wigle, T. J., Moradei, O., Jin, L., Riera, T., Porter-Scott, M., Moyer, M. P., Smith, J. J., Chesworth, R., and Copeland, R. A. (2015) Aryl Pyrazoles as Potent Inhibitors of Arginine Methyltransferases: Identification of the First PRMT6 Tool Compound, ACS Medicinal Chemistry Letters 6, 655-659.
92. Eram, M. S., Shen, Y., Szewczyk, M., Wu, H., Senisterra, G., Li, F., Butler, K. V., Kaniskan, H. U., Speed, B. A., Dela Sena, C., Dong, A., Zeng, H., Schapira, M., Brown, P. J., Arrowsmith, C. H., Barsyte-Lovejoy, D., Liu, J., Vedadi, M., and Jin, J. (2015) A Potent, Selective and Cell-active Inhibitor of Human Type I Protein Arginine Methyltransferases, ACS chemical biology.
93. Sack, J. S., Thieffine, S., Bandiera, T., Fasolini, M., Duke, G. J., Jayaraman, L., Kish, K. F., Klei, H. E., Purandare, A. V., Rosettani, P., Troiani, S., Xie, D., and Bertrand, J. A. (2011) Structural basis for CARM1 inhibition by indole and pyrazole inhibitors, Biochem J 436, 331-339.
94. Chan-Penebre, E., Kuplast, K. G., Majer, C. R., Boriack-Sjodin, P. A., Wigle, T. J., Johnston, L. D., Rioux, N., Munchhof, M. J., Jin, L., Jacques, S. L., West, K. A., Lingaraj, T., Stickland, K., Ribich, S. A., Raimondi, A., Scott, M. P., Waters, N. J., Pollock, R. M., Smith, J. J., Barbash, O., Pappalardi, M., Ho, T. F., Nurse, K., Oza, K. P., Gallagher, K. T., Kruger, R., Moyer, M. P., Copeland, R. A., Chesworth, R., and Duncan, K. W. (2015) A selective inhibitor of PRMT5 with in vivo and in vitro potency in MCL models, Nature chemical biology 11, 432-437.
95. Siarheyeva, A., Senisterra, G., Allali-Hassani, A., Dong, A., Dobrovetsky, E., Wasney, G. A., Chau, I., Marcellus, R., Hajian, T., Liu, F., Korboukh, I., Smil, D., Bolshan, Y., Min, J., Wu, H., Zeng, H., Loppnau, P., Poda, G., Griffin, C., Aman, A., Brown, P. J., Jin, J., Al-Awar, R., Arrowsmith, C. H., Schapira, M., and Vedadi, M. (2012) An allosteric inhibitor of protein arginine methyltransferase 3, Structure 20, 1425-1435.
96. Liu, F., Li, F., Ma, A., Dobrovetsky, E., Dong, A., Gao, C., Korboukh, I., Liu, J., Smil, D., Brown, P. J., Frye, S. V., Arrowsmith, C. H., Schapira, M., Vedadi, M., and Jin, J. (2013) Exploiting an allosteric binding site of PRMT3 yields potent and selective inhibitors, J Med Chem 56, 2110-2124.
97. Kaniskan, H. U., Szewczyk, M. M., Yu, Z., Eram, M. S., Yang, X., Schmidt, K., Luo, X., Dai, M., He, F., Zang, I., Lin, Y., Kennedy, S., Li, F., Dobrovetsky, E., Dong, A., Smil, D., Min, S. J., Landon, M., Lin-Jones, J., Huang, X. P., Roth, B. L., Schapira, M., Atadja, P., Barsyte-Lovejoy, D., Arrowsmith, C. H., Brown, P. J., Zhao, K., Jin, J., and Vedadi, M. (2015) A potent, selective and cell-active allosteric inhibitor of protein arginine methyltransferase 3 (PRMT3), Angewandte Chemie 54, 5166-5170.
98. Harte, P. J., Wu, W., Carrasquillo, M. M., and Matera, A. G. (1999) Assignment of a novel bifurcated SET domain gene, SETDB1, to human chromosome band 1q21 by in situ hybridization and radiation hybrids, Cytogenet Cell Genet 84, 83-86.
99. Cao, R., and Zhang, Y. (2004) SUZ12 is required for both the histone methyltransferase activity and the silencing function of the EED-EZH2 complex, Molecular cell 15, 57-67.
100. Ciferri, C., Lander, G. C., Maiolica, A., Herzog, F., Aebersold, R., and Nogales, E. (2012) Molecular architecture of human polycomb repressive complex 2, Elife 1, e00005.
101. Wu, H., Zeng, H., Dong, A., Li, F., He, H., Senisterra, G., Seitova, A., Duan, S., Brown, P. J., Vedadi, M., Arrowsmith, C. H., and Schapira, M. (2013) Structure of the catalytic domain of EZH2 reveals conformational plasticity in cofactor and substrate binding sites and explains oncogenic mutations, PLoS One 8, e83737.
102. Antonysamy, S., Condon, B., Druzina, Z., Bonanno, J. B., Gheyi, T., Zhang, F., Macewan, I., Zhang, A., Ashok, S., Rodgers, L., Russell, M., and Gately Luz, J. (2013) Structural Context of Disease-Associated Mutations and Putative Mechanism of Autoinhibition Revealed by X-Ray Crystallographic Analysis of the EZH2-SET Domain, PLoS One 8, e84147.
103. Jiao, L., and Liu, X. (2015) Structural basis of histone H3K27 trimethylation by an active polycomb repressive complex 2, Science 350, aac4383.
104. Chase, A., and Cross, N. C. (2011) Aberrations of EZH2 in cancer, Clin Cancer Res 17, 2613-2618.
105. Knutson, S. K., Wigle, T. J., Warholic, N. M., Sneeringer, C. J., Allain, C. J., Klaus, C. R., Sacks, J. D., Raimondi, A., Majer, C. R., Song, J., Scott, M. P., Jin, L., Smith, J. J., Olhava, E. J., Chesworth, R., Moyer, M. P., Richon, V. M., Copeland, R. A., Keilhack, H., Pollock, R. M., and Kuntz, K. W. (2012) A selective inhibitor of EZH2 blocks H3K27 methylation and kills mutant lymphoma cells, Nat Chem Biol advance online publication.
106. McCabe, M. T., Ott, H. M., Ganji, G., Korenchuk, S., Thompson, C., Van Aller, G. S., Liu, Y., Graves, A. P., Della Pietra, A., 3rd, Diaz, E., LaFrance, L. V., Mellinger, M., Duquenne, C., Tian, X., Kruger, R. G., McHugh, C. F., Brandt, M., Miller, W. H., Dhanak, D., Verma, S. K., Tummino, P. J., and Creasy, C. L. (2012) EZH2 inhibition as a therapeutic strategy for lymphoma with EZH2-activating mutations, Nature 492, 108-112.
107. Qi, W., Chan, H., Teng, L., Li, L., Chuai, S., Zhang, R., Zeng, J., Li, M., Fan, H., Lin, Y., Gu, J., Ardayfio, O., Zhang, J. H., Yan, X., Fang, J., Mi, Y., Zhang, M., Zhou, T., Feng, G., Chen, Z., Li, G., Yang, T., Zhao, K., Liu, X., Yu, Z., Lu, C. X., Atadja, P., and Li, E. (2012) Selective inhibition of Ezh2 by a small molecule inhibitor blocks tumor cells proliferation, Proc Natl Acad Sci U S A 109, 21360-21365.
108. Konze, K. D., Ma, A., Li, F., Barsyte-Lovejoy, D., Parton, T., Macnevin, C. J., Liu, F., Gao, C., Huang, X. P., Kuznetsova, E., Rougie, M., Jiang, A., Pattenden, S. G., Norris, J. L., James, L. I., Roth, B. L., Brown, P. J., Frye, S. V., Arrowsmith, C. H., Hahn, K. M., Wang, G. G., Vedadi, M., and Jin, J. (2013) An orally bioavailable chemical probe of the Lysine Methyltransferases EZH2 and EZH1, ACS Chem Biol 8, 1324-1334.
109. Garapaty-Rao, S., Nasveschuk, C., Gagnon, A., Chan, E. Y., Sandy, P., Busby, J., Balasubramanian, S., Campbell, R., Zhao, F., Bergeron, L., Audia, J. E., Albrecht, B. K., Harmange, J. C., Cummings, R., and Trojer, P. (2013) Identification of EZH2 and EZH1 small molecule inhibitors with selective impact on diffuse large B cell lymphoma cell growth, Chem Biol 20, 1329-1339.
110. Gibaja, V., Shen, F., Harari, J., Korn, J., Ruddy, D., Saenz-Vash, V., Zhai, H., Rejtar, T., Paris, C. G., Yu, Z., Lira, M., King, D., Qi, W., Keen, N., Hassan, A. Q., and Chan, H. M. (2015) Development of secondary mutations in wild-type and mutant EZH2 alleles cooperates to confer resistance to EZH2 inhibitors, Oncogene.
111. Baylin, S. B. (2005) DNA methylation and gene silencing in cancer, Nat Clin Pract Oncol 2 Suppl 1, S4-11.
112. Song, J., Rechkoblit, O., Bestor, T. H., and Patel, D. J. (2011) Structure of DNMT1-DNA complex reveals a role for autoinhibition in maintenance DNA methylation, Science 331, 1036-1040.
113. Song, J., Teplova, M., Ishibe-Murakami, S., and Patel, D. J. (2012) Structure-based mechanistic insights into DNMT1-mediated maintenance DNA methylation, Science 335, 709-712.
114. Guo, X., Wang, L., Li, J., Ding, Z., Xiao, J., Yin, X., He, S., Shi, P., Dong, L., Li, G., Tian, C., Wang, J., Cong, Y., and Xu, Y. (2015) Structural insight into autoinhibition and histone H3-induced activation of DNMT3A, Nature 517, 640-644.
115. Ma, Z., Liu, H., and Wu, B. (2014) Structure-based drug design of catechol-O-methyltransferase inhibitors for CNS disorders, British journal of clinical pharmacology 77, 410-420.
116. Jacobs, S. A., Harp, J. M., Devarakonda, S., Kim, Y., Rastinejad, F., and Khorasanizadeh, S. (2002) The active site of the SET domain is constructed on a knot, Nature structural biology 9, 833-838.


Figure Legends

Figure 1: Phylogenetic trees of A) PKMT and B) PRMT enzymes based on chemogenic analysis (5). Proteins included in the PRMT tree are sequences with methyltransferase domains and include all PRMT enzymes as well as known RNA methyltransferases. Individual enzymes and protein families that are discussed in this review are highlighted with an asterisk.

Figure 2: Representative structures of protein methyltransferase enzymes. SAM and SAH are depicted as sticks in all panels. A. Ribbon depiction of PKMT SETD7 bound to SAH (1MT6 (116)). Structural features highlighted include the I-SET domain (yellow). B. Ribbon depiction of DOT1L (3QOW (5)). Structural features include the N-terminal domain (yellow) and the C-terminal α/β domain that binds SAM (green). C. Ribbon depiction of dimeric PRMT3 bound to SAH (1F3L (85)). Structural features include the methyltransferase domain (green), β-barrel domain (yellow), and dimerization domain (cyan).

Figure 3: Mechanism of A) PKMT and B) PRMT enzymes.

Figure 4: The auto-inhibitory loop of NSD family and SETD2 proteins can adopt multiple conformations. For all panels, the auto-inhibitory loop is cyan and the SET domain is green. A. Cartoon representation of NSD1 (3OOI (46)) bound to SAH in magenta sticks. The auto-inhibitory post-SET loop is blocking the substrate pocket within the SET domain. B. WHSC1L1 (4YZ8, Temple et al., 2015) bound to SAM has a disordered auto-inhibitory loop. C. SETD2 (4FMU (47)) bound to a propyl-substituted sinefungin analog depicted as orange sticks. The auto-inhibitory loop is flipped open such that the substrate pocket is more open.


Figure 5: Cavernous binding sites in SMYD family enzymes allow diversity in ligand design. A-C. Surface representations of SMYD1 (A; 3N71 (51)), SMYD2 (B; 3TG5 (53)) and SMYD3 (C; 5CCM (63)) are shown with N-terminal domain in light orange and C-terminal domain in light pink. Large cavernous binding pockets are formed at the domain interfaces. D. Protein-based overlays of ligands bound to SMYD family enzymes. p53 peptide and SAH from a SMYD2 structure (3TG5) are shown in cyan sticks. A-893 and SAM bound to SMYD2 (green; 4YND (44)) overlap with one trajectory of the peptide. EPZ030456 and SAM from a SMYD3 structure (5CCM; yellow) overlap with a different portion of the peptide.

Figure 6: PRDMs are structurally related to SET domains. A. PRDM2 (green, 2QPW (4)) is overlaid onto SETD7 (yellow, 1MT6 (116)). SAM is shown in magenta sticks. B. Human apo PRDM9 (yellow, 4IJD, Dong et al., 2012) is overlaid with murine PRDM9 (green, 4C1Q (71)) bound to peptide substrate (blue sticks) and SAM (magenta sticks). The auto-inhibitory loop that moves upon substrate binding is highlighted.

Figure 7: DOT1L is a dynamic protein methyltransferase. A. Surface representation of DOT1L-SAM complex (3QOW (5)) is shown. N- and C-terminal domains are colored light orange and pink, respectively; SAM is shown in magenta stick representation. Flexible loops are depicted as ribbons; the activation loop is yellow. B. Surface representation of DOT1L-iodotubercidin complex (3UWP, Yu et al., 2011) with the same color scheme as A. Iodotubercidin is shown in green stick representation. Dramatic rearrangement of the activation loop is observed in the presence of different ligands. C. Cavity view of DOT1L (grey) bound to SAM (magenta; 3QOW). D. Cavity view of DOT1L (grey) bound to pinometostat (cyan; 4HRA (79)). Ligand-induced side chain rearrangements open up a larger cavity in the protein’s interior when the inhibitor is bound.

Figure 8: Diversity is found within the PRMT family. The color scheme used in Figure 2C is reproduced for all panels. Nucleotides are represented as magenta spheres and ligands are represented by blue spheres. A. CARM1 bound to peptide and SAH (5DX0 (18)). B. CARM1 bound to an inhibitor and SAM (2Y1W (93)). C. PRMT5 bound to EPZ015666 and SAM (4X61 (94)); MEP50 is in orange ribbon. D. PRMT6 bound to EPZ024011 and SAM (4Y30 (91)). E. PRMT3 bound to an allosteric inhibitor, SGC707 (4RYL (97)). In CARM1, PRMT5, and PRMT6, inhibitors bind in the same region as the peptide binds in CARM1. In contrast, the allosteric compound bound to PRMT3 resides in a distal location relative to the other inhibitors.


For Table of Contents use only 125x49mm (300 x 300 DPI)
