In Silico Physicochemical Parameter Predictions - Molecular

Jan 10, 2013 - Deep Learning Applications for Predicting Pharmacological Properties of Drugs and Drug Repurposing Using Transcriptomic Data. Alexander...
2 downloads 3 Views 2MB Size
Article pubs.acs.org/molecularpharmaceutics

In Silico Physicochemical Parameter Predictions Mark C. Wenlock* and Patrick Barton AstraZeneca R&D Alderley Park, DMPK, Mereside, Macclesfield, Cheshire, SK10 4TF, United Kingdom ABSTRACT: Drug discovery is a complex process with the aim of discovering efficacious molecules where their potency and selectivity are balanced against ADMET properties to set the appropriate dose and dosing interval. The link between physicochemical properties and molecular structure are well established. The subsequent connections between physicochemical properties and a drug’s biological behavior provide an indirect link back to structure, facilitating the prediction of a biological property as a consequence of a particular molecular manipulation. Due to this understanding, during early drug discovery in vitro physicochemical property assays are commonly performed to eliminate compounds with properties commensurate with high attrition risks. However, the goal is to accurately predict physicochemical properties to prevent the synthesis of high risk compounds and hence minimize wasted drug discovery efforts. This paper will review the relevance to ADMET behaviors of key physicochemical properties, such as ionization, aqueous solubility, hydrogen bonding strength and hydrophobicity, and the in silico methodology for predicting them. KEYWORDS: in silico predictions, pKa, aqueous solubility, hydrogen bonding, hydrophobicity



INTRODUCTION

Principally drugs are designed with in vivo efficacy in mind, driven by free concentrations of the drug available to the protein target receptor and guided by in vitro potency assays. Bimolecular drug receptor interactions can be influenced by the manipulation of physicochemical properties and favorable contributions to the Gibbs free binding energy can include charged and uncharged hydrogen bond interactions and hydrophobic (entropy driven) interactions (along with salt bridges, charge transfer interactions, and van der Waals forces). Once synthesized, the physicochemical properties of a drug molecule are fixed (with the exception of dissolution that can be manipulated by physical form variations), and in minimizing attrition risks there is invariably a balancing act between features that are good for efficacy and those that give rise to preferential ADMET behavior. Difficulty arises when the properties good for protein affinity conflict with those for ADMET, and the druggability of a particular target needs to be considered when identifying complementary ligands. Pharmacokinetic parameters tend to be influenced by a combination of different physicochemical properties. Oral absorption of a drug is influenced by (and in conjunction with physiological and formulation factors) its molecular size, shape, hydrophobicity, hydrogen bonding potential, ionization (or charge), amphilicity, solubility, diffusivity, and hydrolytic

The aim of drug discovery is to find efficacious molecules where potency and selectivity are balanced against absorption, distribution, metabolism, elimination, and toxicity (ADMET) to set the appropriate dose and dosing interval. From the position where the biological relevance of the target has been established the key question that needs answering is: what is the correct chemical structure for an efficacious and safe candidate drug for the given biological target? Due to the complexity of this multiparameter problem our ability to directly relate a compound’s three-dimensional molecular structure to its human pharmacokinetic profile, in vivo efficacy, or toxicity is extremely limited; consequently extensive experimentation in in vitro and in vivo animal models are necessary to develop this understanding. However, there is increasing knowledge of the relationship between a compound’s bulk physiochemical properties and its biological behavior, in particular its pharmacokinetic properties. The effect of changes to chemical structures to manipulate physicochemical properties is reasonably well understood. An indirect situation exists where physicochemical properties can facilitate the interpretation of an ADMET effect due to a molecular structure cause. In the early stages of drug discovery, an appreciation of a compound’s physicochemical properties such as ionization (pKa), aqueous solubility, hydrogen bonding potential, and hydrophobicity can help the interpretation of its ADMET behavior. Furthermore, the ability to use in silico techniques to predict a virtual compound’s physicochemical properties and hence ADMET behavior can help guide drug discovery efforts into potentially more fruitful areas of chemical space. © XXXX American Chemical Society

Special Issue: Predictive DMPK: In Silico ADME Predictions in Drug Discovery Received: September 21, 2012 Revised: December 27, 2012 Accepted: January 10, 2013

A

dx.doi.org/10.1021/mp300537k | Mol. Pharmaceutics XXXX, XXX, XXX−XXX

Molecular Pharmaceutics

Article

stability.1−4 The pH-partition hypothesis states that nonionized form of an ionizable molecule can permeate across the gastrointestinal tract and the extent of ionization at a particular pH can be estimated from the Henderson−Hasselbalch equation.5,6 This concept oversimplifies a multifaceted process where a drug’s dissolution, solubility, and permeation characteristics are paramount as summarized in the Biopharmaceutics Classification System (BCS) or the Developability Classification System,7,8 hydrophobicity, molecular size, polarity, hydrogen bond counts, and charge being the fundamental parameters that influence solubility and permeation.9−16 The different hepatic metabolizing cytochrome P450 isoenzymes are selective for different types of molecules differentiated by hydrophobicity, charge, and shape.17−20 The extent of biliary clearance is influenced by molecular weight, hydrophobicity, and charge, the latter two implicated in renal clearance along with polarity.21−25 The distribution of unmetabolized drug throughout the body is governed to an extent by the molecule’s charge and hydrophobicity.23,26 The extent of binding to albumin is particular strong for acidic compounds compared to other proteolytic classes for a similar hydrophobicity.27,28 From the perspective of toxicity, a drug’s physicochemical properties can be related to adverse drug reactions due to secondary pharmacology that result from modulating targets other than the primary target. A drug’s hydrophobicity, size, polarity, amphilicity, charge, and electrophilicity are parameters that have been related to cardiovascular toxicity, hepatotoxicty, phospholipidosis, genotoxicity, and phototoxicity effects.29−33 In designing drugs, for druggable protein targets, an appreciation of how the physicochemical properties of a molecule affect its pharmacokinetics and toxicity (as well as its efficacy) is important for minimizing attrition risks. Furthermore an understanding of the structural features that influence such physicochemical properties is fundamental and facilitates in silico predictions of virtual compounds.

(where an approximation is made whereby the activities are replaced with concentrations). ⎛ [H+]aq [Base]aq ⎞ ⎟⎟ pK a = −log10 K a ≈ −log10⎜⎜ ⎝ [Acid]aq ⎠

(1)

In eq 1, for an monoprotic acidic compound the “acid” entity is “HA”, and the “base” (i.e., conjugate base) entity is “A−”, conversely for a monoprotic basic compound the “acid” (i.e., conjugate acid) entity is “BH+” and the “base” entity is “B”. The pKa of a basic compound reflects the ionization constant of the conjugate acid entity. Similar to other equilibrium constants, pKa will vary significantly depending on the temperature of measurement and the ionic strength of the aqueous media; consequently literature pKa data is quoted on two distinct scales: infinite dilution (or “thermodynamic”) scale and the constant ionic strength (or “practical” or “mixed”) scale. These different scales are a consequence of the different experimental techniques used to measure a compound’s pKa; techniques in early drug discovery tend to include potentiometric, spectrophotometric, capillary electrophoresis, and 1H NMR (for ionizations on proteins).37−39 The thermodynamic ionization constant Ktherm for a weak acid HA is defined as a K atherm =

γ − {H+}[A−] {H+}{A−} = A γHA [HA] {HA}

(2)

where γ are activity coefficients. The “practical” or “mixed” ionization constant Kmixed is defined as a K amixed =

{H+}[A−] [HA]

(3)

Equation 2 can be simplified by assuming that γHA equals 1 pK atherm = pK amixed − log γA−



(4)

Equation 4 highlights the difference between the two ionization scales, and the activity of the anion is calculated using the Debye−Huckel theory

DISCUSSION pKa. pKa is of significance in controlling the ADMET characteristics of a drug and can influence its bulk property effects and specific binding to a protein target receptor.34,35 In understanding the ionization of compounds within an aqueous medium it is important to appreciate the unique hydrogen bonding properties of water that give rise to many anomalous behaviors (e.g., hydrophobic effect, etc.). Chaplin offered the following description: “The hydrogen-bonding properties of water allow it to execute an intricate ballet exchanging partners while retaining complex order”.36 If water’s hydrogen bonding strength was slightly different, then its amphiprotic nature (i.e., its ability to act as both an acid and a base) would change, and it is this autoprotolysis equilibrium of a proton transfer involving a single substance that allows proteins to function and compounds to behave as acids or bases, and so forth. Kw is the autoprotolysis constant for water and at 25 °C, Kw ≈ 1 × 10−14 (pKw = 14) indicative that only a few water molecules are ionized. An acid is a species having the tendency to lose a proton while a base is a species having the tendency to add a proton. Acids and bases only manifest their properties by reacting with bases and acids, respectively. In aqueous media, equilibrium is established whereby an acid compound (HA) reacts with water acting as a base, or a basic compound (B) reacts with water acting as an acid. The ionization constant (Ka) for such an equilibrium is quoted as a pKa and defined by eq 1

−log γA− =

0.5085z 2I1/2 1 + 0.3281aI1/2

(5)

where I is the ionic strength, z is the charge of the ion, and a is a size parameter assumed to be 5 Å; for a A− anion (where I = 0.1) the difference is ∼0.1, for a A2− anion the difference is ∼0.4, and for a A3− anion the difference is ∼0.9. Although the Debye−Huckel activity coefficient calculation is an approximation, it is prudent to be aware of these differences when building in silico models for pKa that are trained on data from different sources especially for complex charged species. Qualitatively, as pKa increases, it implies the acid HA(aq) or the conjugate acid of the base BH+(aq) is more stable than the anion A−(aq) or the base B(aq); the reverse is true when pKa decreases. The molecular effects that influence pKas include inductive, electrostatic, and π-electron-delocalization (“mesomeric”) effects (in addition hydrogen bonding, conformational, and steric factors can also contribute).40 Mesomeric effects can result in very large changes in pKa due to the stabilization of charge via delocalization around π electron systems. At 25 °C at 1 atm, the change in Gibbs free energy, for a 1 unit change in pKa is ∼5.7 kJ·mol−1. The result of this is to make acids and bases appreciably stronger or weaker than seen in saturated systems. Cyclohexylamine has a pKa of ∼10.6, whereas aniline B

dx.doi.org/10.1021/mp300537k | Mol. Pharmaceutics XXXX, XXX, XXX−XXX

Molecular Pharmaceutics

Article

has a pKa of ∼4.7, a difference of 6 log units due to the mesomeric effect.41,42 Within a series of meta- and parasubstituted aromatic or aliphatic/alicyclic molecules the free energy changes are linear and additive and expressed as either a Hammett or Taft equation (eqs 6 and 7):

pK a = pK a0 − ρ(Σσ )

(6)

pK a = pK a0 − ρ*(Σσ *)

(7)

different methodologies for predicting pKa values for protein and small molecules comments that the main limiting factor to improving pKa predictions is the need for large quantities of well-curated data.46 At AstraZeneca (AZ) an ionization formula nomenclature of AxBy is used to systematically categorize compounds and subsequently aid QSAR/QSPR modeling efforts. “A” refers to acidic centers within the molecule that are appreciably ionized at a pH below 11 and x can equal 0, 1, 2, 3, ...; where A1 refers to the most acidic center and A2 the next most acidic, etc. “B” refers to a basic centers that are appreciably ionized at a pH above pH 2, and y can equal 0, 1, 2, 3, ...; where B1 refers to the most basic center and B2 the next basic, etc. Figure 1 shows a plot of ACD pKa v12 predictions for

pK0a is the pKa of the unsubstituted acid or base. ρ and ρ* are constants for a particular aromatic and aliphatic/alicyclic system, and σ (more specifically σmeta and σpara) and σ* are constants for a particular substituent. If pK0a is known along with an estimate of the ρ (or ρ*) value for a particular series, then the pKa of an unknown compound from this series can be estimated by using the σ (or σ*) values for the relevant substituent(s). For benzoic acid the pK0a is ∼4.2, and the ρ value for this series is arbitrarily set to 1 (the σmeta and σpara for a hydrogen substituent is arbitrarily set to 0).40 Hence for benzoic acid its pKa = pK0a . For 4-chlorobenzoic acid the σpara value for the para-chloro substituent is 0.24; hence its pKa is predicted to be ∼4.0. For pyridine the pK0a is ∼5.3, and the ρ value for this series has been established to be ∼5.9. This much higher ρ value indicates that the electronic effect of a given substituent on the pKa of pyridine will be much greater than that on benzoic acid. For 4-chloropyridine its pKa is predicted to be ∼3.9. σmeta and σpara values can be further convoluted into inductive (or field) effects, F, and resonance effects, R. Hence, the effect of substituents on pKa values therefore lead to quantitative descriptors, σmeta, σpara, σ*, F, and R, of electronic effects. These can be used in quantitative structure activity (or property) relationships (QSAR or QSPR), (other electronic descriptors include atomic partial charges, HOMO/LUMO energies, dipole moments, etc.). Drug receptor associations are typically a function of a drug molecule’s hydrophobicity, sterics, electronics, etc. For a given chemical series, practically the difference, ΔpKa, between the pKa of an unsubstituted and substituted version can be used as a descriptor as this will be equivalent to ρ∑σ or ρ*∑σ* and ρ or ρ* although unknown will be by assumption constant for a chemical series. The use of Hammett and Taft equations to predict pKa are more broadly referred to as linear free energy relationships (LFER) and an example of an empirical methodology. Other empirical methods include similarity searching of databases or QSPR. The MoKa (1.2) tool is a notable example of such a method which has been trained on 26 515 pKa values and validated against 5581 pKa values taken from Roche’s in-house library.43 The predictive power of such tools is related to the inclusion within the training set of similar ionizable motifs that represent the molecular effects that influence the compound in question pKa(s). The alternative to these empirical methods are quantum mechanical (QM) calculations.44 A variety of different commercially available pKa estimation packages have recently been compared against each other in two studies.44,45 Although different compound sets were used in the two evaluations (Nicklaus and Liao: 197 pharmaceutical substances; Manchester et al.: 211 druglike compounds) a consensus of the best packages include ACD/pK a v12 (Advanced Chemistry Development, Inc.; LFER), ADME Boxes (Pharma Algorithms, Inc.; QSPR), MoKa (Molecular Discovery, Ltd.; Topological), and Marvin (ChemAxon Ltd.; QSPR) with typically errors >0.4 to ∼1 log unit. A further excellent review, by Crippen et al., of

Figure 1. ACD pKa v12 predictions versus pKa measurements of the basic center for ∼900 AZ proprietary monobasic compounds.

the basic center in a diverse selection of ∼900 AZ proprietary monobases against measurements, and the root-mean-squared error (RMSE) between prediction and measurement for this data set is still 1.45 units. This highlights the need for better QSPR or QM methods of predictions. The assignment of ionization formula is based on experimental data and structural critique as commonly employed measurement techniques (potentiometry, UV, or CEMS) have pH-electrode limitations restricting their dynamic range to 2−11 and extremely strong acidic or basic pKa values such as sulfonic acid or guanidine can be hard to measure. The relevance of some pKa values can be questioned if the pH at which the biological parameter is being modeled is such that a particular center is not appreciably ionized. Bumetanide has three pK a s: A1 = 3.5 (carboxylic acid), A2 = 9.7 (sulphonamide), B1 = 6.1 (aniline):

At pH = 7.4 only A1 is appreciably ionized, and at this pH bumetanide can be approximated to behave as a monoacid (i.e., A1B0). However, if modeling a biological parameter where pH is a variable, such as solubility or absorption throughout the gastrointestinal tract where pH ranges from ∼1.5 to ∼8, then all three pKa values need to be accounted for as the numbers of species present at any pH will vary.47 C

dx.doi.org/10.1021/mp300537k | Mol. Pharmaceutics XXXX, XXX, XXX−XXX

Molecular Pharmaceutics

Article

From a QSAR perspective pKa contains information on electronic effects of different chemical substituents on a particular drug receptor activity; it is also used as a bulk descriptor to estimate the different concentrations of ionized and un-ionized species at a particular pH. The distribution of species for the monoacid benzoic acid is shown in Figure 2. At

The two ionizations in salsalate correspond to well-separated pKa values (>3 units) and can be treated as separate processes, and practically only two species exist in equilibrium at any particular pH: H2A and HA− below pH = 5.5 and HA− and A2− above pH = 5.5). An analogous situation exists for dibasic and zwitterionic (where acidic pKa < basic pKa) compounds with well separated pKas: BH22+ and BH+ exist below a particular pH and BH+ and B above; HABH+ and A−BH+ exist below a particular pH and A−BH+ and A−B exist above. (Note, for such zwitterions the HAB species will be present in solution but to a much lesser extent than the A−BH+ species.) The situation with carbenicillin is significantly different and highlights a critical aspect of attempting to structurally interpret experimental pKa data for multiply charged compounds and any subsequent application of this interpretation to future predictions, see Figure 4. The similarity between the 2 pKas of carbenicillin implies that the ionization of the two acidic groups take place simultaneously. These pKa values are referred to as overlapping and can no longer be assigned to the ionization of a particular functional group. pKa1 is the pH at which 0.5 equivalent of protons are removed from carbenicillin and pKa2 is the pH at which 1.5 equivalent of protons are removed and are referred to as macroscopic pKa values. A macroscopic pKa cannot be assigned to a specific structural feature. As can be seen in Figure 4 between the pH range 2−5 there are four different species that exist in equilibrium. Microscopic pKa values must be determined before the concentration of each monoanion can be determined, and a similar situation exists for certain types of amphoteric compounds where the acidic and basic pKa values are close. Dealing with multiply charged species will be specific to each drug discovery project. At AZ over the last 12 months more than 50% of the compound’s that had a pKa measurement contained more than 1 pKa, see Figure 5. From the perspective of in silico ADME, pKa is generally used to guide the subdivision of compounds into broad classification such as acids, bases, neutrals, or amphoterics, including studies for predicting renal clearance, extent of plasma protein binding, microsomal binding, and volume of distribution.25,27,28,48−50 This assignment is usually made by assessing ionization behavior at pH = 7.4, but this can seem contrived as the strength of the ionization is mostly ignored as a compound may be treated as an acid if it contains an acidic center whose pKa < 7.4 (i.e., it is more than 50% ionized at pH = 7.4) and contains no appreciably ionized basic centers (i.e., none that are where

Figure 2. Species distribution for benzoic acid (monoacid, pKa = 4.2).

any pH there exists, in equilibrium, two species and the percentage of the anionic species, A−, is defined as 100 A−% = (8) 1 + 10 pKa − pH If a biological process is dependent on the concentration of either of these two species, then it will be dictated by pKa and pH. For oral absorption the distribution of these two species will vary across the pH gradient of the gastrointestinal tract (pH range from ∼1.5 to ∼8). For diacids, such as salsalate, three species exist in equilibrium, see Figure 3. The percentage of the anionic species, A2− and HA− are defined as 100 A2 −% = 2 pK a1+ pK a2 − 2pH (9) 1 + 10 + 10 pKa − pH 100

HA−% = 1 + 10

pK a1− pH

2

+ 10 pH − pKa

(10)

Figure 3. Species distribution for salsalate (diacid, pKa1 = 2.9, pKa2 = 8.0). D

dx.doi.org/10.1021/mp300537k | Mol. Pharmaceutics XXXX, XXX, XXX−XXX

Molecular Pharmaceutics

Article

Figure 4. Species distribution for carbenicillin (diacid, pKa1 = 2.6, pKa2 = 3.1).

factors such as pKa, aqueous solubility, hydrophobicity will be influential but in the very early stages of evaluating a new potential drug candidate other key dissolution factors such as salt form, surface area, particle size, crystal form, and dosage form are typically uncharacterized simply due to pragmatic reasons. With respect to determining a compound’s aqueous solubility there are broadly two different experimental methodologies: equilibrium (thermodynamic) or kinetic aqueous solubility assays, typically at 20 °C and pH = 7.4.53,54 The starting form of the compound being one of the main distinguishing features (i.e., solid or DMSO stock) along with the length of time the compound and aqueous buffer are allowed to equilibrate for. Why the legacy development of two different forms of assays? From an AZ early drug discovery perspective there has always been a need for an early indication of solubility issues within a typical buffer system that could affect the design and running of subsequent biological in vitro or in vivo studies that require a minimum aqueous concentration. This can be satisfied using a more highthroughput kinetic aqueous solubility assay. Thermodynamic aqueous solubility assays tend to be lower throughput but, in our experience, less variable. Essentially the kinetic aqueous solubility measurement was poorly predictive of a compound’s thermodynamic aqueous solubility except for those compounds that exceeded the upper limit of the former assay.55 Such an observation has significant implications when in silico predictive models are built because if the RMSE between the observed and the predicted data for a training set is better than the random error and bias associated with the experimental measurement then the model is overfitted and its use questionable. Within the literature there are significant deficiencies in the consistency and reliability of aqueous solubility data for what is a simple experiment and the ability to predict aqueous solubility can be challenging.56 From a basic QSPR perspective it is not unexpected that an inverse linear relationship exists between the logarithm of the aqueous thermodynamic solubility (at pH = 7.4 and 20 °C starting with solid material) and the logarithm of the octan-1-ol/water partition coefficient (i.e., log P). The latter is essentially the

Figure 5. Proteolytic class distribution for AZ compounds with a pKa measurement in the last 12 months.

pKa > 7.4), and a similar but reversed assignment may be applied to classify basic compounds. Such classification may be suitable when studying bulk transfer processes such as passive absorption but not so when modeling drug receptor associations where the anion or cation are orders of magnitude more potent than the neutral form due to a charge reinforced interaction; in this scenario, at pH = 7.4, an acidic pKa = 8.9 or a basic pKa = 5.9 may be important. Therefore in drug discovery, an appreciation of the different proteolytic classes, the relative strengths of the pKa values and their relevance to a particular biological parameter (i.e., constant pH or variable pH) is paramount. From the perspective of designing new compounds the limitation of experimental techniques need to be considered (i.e., macroscopic pKa values, and the reliable detection range) if using measured data to train bespoke LFER, QSPR, or QM methodologies for pKa predictions for novel chemical series. Solubility. The absorption of an oral or dry powder inhaled drug into the systemic circulation requires dissolution of the compound into biological aqueous media.51,52 Physicochemical E

dx.doi.org/10.1021/mp300537k | Mol. Pharmaceutics XXXX, XXX, XXX−XXX

Molecular Pharmaceutics

Article

molecules have high levels of conformational flexibility; furthermore the accuracy of log P predictions can be unacceptably poor.61 An alternative LFER aqueous solubility equation was proposed by Abraham et al. where a compound’s aqueous solubility was related to the following descriptors: molar refractivity, polarizability, hydrogen bond acidity, hydrogen bond basicity, McGowan molecular volume.62 Further complications involved in predicting aqueous solubility is the affect of temperature and in addition for ionizable compounds the affect of pH. Semiempirical models exist for relating the molar solubility of a compound and temperature dependence (expressed as the van’t Hoff enthalpy of solubility) in solvents, although water is more difficult to model than organic solvents and so the extrapolation is less reliable for aqueous solubility data.63 Theoretical equations exist for modeling the ideal affect of pH on solubility for mono-, di-, and triprotic compounds, but examples of sparingly soluble drugs that show deviations from the ideal are known.64 The reliability of using temperature and pH models to extrapolate experimental solubility data measured at 20 °C and pH = 7.4 in a nonbiologically relevant aqueous system, as commonly generated in the very early stages of a drug discovery screening cascades, to 37 °C and the pH range 1.5−8 as might be the case in modeling absorption across the gastrointestinal tract needs to be carefully considered. The purpose of such experimental data is more applicable to the identification of potential solubility issues in subsequent assays. From a design perspective the requirements of an aqueous solubility prediction trained on early screening data is for the model to be able to confidently (i.e., with approximately 80% probability) identify virtual compounds that will have poor aqueous solubility. Hill et al. define low aqueous solubility to be 3 and total PSA < 75 Å2).30 An obvious shortcoming in the application of physicochemical properties to drug discovery for ionizable molecules is that log P is of interest from a compound quality perspective but DMPK tends to focus on measurements of log D7.4. A log P* can be estimated from a log D7.4 and pKa measurement and application of the relevant proteolytic class equation, the asterisk implying an indirect measure of log P. However, where the former is typically measured routinely, pKa measurements are not routine on all compounds, and the use of pKa predictions using generic methodologies can add large errors to a log P* estimate. To address this issue AZ has developed a Pipeline Pilot (Accelrys Inc.) based AutoLogP system that automatically maximizes the log P* data set for proprietary compounds with an experimentally determined log D (at a given pH) and pKa data. The experimenter’s assessment of a compound’s ionization formula (i.e., AxBy, see pKa section above) is critical in the curation of the experimental pKa data as the decision flow logic within Pipeline Pilot will vary for different proteolytic classes. The system has the pragmatic functionality that enables the assignment of a pKa for a constant ionizable group within a common scaffold (or substructure) used within a particular drug discovery chemical series (i.e., where the chemical exploratory changes do not affect the ionization). Thus maximizing the use of experimental pKa data and consequently minimizing the absolute number of compound measurements. Provided that a log D measurement exists and an ionization formula has been assigned, there is a general pattern to the inherent decision flow logic of the AutoLogP system. Do all pKa values exist for the compound as indicated by the ionization formula? If not, check to see if a fragment pKa exists for the chemical series, and use ACD/pKa v12 prediction to estimate whether the ionization is significant to the calculation of log P*. A molecule may have multiple ionizations which are measurable but too weak to affect the log P* calculation; hence an approximation is valid using a more simple proteolytic class equation (e.g., certain A2B0 compounds can be treated as A1B0). Although dependent on the ionization formula in question, an acidic ionization may be ignored if it is predicted to be greater than 8.9 (e.g., phenols), and for such a compound with an ionization formula A1B0 it can be assumed that the log D7.4 measurement is a log P measurement. The value of 8.9 is somewhat conservative but allows for error in the ACD/pKa v12 prediction. Similarly, a value of 5.9 can be used for basic ionizations. The estimation of log P* for certain amphoteric compounds is possible. For

example in the case of A1B1 if the acidic pKa is 5 and the basic pKa is 3, then the compound can be treated as a monoacid (i.e., A1B0). Figure 7 shows a plot of clogP values for AZ proprietary

Figure 7. clogP versus log P* (determined using measured log D7.4 and pKa data) for 1324 AZ proprietary monoacidic (A1B0), monobasic (A0B1), or amphotheric (A1B1) compounds.

compounds against their log P* values, and although there is a significant correlation the RMSE (1.07) is large and equivalent to that shown by Tetko et al.61 There is an assumption that log P* provides a better estimate of an AZ proprietary compound’s lipophilicity than a clogP prediction, and it is implicit that log P* will be a better descriptor in QSAR/QSPR modeling. Predictive models for human PPB are often modeled using lipophilicity descriptors.95 PPB data can be analyzed on a pseudo binding constant scale (K PPB ) by taking the (logarithmic) ratio of the %bound to the %free value.28 The plots between the measured human log KPPB and either log P* or clogP for 373 AZ proprietary monoacidic, monobasic, or amphoteric compounds are shown in Figures 8 and 9. The two correlations are weak, but the former appears better and is illustrative that log P* for AZ proprietary compounds potentially has extra value as a lipophilicity descriptor in subsequent QSAR over the use of clogP. However, log P*

Figure 8. Log P* (determined using measured log D7.4 and pKa data) versus human plasma protein binding for 373 AZ proprietary monoacidic (A1B0), monobasic (A0B1), or amphotheric (A1B1) compounds. H

dx.doi.org/10.1021/mp300537k | Mol. Pharmaceutics XXXX, XXX, XXX−XXX

Molecular Pharmaceutics

Article

potential can facilitate a BCS classification.96 The Rodgers and Rowland methodology predicted tissue distribution for different proteolytic classes using log P and pKas.97 Luker et al. described an in silico methodology for predicting attrition in preclinical toxicology for basic oral drugs, where key descriptors included those that described a molecule’s charge distribution, shape, and hydrogen bonding potential.98 The drug efficiency index (DEI) has been proposed as a useful parameter to optimize ADMET and in vivo efficacy potentials of molecules, and Valko et al. described an in vitro estimate of DEI that uses an estimate of PPB and hydrophobicity.99 These examples are highlighted to further demonstrate the greater focus on the use of physicochemical parameters to understand ADMET and subsequently modulate a drug’s dose.



Figure 9. clogP versus human plasma protein binding for 373 AZ proprietary monoacidic (A1B0), monobasic (A0B1), or amphotheric (A1B1) compounds.

AUTHOR INFORMATION

Corresponding Author

*Phone: +44 1625 516603. E-mail: mark.wenlock@astrazeneca. com. Notes

The authors declare no competing financial interest.

values require experimental data so from a drug design perspective, where log P is used as a key compound quality parameter, it follows that the “local” in silico QSPR model for this property trained specifically on log P* data may offer complementary and potentially more accurate estimates of intrinsic lipophilicity.



ABBREVIATIONS USED ADME(T), absorption distribution metabolism elimination (toxicity); AZ, AstraZeneca; BCS, Biopharmaceutics Classification System; clogP, Daylight’s octanol/water partition coefficient; CMC, critical micelle concentration; D, octan-1ol/water distribution coefficient; DEI, drug efficiency index; HBA, hydrogen bond acceptor; HBD, hydrogen bond donor; HOMO, highest occupied molecular orbital; Kα, hydrogen bond donor association constant; Kβ, hydrogen bond acceptor association constant; KPPB, pseudo plasma protein binding constant; LFER, linear free energy relationship; LUMO, lowest unoccupied molecular orbital; MP, melting point; P, octan-1ol/water partition coefficient; pKa, ionization constant; PPB, plasma protein binding; PSA, polar surface area; QM, quantum mechanical; QSAR, quantitative structure activity relationship; QSPR, quantitative structure property relationship; RMSE, root-mean-squared error; Ssolid 0 , intrinsic aqueous solubility of crystalline (solid) compound; SFI, solubility forecast index



CONCLUSION The rational design of compounds within drug discovery benefits from understanding how a drug’s ADMET behavior is controlled by its molecular structure or, at the very least, influenced by its physicochemical properties. The use of commercial or proprietary in silico models, built using a variety of statistical techniques, for pKa, solubility, and hydrophobicity is routine within the drug discovery. Physicochemical property predictions are often used as descriptors within multivariate QSAR/QSPR models that relate to a particular biological process or more holistically used as broad compound quality criteria. High-quality experimental physicochemical property data should result in improved in silico QSPR models for predicting that parameter and proprietary in silico models of commonly built for this purpose. The log P* parameter is derived from experimental log D data and experimental pKa data or decisions based on predicted pKa data using specific proteolytic class equations. Log P* has been shown to be a better descriptor for understanding human PPB data compared to clogP. It follows that the inclusion of such improved in silico predictions as descriptors in ADMET multivariate QSAR models should confer some degree of advantage within drug discovery, helping to reduce costs and uncertainty as decisions may be more confidently made on virtual structures. The prediction accuracy of these models will be dependent on the applicability domain of the training set compounds to the compounds being predicted. As drug discovery projects explore different chemical areas, the applicability of such models needs to be monitored, and if necessary models should be refined to incorporate information pertaining to new compounds at appropriate intervals. Although there are many examples where physicochemical parameters have been related to ADMET, four further studies are noteworthy. Comer and Box proposed that an appreciation of a molecule’s log P, intrinsic solubility, and hydrogen bonding



REFERENCES

(1) Martinez, M. N.; Amidon, G. L. A Mechanistic Approach to Understanding the Factors Affecting Drug Absorption: A Review of Fundamentals. J. Clin. Pharmacol. 2002, 42, 620−643. (2) Yu, L. X.; Lipka, E.; Crison, J. R.; Amidon, G. L. Transport Approaches to the Biopharmaceutical Design of Oral Drug Delivery Systems: Prediction of Intestinal Absorption. Adv. Drug Delivery Rev. 1996, 19, 359−376. (3) Van de Waterbeemd, H. Fundamental Variables of the Biopharmaceutics Classification System (BCS): A Commentary. Eur. J. Pharm. Sci. 1998, 7, 1−3. (4) Lö benberg, R.; Amidon, G. L. Modern Bioavailability, Bioequivalence and Biopharmaceutics Classication System. New Scientific Approaches to International Regulatory Standards. Eur. J. Pharm. Biopharm. 2000, 50, 3−12. (5) Shore, P. A.; Brodie, B. B.; Hogben, C. A. M. The Gastric Secretion of Drugs: A pH Partition Hypothesis. J. Pharmacol., Exp., Therap. 1957, 119, 361−369. (6) Corey, H. E. Stewart and Beyond: New Models of Acid-Base Balance. Kidney Int. 2003, 64, 777−787. (7) Amidon, G. L.; Lennernäs, H.; Shah, V. P.; Crison, J. R. A Theoretical Basis For a Biopharmaceutic Drug Classification: The

I

dx.doi.org/10.1021/mp300537k | Mol. Pharmaceutics XXXX, XXX, XXX−XXX

Molecular Pharmaceutics

Article

Correlation of In Vitro Drug Product Dissolution and In Vivo Bioavailability. Pharm. Res. 1995, 12, 413−420. (8) Butler, J. M.; Dressman, J. B. The Developability Classification System: Application of Biopharmaceutics Concepts to Formulation Development. J. Pharm. Sci. 2010, 99, 4940−4954. (9) Hansch, C.; Quinlan, J. E.; Lawrence, G. L. The Linear FreeEnergy Relationship Between Partition Coefficients and the Aqueous Solubility of Organic Liquids. J. Org. Chem. 1968, 33, 347−350. (10) Yalkowsky, S. H.; Valvani, S. C. Solubility and Partitioning I: Solubility of Nonelectrolytes in Water. J. Pharm. Sci. 1980, 69, 912− 922. (11) Van de Waterbeemd, H.; Camenisch, G. Estimation of Caco-2 Cell Permeability using Calculated Molecular Descriptors. Quant. Struct.-Act. Relat. 1996, 15, 480−490. (12) Palm, K.; Stenberg, P.; Luthman, K.; Artursson, P. Polar Molecular Surface Properties Predict the Intestinal Absorption of Drugs in Humans. Pharm. Res. 1997, 14, 568−571. (13) Abraham, M. H.; Le, J. The Correlation and Prediction of the Solubility of Compounds in Water Using an Amended Solvation Energy Relationship. J. Pharm. Sci. 1999, 88, 868−880. (14) Clark, D. E. Rapid Calculation of Polar Molecular Surface Area and Its Application to the Prediction of Transport Phenomena. 1. Prediction of Intestinal Absorption. J. Pharm. Sci. 1999, 88, 807−814. (15) Kasim, N. A.; Whitehouse, M.; Ramachandran, C.; Bermejo, M.; Lennernäs, H.; Hussain, A. S.; Junginger, H. E.; Stavchansky, S. A.; Midha, K. K.; Shah, V. P.; Amidon, G. L. Molecular Properties of WHO Essential Drugs and Provisional Biopharmaceutical Classification. Mol. Pharmaceutics 2004, 1, 85−96. (16) Avdeef, A.; Bendels, S.; Di, L.; Faller, B.; Kansy, M.; Sugano, K.; Yamauchi, Y. PAMPACritical Factors For Better Predictions of Absorption. J. Pharm. Sci. 2007, 96, 2893−2909. (17) Smith, D. A.; Ackland, M. J.; Jones, B. C. Properties of Cytochrome P450 Isoenzymes and Their Substrates Part 2: Properties of Cytochrome P450 Substrates. Drug Discovery Today 1997, 2, 479− 486. (18) Lewis, D. F. V. On the Recognition of Mammalian Microsomal Cytochrome P450 Substrates and Their Characteristics. Biochem. Pharmacol. 2000, 60, 293−306. (19) Riley, R. J.; Parker, A. J.; Trigg, S.; Manners, C. N. Development of a Generalized, Quantitative Physicochemical Model of CYP3A4 Inhibition For Use in Early Drug Discovery. Pharm. Res. 2001, 18, 652−655. (20) Lewis, D. F. V.; Jacobs, M. N.; Dickins, M. Compound Lipophilicity For Substrate Binding to Human P450s in Drug Metabolism. Drug Discovery Today 2004, 9, 530−537. (21) Kwon, Y. Handbook of Essential Pharmacokinetics, Pharmacodynamics and Drug Metabolism For Industrial Scientists; Kluwer Academic/Plenum Publishers: New York, 2001; pp 169−173. (22) Hamilton, H. W.; Steinbaugh, B. A.; Stewart, B. H.; Chan, O. H.; Schmid, H. L.; Schroeder, R.; Ryan, M. J.; Keiser, J.; Taylor, M. D.; Blankley, C. J.; Kaltenbronn, J. S.; Wright, J.; Hicks, J. Evaluation of Physicochemical Parameters Important to the Oral Bioavailability of Peptide-like Compounds: Implications For the Synthesis of Renin Inhibitors. J. Med. Chem. 1995, 38, 1446−1455. (23) Smith, D. A.; Jones, B. C.; Walker, D. K. Design of Drugs Involving the Concepts and Theories of Drug Metabolism and Pharmacokinetics. Med. Res. Rev. 1996, 16, 243−266. (24) Varma, M. V. S.; Feng, B.; Obach, R. S.; Troutman, M. D.; Chupka, J.; Miller, H. R.; El-Kattan, A. Physicochemical Determinants of Human Renal Clearance. J. Med. Chem. 2009, 52, 4844−4852. (25) Paine, S. W.; Barton, P.; Bird, J.; Denton, R.; Menochet, K.; Smith, A.; Tomkinson, N. P.; Chohan, K. K. A Rapid Computational Filter For Predicting the Rate of Human Renal Clearance. J. Mol. Graph. Modell. 2010, 29, 529−537. (26) Barton, P.; Davis, A. M.; McCarthy, D. J.; Webborn, P. J. H. Drug-Phospholipid Interactions. 2. Predicting the Sites of Drug Distribution Using n-Octanol/Water and Membrane/Water Distribution Coefficients. J. Pharm. Sci. 1997, 86, 1034−1039.

(27) Davis, A. M.; Riley, R. The Impact of Physical Organic Chemistry on the Control of Drug-like Properties. In Drug Design: Cutting Edge Approaches; Flower, D. R., Ed.; Royal Society of Chemistry: Cambridge, UK, 2002; pp 106−123. (28) Fessey, R. E.; Austin, R. P.; Barton, P.; Davis, A. M.; Wenlock, M. C. The Role of Plasma Protein Binding in Drug Discovery. In Pharmacokinetic Profiling in Drug Research: Biological, Physicochemical, and Computational Strategies [LogP2004, Lipophilicity Symposium]; Testa, B., Ed.; Verlag Helvetica Chimica Acta: Zurich, Switzerland, 2006; pp 119−141. (29) Valentin, J.; Hammond, T. Safety and Secondary Pharmacology: Successes, Threats, Challenges and Opportunities. J. Pharmacol. Toxicol. 2008, 58, 77−87. (30) Hughes, J. D.; Blagg, J.; Price, D. A.; Bailey, S.; DeCrescenzo, G. A.; Devraj, R. V.; Ellsworth, E.; Fobian, Y. M.; Gibbs, M. E.; Gilles, R. W.; Greene, N.; Huang, E.; Krieger-Burke, T.; Loesel, J.; Wager, T.; Whiteley, L.; Zhang, Y. Physiochemical Drug Properties Associated With In Vivo Toxicological Outcomes. Bioorg. Med. Chem. Lett. 2008, 18, 4872−4875. (31) Reasor, M. J.; Hastings, K. L.; Ulrich, R. G. Drug-Induced Phospholipidosis Issues and Future Directions. Exp. Opin. Drug Saf. 2006, 5, 567−583. (32) Benigni, R. Structure-Activity Relationship Studies of Chemical Mutagens and Carcinogens: Mechanistic Investigations and Prediction Approaches. Chem. Rev. 2005, 105, 1767−1800. (33) Hansch, C.; McKarns, S. C.; Smith, C. J.; Doolittle, D. J. Comparative QSAR Evidence For a Free-Radical Mechanism of Phenol-Induced Toxicity. Chem.-Biol. Interact. 2000, 127, 61−72. (34) Manallack, D. T. The pKa Distribution of Drugs: Application to Drug Discovery. Perspect. Med. Chem. 2007, 1, 25−38. (35) Andrews, P. R.; Craik, D. J.; Martin, J. L. Functional Group Contributions to Drug-Receptor Interactions. J. Med. Chem. 1984, 27, 1648−1657. (36) Chaplin, M. Do We Underestimate the Importance of Water in Cell Biology? Nat. Rev. Mol. Cell Biol. 2006, 7, 861−866. (37) Völgyi, G.; Ruiz, R.; Box, K.; Comer, J.; Bosch, E.; TakácsNovák, K. Potentiometric and Spectrophotometric pKa Determination of Water-Insoluble Compounds: Validation Study in a New Cosolvent System. Anal. Chim. Acta 2007, 583, 418−428. (38) Ishihama, Y.; Oda, Y.; Asakawa, N. Microscale Determination of Dissociation Constants of Multivalent Pharmaceuticals by Capillary Electrophoresis. J. Pharm. Sci. 1994, 83, 1500−1507. (39) Nielsen, J. E. Chapter 5 Analyzing Protein NMR pH-Titration Curves. In Annual Reports in Computational Chemistry, 1st ed.; Wheeler, R., Spellmeyer, D., Eds.; Elsevier: Amsterdam, The Netherlands, 2008; Vol. 4; pp 89−106. (40) Perrin, D. D.; Dempsey, B.; Serjeant, E. P. pKa Predictions For Organic Acids and Bases; Chapman and Hall Ltd: London, UK, 1981. (41) Hancock, R. D.; Nakani, B. S.; Marsicano, F. Relationship Between Lewis Acid-Base Behavior in the Gas Phase and in Aqueous Solution. 1. Role of Inductive, Polarizability, and Steric Effects in Amine Ligands. Inorg. Chem. 1983, 22, 2531−2535. (42) Berry, D.; Bukka, K.; Satchell, R. S. Quantitive Aspects of Lewis Acidity. XV. Basicity of Substituted Pyridines and Quinolines Towards Metal Halides in Diethyl Ether. J. Chem. Soc., Perkin Trans. 2 1976, 1, 89−91. (43) Milletti, F.; Storchi, L.; Goracci, L.; Bendels, S.; Wagner, B.; Kansy, M.; Cruciani, G. Extending pKa Prediction Accuracy: HighThroughput pKa Measurements to Understand pKa Modulation of New Chemical Series. Eur. J. Med. Chem. 2010, 45, 4270−4279. (44) Liao, C.; Nicklaus, M. C. Comparison of Nine Programs Predicting pKa Values of Pharmaceutical Substances. J. Chem. Inf. Model. 2009, 49, 2801−2812. (45) Manchester, J.; Walkup, G.; Rivin, O.; You, Z. Evaluation of pKa Estimation Methods on 211 Druglike Compounds. J. Chem. Inf. Model. 2010, 50, 565−571. (46) Lee, A. C.; Crippen, G. M. Predicting pKa. J. Chem. Inf. Model. 2009, 49, 2013−2033. J

dx.doi.org/10.1021/mp300537k | Mol. Pharmaceutics XXXX, XXX, XXX−XXX

Molecular Pharmaceutics

Article

(47) Agoram, B.; Woltosz, W. S.; Bolger, M. B. Predicting the Impact of Physiological and Biochemical Processes on Oral Drug Bioavailability. Adv. Drug Delivery Rev. 2001, 50 (Suppl. 1), S41−S67. (48) Austin, R. P.; Barton, P.; Cockroft, S. L.; Wenlock, M. C.; Riley, R. J. The Influence of Nonspecific Microsomal Binding on Apparent Intrinsic Clearance, and Its Prediction From Physicochemical Properties. Drug Metab. Dispos. 2002, 30, 1497−1503. (49) Abraham, M. H.; Austin, R. P. The Effect of Ionized Species on Microsomal Binding. Eur. J. Med. Chem. 2012, 47, 202−205. (50) Lombardo, F.; Obach, R. S.; Shalaeva, M. Y.; Gao, F. Prediction of Human Volume of Distribution Values For Neutral and Basic Drugs. 2. Extended Data Set and Leave-Class-Out Statistics. J. Med. Chem. 2004, 47, 1242−1250. (51) Dressman, J. B.; Thelen, K.; Jantratid, E. Towards Quantitative Prediction of Oral Drug Absorption. Clin. Pharmacokinet. 2008, 47, 655−667. (52) Patton, J. S.; Byron, P. R. Inhaling Medicines: Delivering Drugs to the Body Through the Lungs. Nat. Rev. Drug Discovery 2007, 6, 67− 74. (53) Kerns, E. H.; Di, L.; Carter, G. T. In Vitro Solubility Assays in Drug Discovery. Curr. Drug Metab. 2008, 9, 879−885. (54) Colclough, N.; Hunter, A.; Kenny, P. W.; Kittlety, R. S.; Lobedan, L.; Tam, K. Y.; Timms, M. A. High Throughput Solubility Determination With Application to Selection of Compounds For Fragment Screening. Biorg. Med. Chem. 2008, 16, 6611−6616. (55) Wenlock, M. C.; Austin, R. P.; Potter, T.; Barton, P. A Highly Automated Assay For Determining the Aqueous Equilibrium Solubility of Drug Discovery Compounds. J. Assoc. Lab. Autom. 2011, 16, 276− 284. (56) Llinàs, A.; Glen, R. C.; Goodman, J. M. Solubility Challenge: Can You Predict Solubilities of 32 Molecules Using a Database of 100 Reliable Measurements? J. Chem. Inf. Model. 2008, 48, 1280−1303. (57) Jain, N.; Yalkowsky, S. H. Estimation of the Aqueous Solubility 1: Application to Organic Nonelctrolytes. J. Pharm. Sci. 2001, 90, 234− 252. (58) Wenlock, M. C.; Barton, P.; Potter, T.; Austin, R. P. A Method For Measuring the Lipophilicity of Compounds in Mixtures of 10. J. Biomol. Screen. 2011, 16, 348−355. (59) Ran, Y.; He, Y.; Yang, G.; Johnson, J. L. H.; Yalkowsky, S. H. Estimation of Aqueous Solubility of Organic Compounds by Using the General Solubility Equation. Chemosphere 2002, 48, 487−509. (60) Irmann, F. Eine Einfache Korrelation Zwischen Wasserlöslichkeit und Struktur von Kohlenwasserstoffen und Halogenohlenwasserstoffen. Chem. Ing. Tech. 1965, 37, 789−798. (61) Tetko, I. V.; Poda, G. I.; Ostermann, C.; Mannhold, R. Accurate In Silico log P Predictions: One Can’t Embrace the Unembracable. QSAR Comb. Sci. 2009, 28, 845−849. (62) Abraham, M. H.; Le, J. The Correlation and Prediction of the Solubility of Compounds in Water Using an Amended Solvation Energy Relationship. J. Pharm. Sci. 2000, 88, 868−880. (63) Nordström, F. L.; Rasmuson, Å. C. Prediction of Solubility Curves and Melting Properties of Organic and Pharmaceutical Compounds. Eur. J. Pharm. Sci. 2009, 36, 330−344. (64) Avdeef, A. Solubility of Sparingly-Soluble Ionizable Drugs. Adv. Drug Delivery Rev. 2007, 59, 568−590. (65) Hill, A. P.; Young, R. J. Getting Physical in Drug Discovery: A Contemporary Perspective on Solubility and Hydrophobicity. Drug Discovery Today 2010, 15, 648−655. (66) Bergström, C. A. S. Computational Models to Predict Aqueous Drug Solubility, Permeability and Intestinal Absorption. Exp. Opin. Drug Metab. Toxicol. 2005, 1, 613−627. (67) Delaney, J. S. Predicting Aqueous Solubility From Structure. Drug Discovery Today 2005, 10, 289−295. (68) Dearden, J. C. In Silico Prediction of Aqueous Solubility. Exp. Opin. Drug Discovery 2006, 1, 31−52. (69) Davis, A. M.; Bruneau, P. In Silico Prediction of Solubility. In Drug Bioavailabilty (40, Methods and Principles in Medicinal Chemistry), 2nd ed.; van de Waterbeemd, H., Testa, B., Eds.; Wiley-VCH Verlag GmbH & Co. KGaA: Weinheim, Germany, 2009; pp 53−68.

(70) Gleeson, M. P.; Hersey, A.; Hannongbua, S. In Silico ADME Models: A General Assessment of Their Utility in Drug Discovery Applications. Curr. Top. Med. Chem. 2011, 11, 358−381. (71) Bergström, C. A. S.; Wassvik, C. M.; Norinder, U.; Luthman, K.; Artursson, P. Global and Local Computational Models For Aqueous Solubility Prediction of Drug-Like Molecules. J. Chem. Inf. Comput. Sci. 2004, 44, 1477−1488. (72) Wood, D. J.; Buttar, D.; Cumming, J. G.; Davis, A. M.; Norinder, U.; Rodgers, S. L. Automated QSAR With a Hierarchy of Global and Local Models. Mol. Inf. 2011, 30, 960−972. (73) Murray-Rust, P.; Glusker, J. P. Directional Hydrogen Bonding to sp2- and sp3-Hybridized Oxygen Atoms and Its Relevance to Ligand-Macromolecule Interactions. J. Am. Chem. Soc. 1984, 106, 1018−1025. (74) Fersht, A. R. Dissection of the Structure and Activity of the Tyrosyl-tRNA Synthetase by Site-Directed Mutagenesis. Biochemistry 1987, 26, 8031−8037. (75) Lipinski, C. A.; Lombardo, F.; Dominy, B. W.; Feeney, P. J, Experimental and Computational Approaches to Estimate Solubility and Permeability in Drug Discovery and Development Settings. Adv. Drug Delivery Rev. 1997, 23, 3−25. (76) Abraham, M. H.; Duce, P. P.; Prior, D. V.; Barratt, D. G.; Morris, J. J.; Taylor, P. J. Hydrogen Bonding. Part 9. Solute Proton Donor and Proton Acceptor Scales For Use in Drug Design. J. Chem. Soc., Perkins Trans. II 1989, 10, 1355−1375. (77) Raevsky, O. A.; Skvortsov, V. S. 3D Hydrogen Bond Thermodynamics (HYBOT) Potentials in Molecular Modelling. J. Comput. Aided Mol. Des. 2002, 16, 1−10. (78) Murray, J. S.; Ranganathan, S.; Politzer, P. Correlations Between the Solvent Hydrogen Bond Acceptor Parameter β and the Calculated Molecular Electrostatic Potential. J. Org. Chem. 1991, 56, 3734−3739. (79) Kenny, P. W. Prediction of Hydrogen Bond Basicity From Computed Molecular Electrostatic Properties: Implications For Comparative Molecular Field Analysis. J. Chem. Soc., Perkin Trans. 2 1994, 2, 199−202. (80) Kenny, P. W. Hydrogen Bonding, Electrostatic Potential, and Molecular Design. J. Chem. Inf. Model. 2009, 49, 1234−1244. (81) Scherrer, R. A.; Howard, S. M. Use of Distribution Coefficients in Quantitative Structure-Activity Relationships. J. Med. Chem. 1977, 20, 53−58. (82) Kerns, E. H.; Di, L. Physicochemical Profiling: Overview of the Screens. Drug Discovery Today Technol. 2004, 1, 343−348. (83) Abraham, M. H.; Chadha, H. S.; Whiting, G. S.; Mitchell, R. C. Hydrogen Bonding. 32. An Analysis of Water-Octanol and WaterAlkane Partitioning and the Δlog P parameter of Seiler. J. Pharm. Sci. 1994, 83, 1085−1100. (84) Pauletti, G. M.; Wunderli-Allenspach, H. Partition Coefficients In Vitro: Artificial Membranes as a Standardized Distribution Model. Eur. J. Pharm. Sci. 1994, 1, 273−282. (85) Hansch, C.; Leo, A. The Hydrophobic Parameter: Measurement and Calculation. In Exploring QSAR: Fundamentals and Applications in Chemistry and Biology; Heller, S. R., Ed.; American Chemical Society: Washington, DC, 1995; pp 97−124. (86) Austin, R. P.; Davis, A. M.; Manners, C. N. Partitioning of Ionizing Molecules Between Aqueous Buffers and Phospholipid Vesicles. J. Pharm. Sci. 1995, 84, 1180−1183. (87) Sarkar, A.; Kellogg, G. E. HydrophobicityShake Flasks, Protein Folding and Drug Discovery. Curr. Top. Med. Chem. 2010, 10, 67−83. (88) Snyder, P. W.; Mecinović, J.; Moustakas, D. T.; Thomas, S. W., III; Harder, M.; Mack, E. T.; Lockett, M. R.; Héroux, A.; Sherman, W.; Whitesides, G. M. Mechanism of the Hydrophobic Effect in the Biomolecular Recognition of Arylsulfonamides by Carbonic Anhydrase. Proc. Natl. Acad. Sci. U.S.A. 2011, 108, 17889−17894. (89) Fischer, H.; Gottschlich, R.; Seelig, A. Blood-Brain Barrier Permeation: Molecular Parameters Governing Passive Diffusion. J. Membr. Biol. 1998, 165, 201−211. (90) Peters, J.; Hunziker, D.; Fischer, H.; Kansy, M.; Weber, S.; Kritter, S.; Müller, A.; Wallier, A.; Ricklin, F.; Boehringer, M.; Poli, S. K

dx.doi.org/10.1021/mp300537k | Mol. Pharmaceutics XXXX, XXX, XXX−XXX

Molecular Pharmaceutics

Article

M.; Csato, M.; Loeffler, B. An Aminomethylpyrimidine DPP-IV Inhibitor With Improved Properties. Bioorg. Med. Chem. Lett. 2004, 14, 3575−3578. (91) Mannhold, R.; Poda, G. I.; Ostermann, C.; Tetko, I. V. Calculation of Molecular Lipophilicity: State-of-the-Art and Comparison of Log P Methods on More Than 96,000 Compounds. J. Pharm. Sci. 2009, 98, 861−893. (92) Leo, A. J. Log, P Anomalies: Indications of Unusual Conformations and Electronic Interactions in Bioactive Solutes. In QSAR in Design of Bioactive Compounds; Kuchar, M., Ed.; Prous Science Publishers: Barcelona, Spain, 1992; pp 35−46. (93) Gleeson, M. P. Generation of a Set of Simple, Interpretable ADMET Rules of Thumb. J. Med. Chem. 2008, 51, 817−834. (94) Leeson, P. D.; Springthorpe, B. The Influence of Drug-like Concepts on Decision-making in Medicinal Chemistry. Nat. Rev. Drug Discovery 2007, 6, 881−890. (95) Colmenarejo, G. In Silico ADME Prediction: Data Sets and Models. Curr. Comput. Aided Drug Des. 2005, 1, 365−376. (96) Box, K. J.; Comer, J. E. A. Using Measured pKa, Log P and Solubility to Investigate Supersaturation and Predict BCS Class. Curr. Drug Metab. 2008, 9, 869−878. (97) Rodgers, T.; Rowland, M. Physiologically Based Pharmacokinetic Modelling 2: Predicting the Tissue Distribution of Acids, Very Weak Bases, Neutrals and Zwitterions. J. Pharm. Sci. 2006, 95, 1238− 1257. (98) Luker, T.; Alcaraz, L.; Chohan, K. K.; Blomberg, N.; Brown, D. S.; Butlin, R. J.; Elebring, T.; Griffin, A. M.; Guile, S.; St-Gallay, S.; Swahn, B.; Swallow, S.; Waring, M. J.; Wenlock, M. C.; Leeson, P. D. Strategies to Improve In Vivo Toxicology Outcomes For Basic Candidate Drug Molecules. Bioorg. Med. Chem. Lett. 2011, 21, 5673− 5679. (99) Valko, K.; Chiarparin, E.; Nunhuck, S.; Montanari, D. In Vitro Measurement of Drug Efficiency Index to Aid Early Lead Optimization. J. Pharm. Sci. 2012, 101, 4155−4169.

L

dx.doi.org/10.1021/mp300537k | Mol. Pharmaceutics XXXX, XXX, XXX−XXX