MS Puzzle: Strategies for ... - ACS Publications

Artificial intelligence software approaches are providing new insights into ... ments in data collection speeds (2), and ... The sheer volume of data ...
0 downloads 0 Views 10MB Size
Solving the MS/MS Puzzle:

Strategies for Automated Structure Elucidation

to solve the problem. Artificial intelligence software approaches are providing plied Experts in mass spectral interpreta­ new insights into MS/MS spectral feature/substructure tion are highly trained individuals and, like any experts, they are hard to find. relationships Mature expertise in tandem mass spec­ Christie G. Enke Adrian P. Wade Peter T. Palmer Kevin J. Hart Department of Chemistry Michigan State University East Lansing, Mich. 48824

There has been considerable interest recently in advancing the state of auto­ mated structure elucidation. To some extent this interest has been a neces­ sary reaction to growth in the so-called hyphenated techniques (i), improve­ ments in data collection speeds (2), and the ever-increasing ability of new in­ strumentation to generate large quan­ tities of multidimensional data. Recent advances in mass spectrome­ try instrumentation include femtogram detection limits (3); the ability to collect a complete mass spectrometry/ mass spectrometry (MS/MS) fragmen­ tation map in a few seconds (4); and multidimensional instrumentation such as Fourier transform mass spec­ trometry (FT-MS), which has pro­ duced five consecutive stages of MS (5), and gas chromatography/infrared/ mass spectrometry (GC/IR/MS), which produces five dimensions of in­ formation (6). The sheer volume of data produced by such techniques mandates some automated method for extracting chemically relevant information. As such multidimensional instrumenta­ tion becomes more common, one can expect that traditional structure eluci­ dation tools (including human experts) will fail to extract all the valuable ana­ lytical information within a reasonable time interval. Thus development of

new automated structure elucidation procedures has become a priority. Van Dalen notes that "there can be little doubt that the future of analysis is inextricably linked with that of the computer" (7). Any intelligent instru­ ment should be an adaptive system; ideally, it should be able to learn from experience and its operation should adapt to changes in external circum­ stances (e.g., self-optimization). Intel­ ligent instruments perform operations normally left to a human expert. As such, they will incorporate aspects of artificial intelligence which, when ap­ plied to chemical instrumentation, is defined as "the scientific discipline which attempts to endow computercontrolled machinery with the ability for actions which, if done by a human being, would be thought to require in­ telligence" (8). Expert systems are a practical appli­ cation of artificial intelligence that at­ tempts to capture the interpretive skills of experts in a form that can then be consulted by less knowledgeable us­ ers. Most expert systems use knowl­ edge formalisms in which expertise may be represented by rules (e.g., if α is true and b is true, then conclude c). Rule-based programming offers sever­ al advantages over conventional algo­ rithmic approaches. It is simpler to un­ derstand and modify; one explicitly stated rule may be equivalent to sever­ al instances of implicit knowledge rep­ resented by conventional code scat­ tered throughout a large program. Problems that proved intractable by conventional programming styles have been shown to be solvable using a rulebased approach (9). The results from a rule-based system may be reviewed by a human user in terms of the rules ap­

tral interpretation has not yet had a chance to develop. An instrument con­ taining an expert system with just part of the human experts' skills will be a significant advance over what has gone before. Truly expert spectral interpre­ tation systems should be able to deal with facts, rules, and metarules. Facts are simple statements (e.g., the colli­ sion gas pressure is 1 millitorr) that may have some degree of uncertainty associated with them. Rules are the mechanisms by which an expert estab­ lishes new facts, based on what is al­ ready known (e.g., if neutral loss of 28

FOCUS amu occurs, then the carbonyl sub­ structure is likely). Metarules indicate the pathways by which an expert for­ mulates new rules and plans how to solve problems. They describe the for­ malisms and procedures by which the application rules are developed (i.e., the mechanisms for learning from ex­ perience). Metarules can also assist in conflict resolution (when available evi­ dence suggests two or more conflicting conclusions) and temporal reasoning (when evidence that was obtained pre­ viously has been refuted or is now no longer valid for some other reason). Thus metarules are essential when the data space is very large, complex, and inherently empirical. Automated techniques for mass spectral interpretation Many approaches to automating the interpretation process have been tried, each aimed at decreasing the level of expertise needed by a user or increas­ ing the amount of useful information

ANALYTICAL CHEMISTRY, VOL. 59, NO. 23, DECEMBER 1, 1987 · 1363 A

FOCUS derived. In a recent article Small categorized such automated spectral interpretation techniques as either direct or indirect database methods (10). Direct database methods, commonly called spectral matching methods, require a library of reference spectra and some means for comparing sample and reference spectra. The prominence of these methods can be attributed to the success and popularity of GC/MS as a mixture analysis technique. Several different spectral matching methods for mass spectrometric data are currently available (11, 12), the most well-known of which is the Probability-Based Matching system (13). Considering that Chemical Abstract Services currently recognizes more than 7 million different organic compounds, the most serious drawback to any mass spectral matching method is

that a library of mass spectra can never be complete. Furthermore, the precision of this technique decreases as the library becomes more complete, because the number of spectra similar to any given spectrum will increase. Experimental irreproducibility that results in differences between sample and reference spectra of the same compound leads to increasingly erroneous results. Although spectral matching methods are valuable aids for limited-domain problems involving known compounds, for true unknowns (compounds whose spectra may not exist in the library) one must often resort to indirect database or interpretive methods for structure elucidation. The SelfTraining Interpretive and Retrieval System (STIRS) is one such method. STIRS deduces substructural informa-

Mass spectral, IR, and NMR data

Plan

Heuristic DENDRAL

Substructural constraints and molecular formula

Generate "••:::;·



GENOA '$

Candidate structures

tion about an unknown by analyzing its mass spectrum for the presence of 26 different classes of mass spectral data (13). These data classes correspond to combinations of fragment masses or inferred neutral losses that are known to have structural significance. This technique directly uses information from all available library spectra without resorting to predefined spectrum/substructure correlations; hence the selftraining appellation. Perhaps the most well-known indirect database method is the DENDRAL project, which began at Stanford University in 1965 (14). DENDRAL, a classical approach to the solution of a problem with a large state space, employs plan, generate, and test stages (Figure 1). The plan stage (Heuristic DENDRAL) derives constraints on the unknown structure using empirically derived fragmentation rules that are automatically inferred from mass spectral data of known compounds by Meta-DENDRAL (15). The generate stage (GENOA) provides all possible nonredundant structures consistent with the constraints (16). The test stage ranks the resulting list of structures by comparing their simulated mass spectra to the unknown spectrum. This simulation uses fragmentation rules derived by Meta-DENDRAL. DENDRAL has been applied to several problems, and its performance has been shown to equal or exceed the performance of a human expert in structure elucidation (16). Its power is derived not from "knowing" more than any human expert, but from a thorough application of constraints and a systematic search through the space of possible structures. However, in many cases mass spectral data alone were insufficient to determine the complete structure of an unknown. Thus DENDRAL used NMR data to provide additional substructural constraints. DENDRAL's simulation of mass spectra can best be described as an approximation. A complete and accurate simulation of mass spectra for all molecules under various experimental conditions is currently unobtainable. MS/MS: Another dimension in structure elucidation

Test

Mass spectral simulation and ranking of candidate structures

Figure 1. Schematic of the DENDRAL approach to structure elucidation using plan, generate, and test stages. 1364 A · ANALYTICAL CHEMISTRY, VOL. 59, NO. 23, DECEMBER 1, 1987

Mass spectra of compounds that contain common substructures often show patterns of features that are caused by those substructures. Some common fragment ions and inferred neutral losses have been recognized as fairly specific indicators for certain substructures. These have been tabulated and are widely used for mass spectral interpretation (17,18). Until recently, there has been no attempt to exhaustively

FOCUS deduce and organize the correlations between tandem mass spectral features and substructures. Although MS/MS instruments have been in use for more than 10 years, there is still no general database of MS/MS spectra or any agreement on standard conditions for collecting such spectra (19). A major difficulty in the interpretation of mass spectra is that the products of all the fragmentation processes are overlapped in a mass spectrum. Electron impact ionization imparts ions with excess energy. These ions can then undergo fragmentation within the ion source, and the subsequent ionmolecule reactions and decompositions can often give a wide variety of products. Rearrangements further complicate interpretation. A mass spectrum indicates only the presence of ions and gives no information about their parentage. Isotopic labeling is required to determine parent-daughter relationships from mass spectra. MS/MS has several advantages over conventional MS for structure elucidation, the most obvious of which is the second dimension of information. Three types of features can be derived from an MS/MS data space: specific daughter ions, neutral losses, and parent-to-daughter transitions. The parent-daughter relationships and neutral losses can thus be determined directly rather than inferred. Considering the mass range of 1-500 amu, the

MS/MS data space yields 125,750 potential features (500 possible daughter ion masses, 500 possible neutral losses, and 124,750 possible parent-to-daughter transitions), and the corresponding MS data space yields only 500. With higher resolution MS/MS instruments, the number of potential features increases even further. Not only are certain features in the MS/MS database more specific than individual mass spectral features, but many more specific combinations of features are possible from MS/MS data than from MS data alone. Thus by using combinations of MS/MS features, the template for the particular contribution that a substructure makes to the mass spectral data set can be more adequately specified. By selecting a parent ion and fragmenting it, information about an isolated portion of a molecule can be obtained. Thus conceptually it is reasonable to expect that parts of a molecular structure can be identified from characteristic features within the MS/MS data space. Correlating M S / M S features with substructures

An extension of spectral matching to MS/MS has been reported by Cross and Enke (20). In this work, individual daughter spectra were correlated with specific substructures. The presence of substructures in unknowns was determined by matching daughter spectra BMHIiHHBS^^^I^^^BIiH^HHMH^HHHMMHHMMH

Known compounds

MAPS TQMS

Database

Generate rules Apply rules

Unknown compound

Inclusion and exclusion rules

STRCHK Identify substructures Summarize discriminating features

To "intelligent controller"

GENOA Candidate Structures

Generate structures

Substructures present or absent

EFG Constrain EF generation

Figure 2. Components and data pathways for the Automated Chemical structure Elucidation System (ACES), including the Triple Quadrupole Mass Spectrometer (TQMS), the Method for Analyzing Patterns in Spectra (MAPS), the Empirical Formula Generator (EFG), the constrained structure generator (GENOA), and a routine to organize the output of GENOA into groups based on known and unknown portions of the complete structure (STRCHK). Dashed lines represent the learning mode; solid lines represent the identification mode. 1366 A · ANALYTICAL CHEMISTRY, VOL. 59, NO. 23, DECEMBER 1, 1987

from an unknown against a database of reference daughter spectra. This method still suffers from drawbacks inherent to spectral matching and does not take full advantage of the extra dimension of information that MS/MS affords. For instance, the carbonyl substructure gives rise to a characteristic neutral loss of 28 amu, which may be seen in daughter spectra of any ionic substructures that contain this moiety. We have developed a computer method that automatically searches for and identifies the relationships between MS and MS/MS spectral features and substructures. This algorithm, the Method for Analyzing Patterns in Spectra (MAPS), assumes that much information lies within patterns of features in MS n spectra, and not just in the presence of individual masses or neutral losses. A more complete description of this software will appear elsewhere (21). MAPS expresses the relationships between MS and MS/MS spectral features and substructures in the form of production rules that may then be used to help identify the presence or absence of substructures in unknown compounds. A database of a few thousand rules could in theory be used to identify the structures of millions of compounds. ACES using MS and M S / M S data

Several artificial intelligence and machine learning methodologies are being developed in this laboratory for automatic structure elucidation from MS and MS/MS data. Together they form an integrated set of software tools known as the Automated Chemical structure Elucidation System (ACES). The individual components and data pathways of this system are shown in Figure 2. A triple quadrupole mass spectrometer (TQMS) is included as the source of MS/MS data. The MAPS software operates in two different modes: the learning mode (dashed line) and the identification mode (solid line). In the learning mode, MAPS identifies the relationships between substructures and the characteristic features they produce in the MS and MS/MS data spaces using data from known compounds and stores these in the form of rules. In the identification mode, spectra from an unknown are searched for the diagnostic features contained in the rules. The substructures identified as present or absent by MAPS can then be used as constraints for an empirical formula generator (EFG) and for a structure generator (GENOA). The main assumption behind this system is that if enough substructures can be identified as present or absent in an unknown, the

FOCUS

GENOA

Inclusion and exclusion rules

Substructure buckets

Substructure data Training set

TQMS

Correlation

MS and MS/MS data

Filters and relevancy tests

Feature buckets False correlations

Figure 3. Schematic of the rule generation procedure in MAPS.

complete structure can be determined. The function and state of development of each of the main components in this system are described below. MAPS. This software was developed in InterLISP-D on a Xerox 1108 AI workstation. MAPS deduces the relationships between substructures and the characteristic features they produce in the MS and MS/MS data space without prior assumptions regarding fragmentation pathways. These relationships may then be used to determine the presence and absence of substructures in unknown compounds not

requiring that these spectra be in the database. MAPS uses supervised learning to formulate the rules as shown in Figure 3. First, MS and MS/MS data are obtained for a set of known compounds; these data comprise the training set. From this, the "feature bucket" and "substructure bucket" data structures are created to facilitate the next step: correlation of features with substructures. A minimum level of correlation is specified in this stage in the rule generation. Because each spectral feature has some level of correlation with a

substructure, this minimum level of correlation affects the number of features in the rules. Chemical knowledge is then used to filter out spurious features from the rules. These filters include the minimum and maximum fragment masses that can logically be attributed to each substructure and constraints to define legal fragment masses and compositions based on the elemental composition of the substructure and rules of valence. This process results in inclusion and exclusion rules that predict the presence and absence of substructures, respectively. —*~

DEPENDABLE CONTINUITY SHAKER BATH

Cal No. 6250 To provide immersed agitation at controlled temperatures. Available in two sizes, table model or floor model, to meet a wide range of requirements. Variable speed, stroke, and temperature featured in all models.

RECIPROCATING SHAKERS

Cat no. 1000 (w/o platinum) Apparatus for determining copper, lead, antimony, cadmium, nickel, and zinc, by the electro-deposition process. Apparatus also available for high speed determinations and the separation of metals by the mercury cathode technique.

Cat No. 5850 For general purpose shaking with a reciprocating (back and forth) action. Wide range of accessories available. Speeds from 20 to 240 excursions per minute.

1

STIRRERS 1

Cat. No. 7225 (w/o agitator) The Eberbach Con-Torque stirrer with high torque output, is designed to stir highly viscous materials such as resins. Hollow spindle concept utilized in Eberbach Stirrer line assures additional gripping of agitator and facilitates changing containers. Available from 1/70 thru 1/10 hp.

ANALYTICAL ELECTRO-ANALYSIS APPARATUS

Cat No. 8581

WARING CONTAINERS

Cal No. 8575 |

Power Unit

Cat No. 8470

Cat No. 8520

Glass, stainless steel, and aluminum containers available in capacities from 1000 ml to 3 ml. Commercial one quart Blendor Power units to include an explosion proof power unit will accept all Eberbach containers. Replacement blending assemblies and components available.

ACCENT ON PERFORMANCE Cat No. 8017 with 8020 Container

BLENDER POWER UNITEXPLOSION PROOF

Eberbach's new blender power unit, catalog No. 8017, solves the hazard problem of blending mixtures containing volatile solvents. It features an explosion proof VA H.P. single phase motor with automatic reset thermal protection. Motor provides free running spindle speed of 11,500 RPM. The heavy duty motor is UL approved for Class I, Group D and Class II, Group E, F, and G operation. Accommodates the Waring Gallon Size container, Eberbach catalog No. 8020, and with the addition of an adapter will accommodate standard size Eberbach and Waring Containers.

[berbach C O R P O R A T I O N A N N AHBOP,

MICHIGAN

CIRCLE 40 ON READER SERVICE CARD

ANALYTICAL CHEMISTRY, VOL. 59, NO. 23, DECEMBER 1, 1987 · 1367 A

FOCUS An example of one such rule is the inclusion rule for the benzyl substructure shown in the box. Note t h a t MAPS provides the level of correlation between the feature and the substructure (the fraction enclosed in brackets) and also formulates plausible fragment formulae for the fragment masses represented by each feature in the rule. Intensities in MS/MS scans are dependent on a large number of instrumental parameters and are not as important as the presence or absence of a feature; hence they play a decreased role in identifying the presence and absence of substructures (as opposed to most spectral-matching techniques). Therefore intensity classes rather than numerical intensities were used in the rules. Three such intensity classes are currently recognized: strong, medium, and weak, which correspond to relative intensities of 10-100%, 1-10%, and 01%, respectively. Incorporation of intensities and implementation of a fuzzy logic matching algorithm were found to improve the performance of the rules. If a compound provides greater than a certain percentage of features from an inclusion rule, that substructure is said to be present. The predictive capabilities of the rules were ascertained by applying them to the training set compounds, and thus the two categories of results are indicated: correct and incorrect predictions. Rule performance can be ascertained at several different levels of correlation or "match factors." This is shown for the benzyl rule in Table I. Different criteria exist for predicting the presence and absence of substructures. One can expect that certain features will be present in the MS and MS/MS data space whenever a specific substructure is present in compounds analyzed under similar instrumental conditions. Similarly, the absence of these features suggests the absence of the substructure. Thus rules for excluding a substructure should ideally contain features that correlate strongly with the presence of each substructure. These features may not be very useful, however, in rules for predicting the

Table 1. Predictive capabilities of the benzyl rule when applied to the MAPS training set Match factor

% Correct inclusions

% Incorrect inclusions

100 90 80 70 60

50 81 94 97 100

9 14 30 39 41

presence of that substructure if they also have moderate-to-high correlations with other substructures as well, because they produce incorrect predictions of the presence of substructures (false positives). Uniqueness factors were calculated for each feature in several inclusion rules (22). The uniqueness factor is defined as the ratio of the number of occurrences of a feature for a substructure to the number of occurrences of that feature in the database. It was found that MS/MS features generally had higher uniqueness factors than MS features and increased the reliability of the rules. To minimize false positives, the inclusion rules should contain features or combinations of features that have a high uniqueness factor for each substructure. MAPS currently uses simple matching of features in the rules against the features from the MS and MS/MS data set from compounds in the training set to ascertain the predictive capabilities of the rules. Further optimization of the rules, implementation of more sophisticated matching, and continued expansion of the training set have resulted in improved rule performance. EFG. Software has been developed to determine the possible empirical formulae for unknown compounds using medium resolution (0.1-1.0 amu resolution) MS and MS/MS data (23). Direct determination of empirical formulae cannot be accomplished by using such data because many formulae are often consistent with the molecular

Inclusion rule for the benzyl substructure If [28/32] strong intensity daughter ion at m/z 51 and [30/32] strong intensity daughter ion at m/z 77 and [25/32] strong intensity neutral loss of 2 amu and [25/32] medium intensity neutral loss of 26 amu and [30/32] medium intensity neutral loss of 28 amu and [30/32] strong intensity neutral loss of 28 amu and [27/32] strong intensity daughter of m/z 51 from m/z 77 then the benzyl substructure is present

C4H3 CeH5 H2 C2H2

C2H4 C2H4 C2H2

weight. In the empirical formula generator we have developed, MS and MS/ MS data are used to develop constraints on the elemental composition of an unknown and thus reduce the list of empirical formulae. Constraints can be developed from daughter spectra of isotopic molecular ions (24), from isotopic clusters from conventional mass spectra, and from substructures identified by MAPS. GENOA. The substructures identified by the application of MAPS rules or by other means can be used to formulate constraints for GENOA, a constrained structure generator developed during the course of the DENDRAL project. Substructural constraints take the form of a substructure definition and the number of occurrences (e.g., "constraint benzyl at least 1"). GENOA allows for overlapping substructures; each substructure does not have to encompass a unique portion of a molecule. The presence of alternate substructures (i.e., either substructure A or substructure B) can also be specified. Negative information (substructures known to be absent) can also be used to constrain the structure generation. Results from the application of exclusion rules are used to trim the list of candidate structures. Given an empirical formula and substructural constraints, GENOA produces all possible nonredundant structures. Most importantly, the structure of the unknown will always be contained within the set of structures produced by GENOA using correctly identified substructures as constraints. Structure elucidation methods that rely on spectral matching cannot guarantee that the list of "closest hits" will contain the structure of the unknown. Nor can they guarantee that the list of "closest hits" will reliably reflect the substructures (or functional groups) present in the unknown because these substructures are not directly taken into account. An identification procedure that uses known substructure information should therefore be inher-

ANALYTICAL CHEMISTRY, VOL. 59, NO. 23, DECEMBER 1, 1987 · 1369 A

FOCUS ently more reliable than spectral matching methods for true unknowns. The commercial GENOA software package includes a program called STRCHK (shown in Figure 2) that performs substructure searching. Given the structure of a compound and a library of predefined substructures, this procedure provides a list of substructures contained in the compound. These data are used by the MAPS software in developing spectral feature and substructure correlations. GENOA is being modified in this laboratory to better suit the purposes of ACES. A GENOA session originally required a single empirical formula. However, several formulae are often consistent with elemental composition data. Therefore additional software is being developed to automatically run GENOA structure generation sessions for each empirical formula. These sessions may use alternate empirical formulae and substructural constraints and will provide the user with the appropriate set of candidate structures. Modifications to STRCHK are being

made to provide automated substructure searching of a library of structures for training set compounds. The library of predefined substructures used for this purpose is also being constructed. In addition, STRCHK provides a method for organizing candidate structures into groups based on discriminating substructures and thus assists the user in determining what portions of the complete structure are unidentified. A simple example

To illustrate how these three software tools interact, di-n-octyl phthalate was treated as an unknown. This compound has a molecular weight of 390 and an empirical formula of C24H3804. When only the empirical formula was used as a constraint, GENOA produced more than 5000 structures before the program exceeded the memory capabilities of the computer. MS and MS/ MS spectra from this compound were fed into MAPS to obtain a list of substructures likely to be present in the unknown. The following substructures

Samples

TQMS Machine-level commands

Control High-level commands

Raw data out

Status

Diagnostics Error flags

Processed results

Expert System

User Interface Commands and queries User Figure 4. An expert system for the structure elucidation system using a TQMS. 1370 A · ANALYTICAL CHEMISTRY, VOL. 59, NO. 23, DECEMBER 1, 1987

are quickly indicated as being present with match factors of greater than 50%: benzoyl, methyl, ethyl, butyl, propyl, pentyl, hexyl, heptyl, octyl, carbonyl, carboxyl, ester, phthalate, phthalate ester, x-phenyl, and 1,2-phenyl. This information was used to formulate constraints for the EFG as well as for GENOA, because the empirical formula of substructures found will necessarily restrict the number and types of atoms present in the unknown. A molecular weight of 390 and the presence of the phthalate ester (C8H4O4) and the octyl (C8H17) substructures were used as constraints for the EFG (i.e., at least Ci 6 H 2 i04). The resulting two empirical formulae produced were C24H3804 and C24H22O5. The latter empirical formula was rejected by GENOA because it could not incorporate the substructures identified by MAPS. When the substructures identified by MAPS and the former empirical formula were used as constraints for GENOA, it took just a few minutes to indicate that only 89 possible structures exist. These represented the 89 isomeric possibilities for the second octyl substructure. If a future system could tell that there were two n-octyl groups in this molecule, then only one structure would be possible, that of di-M-octyl phthalate. When dimethylaniline was treated in the same fashion, an unambiguous identification of the structure resulted. Future prospects

Complete structure elucidation of unknowns is not always possible, necessary, or desirable. The analytical requirement may not be the complete identity of a sample compound. For large-molecular-weight species such as those of biological importance, determination of key substructures may be sufficient and will be made possible by using the kind of empirically based correlation techniques currently being developed. The ultimate goal of this work is to produce an intelligent system for structure elucidation that includes the TQMS in its feedback loop (shown in Figure 4). The TQMS will carry out diagnostic and confirmatory experiments, each time feeding its results back through "expert" interpretive tools to the user. The automated integration of these software tools is still being developed. The ACES approach is not limited to MS; it can develop rules from both known expertise and empirically derived correlations and can be extended to other multidimensional techniques to provide greatly enhanced diagnostic power for structure elucidation.

FOCUS References (1) Hirschfeld, T. Anal. Chem. 1980, 52, 297 A. (2) Holland, J. F., Enke, C. G.; Allison, J.; Stults, J. T.; Pinkston, J. D.; Newcome, B.; Watson, J. T. Anal. Chem. 1983, 55, 997 A. (3) Johnson, J. V.; Yost, R. A. Anal. Chem. 1985,57, 758 A. (4) Eckenrode, B. E.; Watson, J. T.; Hol­ land, J.; Enke, C. G. Int. J. Mass Spectrom. Ion Process., in press. (5) Laukien, F. H. Abstracts of Papers, 14th Annual FACSS Meeting, Detroit, Mich.; American Chemical Society, Washington, D.C., 1987; Abstract 524. (6) Wilkins, C. L. Anal. Chem. 1987, 59, 571 A. (7) Van Dalen, P. Presented at the French Scientific and Technical Press at Mesucora Physique, Paris, France, December 1985. (8) Graham, N. Artificial Intelligence— Making Machines Think; Tab Books: Blue Ridge Summit, Pa., 1979. (9) McDermott, J. AI Magazine 1982, 2, 22. (10) Small, G. W. Anal. Chem. 1987, 59, 535 A. (11) Rasmussen, G. T.; Isenhour, T. L. J. Chem. Inf. Comput. Sci. 1979,19, 179. (12) Martinsen, D. P.; Song, Β. Η. Mass Spectrom. Rev. 1985, 4, 461. (13) McLafferty, F. W.; Stauffer, D. B. J. Chem. Inf. Comput. Sci. 1985, 25, 245. (14) Barr, Α.; Feigenbaum, E. A. The Handbook of Artificial Intelligence; Heuristech: Stanford, 1982; Vol. II.

(15) Buchanan, B. G.; Smith, D. H.; White, W. C ; Gritter, R. J.; Feigenbaum, Ε. Α.; Lederberg, J.; Djerassi, C. J. Am. Chem. Soc. 197G, 98(20), 6168. (16) Carhart, R. E.; Smith, D. H.; Gray, N.A.B.; Nourse, J. G ; Djerassi, C. J. Am. Chem. Soc. 1981, 46,1708. (17) McLafferty, F. W. Interpretation of Mass Spectra; University Science Books: Mill Valley, Calif., 1980. (18) McLafferty, F. W.; Venkataraghavan, R. Mass Spectral Correlations; American Chemical Society: Washington, D.C., 1982. (19) Martinez, R. I.; Cooks, R. G. Towards Building a Practical MS/MS Database; 35th Annual Conference on Mass Spec­ trometry and Allied Topics; Denver, Colo., June 1987, pp. 1175-76. (20) Cross, K. P.; Enke, C. G. Comput. Chem. 1986,10,175. (21) Wade, A. P.; Palmer, P. T.; Hart, K. J.; Enke, C. G., submitted for publication in Anal. Chim. Acta. (22) Hart, K. J.; Wade, A. P.; Palmer, P. T.; Enke, C. G., submitted for publication in Anal. Chim. Acta. (23) Palmer, P. T.; Enke, C. G., submitted for publication in Int. J. Mass Spectrom. Ion Proc. (24) Bozorgzadeh, M. H.; Morgan, R. P.; Beynon, J. H. Analyst 1978,103, 613. This work was funded by grant GM-28254 from the National Institutes of Health. Thanks are due to Finnigan MAT for the use of and support for the Xerox 1108 AI Workstation. The authors also thank Molecular Design Ltd. for assisting in the acquisition and modification of GENOA.

Proceedings of the 1987 EPA/APCA Symposium on

MEASUREMENT

OF

TOXIC AND RELATED AIR POLLUTANTS

Proceedings of a joint confer­ ence sponsored by APCA and U.S. EPA's Environmental Monitoring Systems Laboratory. • Measurement of Semi-Volatiles in Ambient Air • Indoor Toxic Air Contaminants • Acidic Deposition • Volatile Organic Pollutants in Ambient Air

• Physical/Chemical Properties of Toxics • Dry Deposition—Methods Comparison • Atmospheric Transformations • Chemometrics/Data Analysis • Hazardous Waste Emissions • Integrated Air Cancer Project • Source Monitoring • Environmental Quality Assurance Reviewed and approved for publication in accordance with U.S. EPA policy, this 775-page landmark work is now availa­ ble from APCA.

APCA Publications P.O. Box 2861 Pittsburgh, PA 15230 Phone (412) 232-3444 Christie G. Enke is a professor of chemistry at Michigan State University {MSU). He received his Ph.D. from the University of Illinois in 1959. He is coauthor of Electronics a n d I n s t r u m e n t a t i o n for Scientists, codirector of the NIH/ MSU Mass Spectrometry Facility, and co-inventor of the TQMS. His research interests are in the broad area of computer applications in chemical analysis. Adrian P. Wade received his Ph.D. from the University of Wales in 1985 and is now an assistant professor of analytical chemistry at the University of British Columbia. In 1985 he was awarded the Harry Hallam Memorial Prize. Peter T. Palmer received his B.S. degree from Canisius College in 1983. He is a graduate student at MSU and is also a Quill Fellow. His research interests are automated structure elucidation and intelligent instrumentation. Kevin J. Hart received his B.S. from the University of Notre Dame in 1984. He is a graduate student at MSU. His interests include mass spectrometry and com­ puter applications in chemistry.

• Order code VIP-8A • Price S 70.00, APCA Members S45.00 • Please send prepayment. Name Company Address

D Send VÏP-8A (payment enclosed) D Send APCA Publications Catalog CIRCLE 2 ON READER SERVICE CARD

ANALYTICAL CHEMISTRY, VOL. 59, NO. 23, DECEMBER 1, 1987 · 1371 A