e d i to ri a l
Think Carefully, and Take More Data T
he art of analysis has long consisted of creating measurement strategies that have the greatest possible selectivity and sensitivity. Although this approach has well served science, society, and the discipline, it increasingly fails to exploit two of the greatest assets available to modern measurement science: cheap data and fast computers. More than two decades after their commercial introduction, learning set-based analytical methods continue to impress me, with near-IR (NIR) reflectance being the most widely known. These methods thrive on the opposite of selectivity, measuring everything and carefully ignoring most of it. The value of this strategy seems to have only partially penetrated the research community. Learning set methods now include such remarkable measurements as blood glucose through skin (also NIR) and automated fruit and vegetable recognition for supermarket checkouts (visible spectral reflectance). Their success depends on learning sets that encompass samples across the full concentration range for the target analyte as well as multiple examples of all possible interfering components. There is a bit of a “chicken and egg” problem with this approach, in that the learning set samples must be “known” for the target species— easy for apples, harder for glucose. However, time, expense, separation requirements, and sensitivity can yield methods that are impractical for most measurement problems but appropriate for learning set creation. The neglect of the opportunity in academic circles likely stems from an opinion that this “anti-intellectual” approach is unworthy of the scholarly environment. A similar reaction to combinatorial synthesis was and still is strong in the medicinal chemistry community, although it continues to revolutionize industrial pharmaceutical practice. Such an attitude may be understandable to those of us who expect to make our livings through careful thinking; however, a second look reveals this attitude to be shortsighted. Learning set-based methods have analogies in other areas of technology. They constitute a decision to make certain activities historically performed serially, highly parallel. We ignore them at our peril. Supercomputing is now dominated by massively parallel, modest-performance processor arrays, which are making obsolete companies that pursued high-performance serial strategies. Structure optimization in the pharmaceutical industry
is universally pursued by synthesizing many compounds in parallel rather than a few serially. These trends are driven by technology and competition. Modest processors are cheap; parallel synthesis makes test compounds cheap. A comparable situation exists for measurement science. Modern, automated methods have made measurement data cheap. Fast computing and enormous data storage are also cheap. The exploitation of these resources opens a measurement pathway that runs counter to the cultivated instincts of many practitioners. These ideas meet in measurements on highly complex systems. Among the most complex and interesting samples are living organisms. Biologists have prospered by incrementally expanding understanding by using samples that are too hopelessly complex to grasp in complete detail. To date, much of this knowledge has remained qualitative. We physical scientists can benefit from the biologist’s model and still contribute our own quantitative rigor. Optical imaging offers an extreme example of large data volume (cheap data). Without much difficulty, megabytes of data per second can be generated with near single molecule sensitivity. The accelerating effort to monitor complete protein expression (the proteome) will provide learning set data relevant to a variety of signaling, enzyme activity, and gene activity functions. Consumer electronics technology has created fast, inexpensive image processing capabilities, which only the military could have afforded a few years ago. These capabilities are the ingredients of methods that could be used to monitor activity in living cells. Such tools would have a significant impact in research and commercial labs. The challenge of creating robust learning sets and the mathematics of extracting information from these mountains of data are far from anti-intellectual. New technologies are opening exciting and powerful measurement methods for those who have the insight to exploit them. Opportunities abound for those who think carefully and take more data.
Timothy Harris Praelux, Inc.
[email protected] N O V E M B E R 1 , 2 0 0 0 / A N A LY T I C A L C H E M I S T R Y
669 A
EDITOR Royce W. Murray University of North Carolina
ASSOCIATE EDITORS Daniel W. Armstrong
Reinhard Niessner
Iowa State University/Ames Laboratory
Technische Universität München (Germany)
Catherine C. Fenselau
Robert A. Osteryoung
University of Maryland
North Carolina State University
William S. Hancock
Edward S. Yeung
ThermoQuest/Finnigan
Iowa State University/Ames Laboratory
EDITORIAL HEADQUARTERS Research section Department of Chemistry Venable and Kenan Laboratories University of North Carolina Chapel Hill, NC 27599-3290 Phone: 919-962-2541; Fax: 919-962-2542; E-mail:
[email protected] A-page section 1155 16th St., N.W., Washington, DC 20036 Phone: 202-872-4570;TDD: 202-872-6076 Fax: 202-872-4574; E-mail:
[email protected] Managing Editor: Alan R. Newman Associate Editors: Britt Erickson, Felicia Wach, Elizabeth Zubritsky Editorial Assistant: Wilder Damian Smith Web Editor: Christine Brennan Contributing Editors: Gerald Keller, U.S., Marcia Vogel, U.S., Thomas J. Wenzel, Bates College Production Editor: Doug Roemer Art Director: Sean Kennedy Electronic Composition: Yang H. Ku Journals Associate Editor: Lorraine Gibb Circulation Manager: Scott Nathan
Editorial Advisory Board
William B. Whitten
John Fetzer
Oak Ridge National Laboratory
Chevron Research andTechnology
Luc Bousse
Stephen A. Wise
Robert T. Kennedy
CaliperTechnologies
University of Florida
Frank V. Bright
National Institute of Standards andTechnology
State University of New York at Buffalo
Vicki H. Wysocki
University of Southern Denmark
Steven A. Carr
University of Arizona
Kay Niemax
Robert M. Corn
Ex-Officio Member
Institute for Spectrochemistry and Applied Spectroscopy (Germany)
University of Wisconsin–Madison
Theodore R. Williams
J. David Pinkston
Andrew Ewing
College of Wooster
Procter & Gamble
SmithKline Beecham
Matthias Mann
Kimberly Prather
Pennsylvania State University
Totaro Imasaka
A-page Advisory Panel
University of California–Riverside
Kyushu University (Japan)
Klaus Albert
Peter Schoenmakers
Viorica Lopez-Avila
University ofTubingen (Germany)
Midwest Research Institute
Philip Bartlett
Shell Research andTechnology (The Netherlands)
Andreas Manz
University of Southhampton (U.K.)
Robert L. St. Claire
Imperial College of Science,Technology and Medicine (U.K.)
Alain Berthod
Triangle Pharmaceuticals
Zbigniew Stojek
Klaus H. Mosbach
National Center for Scientific Research (France)
University of Lund (Sweden)
Tibor Braun
Karen Wahl
Janusz Pawliszyn
Eötvös University (Hungary)
Pacific Northwest National Laboratory
University of Waterloo (Canada)
Sylvia Daunert
Douglas Westerlund
Antonio J. Ricco
University of Kentucky
Uppsala University (Sweden)
ACLARA Biosciences
Marta E. Diaz-Garcia
John Yates
Robert S. Rush
Universidad de Oviedo (Spain)
Scripps Research Institute
Amgen
Francesco Dondi
Renato Zenobi
Isiah M. Warner
University of Ferrara (Italy)
Louisiana State University
Christine E. Evans
Swiss Federal Institute ofTechnology Zurich
University of Warsaw (Poland)
University of Michigan
Publications Division Director: Robert D. Bovenschulte Director, Publishing Operations: Mary E. Scanlan General Manager, Special Publications: Mary Warner Journals Editing Manager: Debora A. Bittaker General Manager, Publishing & Creative Services: William Succolosky Manager, Copy Editing: Elizabeth Wood Manager, Production & Imaging: Vincent L. Parker Creative Director, Publishing & Creative Services: Julie Farrar