Data Mining: Concepts, Models, Methods, and Algorithms - American

ends with relevant references. The book is divided into 12 chapters and 2 appendices. Chapter 1 introduces the data-mining process—including data co...
53 downloads 8 Views 2MB Size
books in review

Data Mining: Concepts, Models, Methods, and Algorithms By Mehmed Kantardzic Wiley-IEEE Press:Totowa, NJ, 2003, 343 pp ISBN 0-471-22852-4 $59.95 ritten by a well-established expert in computer science and data-mining methods, Data Mining: Concepts, Models, Methods, and Algorithms provides an introduction to the concepts, widely used algorithms, and visualization methods used to extract information from databases. The book is designed to provide supporting material for undergraduate and graduate classes and includes review questions and problems at the end of each chapter. The book also serves as an excellent starting point for anyone wishing to learn about data mining. Each chapter starts with a set of objectives and an introduction and ends with relevant references. The book is divided into 12 chapters and 2 appendices. Chapter 1 introduces the data-mining process—including data collection, preprocessing, and analysis—and discusses the important concept of a data warehouse. Chapter 2 describes the principles of data representation, techniques for data preparation, and methods for addressing incomplete datasets. The final section, which describes outlier analysis, is particularly useful. Chapter 3 explains the advantages of data reduction as part of preprocessing the data and describes the methods that can be applied. Chapter 4 then covers the general theoretical background of data mining and discusses the philosophy of learning from data.

W

Chapters 5 through 11, however, constitute the real meat of the book, covering the various algorithmic approaches to data mining. Chapters 5 and 6 cover standard statistical and clustering methods; chapters 7 and 8 deal with decision trees and association rules; and chapters 9, 10, and 11 describe artificial neural networks, genetic algorithms, and fuzzy logic. Each method and its fundamentals are well introduced and illustrated with appropriate examples. Then the underlying mathematical model is described in more depth. The final chapter deals with the perception and visualization of data, both as a method of data mining and as a means to interpret its results. The book provides useful supporting material in the form of a comprehensive survey of available datamining tools in Appendix A and a complete set of references. Finally, Appendix

BOOKS RECEIVED Posttranslational Modification of Proteins Edited by Christoph Kannicht Humana Press:Totowa, NJ, 2002, 322 pp ISBN 0-89603-678-2 $99.50 Volume 194 of the Methods in Molecular Biology series, this book is a compendium of methods for detecting and analyzing the posttranslational modifications of protein, particularly with regard to protein function, proteome research, and the characterization of pharmaceutical proteins. Among the methods presented are those for analyzing the assignment of disulfide bond sites and N- and O-glycosylation sites. Other techniques facilitate the analysis of glycosylphosphatidylinositols; lipid modifications; protein phosphorylation, sulfation, methylation, and acetylation; α-amidation; γ-glutamate; isoaspartate; and lysine hydroxylation.

Enzyme Kinetics: A Modern Approach By Alejandro Marangoni John Wiley & Sons: Hoboken, NJ, 2002, 229 pp ISBN 0-471-15985-9 $89.95 Because enzymes are new drug targets as well as useful synthetic catalysts, the comprehension of their kinetic behavior and catalytic properties is vital to the chemical,

334 JournalofProteom e Research •Vol. 2, No. 3, 2003

B details examples of the application of data-mining methods to real-world problems from a wide variety of industry sectors such as finance, retail, engineering, and biomedical research. Overall, this is a comprehensive textbook that describes the process and methodologies of data mining in an unbiased manner. It provides a well-laid-out discussion of the choice of data-mining method as an iterative procedure that allows one to explore the data as part of an effort to elucidate the underlying patterns hidden within it. Readers who seek to understand not only data-mining concepts but also the mathematical models that can be applied will find this book particularly useful. DAVID EDWARDS

Director, Computational Proteomics, Accelrys, Inc.

pharmaceutical, and food science industries. This book is a practical how-to guide for evaluating enzyme kinetics. The author emphasizes models: how researchers develop them, their limitations, and practical ways to use them to analyze enzyme kinetic data. Covering both the principles of enzyme kinetics and the applications of the mathematical tools necessary for analysis, each chapter concisely presents the progression of a model from first principles to applications.

Data Acquisition Techniques Using PCs, 2nd Edition By Howard Austerlitz Academic Press: NewYork, 2002, 544 pp ISBN 0-12-068377-6 $79.95 Because most instrument-based experimentation and data analysis is performed through a PC interface, it is critical to understand how PC-based data-acquisition systems work. This book covers all aspects of setting up a system, with topics ranging from sensors to data-acquisition cards, as well as PC hardware and software. Similarly, the author describes the steps involved in transforming analog results into a digital language that the computer can comprehend. The book covers topics that are critical to most areas of industry, including instrument applications in electrical engineering (e.g., control and robotics), biomedical engineering, and mechanical systems.

© 2003 American Chemical Society