Publisher's Synopsis
The potential business advantages of data mining are well documented in publications for executives and managers. However, developers implementing major data-mining systems need concrete information about the underlying technical principles and their practical manifestations in order to either integrate commercially available tools or write data-mining programs from scratch. This book is the first technical guide to provide a complete, generalized roadmap for developing data-mining applications, together with advice on performing these large-scale, open-ended analyses for real-world data warehouses. A state-of-the-art data-mining software kit accompanies the book. The software, which is delivered through a special web site, is a collection of routines for efficient mining of big data. Both classical and the more computationally expensive state-of-the-art prediction methods are included. Using a standard spreadsheet data format, this kit implements all of the data-mining tasks described in the book. The software is available for Windows 95/NT and Unix platforms (no need to specify when ordering).