Discovering Knowledge in Data: An Introduction to Data Mining
John Wiley & Sons, 28 de jan. de 2005 - 222 páginas
Learn Data Mining by doing data mining
Data mining can be revolutionary-but only when it's done right. The powerful black box data mining software now available can produce disastrously misleading results unless applied by a skilled and knowledgeable analyst. Discovering Knowledge in Data: An Introduction to Data Mining provides both the practical experience and the theoretical insight needed to reveal valuable information hidden in large data sets.
Employing a "white box" methodology and with real-world case studies, this step-by-step guide walks readers through the various algorithms and statistical structures that underlie the software and presents examples of their operation on actual large data sets. Principal topics include:
* Data preprocessing and classification
* Exploratory analysis
* Decision trees
* Neural and Kohonen networks
* Hierarchical and k-means clustering
* Association rules
* Model evaluation techniques
Complete with scores of screenshots and diagrams to encourage graphical learning, Discovering Knowledge in Data: An Introduction to Data Mining gives students in Business, Computer Science, and Statistics as well as professionals in the field the power to turn any data warehouse into actionable knowledge.
An Instructor's Manual presenting detailed solutions to all the problems in the book is available online.
O que estão dizendo - Escrever uma resenha
Não encontramos nenhuma resenha nos lugares comuns.
1 INTRODUCTION TO DATA MINING
2 DATA PREPROCESSING
3 EXPLORATORY DATA ANALYSIS
4 STATISTICAL APPROACHES TO ESTIMATION AND PREDICTION
5 kNEAREST NEIGHBOR ALGORITHM
6 DECISION TREES
7 NEURAL NETWORKS
Outras edições - Visualizar todos
adjustment apply asparagus assets association rules beans C4.5 algorithm candidate split categorical variables cereal Chapter classification cluster center cluster membership correlated credit risk CRISP–DM customer service calls data mining data mining methods data mining model data set database day minutes decision node Discovering Knowledge entropy error rate estimation and prediction Euclidean distance example exploratory data analysis false negative false positive field values hidden layer histogram income input International Plan k-means k-means algorithm k-nearest neighbor algorithm Knowledge in Data Kohonen networks leaf node lift chart marital status medium min–max normalization minimum Minitab missing values neural network number of customer numerical variables nutritional rating outlier output node patient point estimate prediction interval predictor variables priori algorithm ratio represents sample scatter plot self-organizing maps shown in Figure squash statistical subset sugar Suppose Table target variable task training set uncover unsupervised learning VoiceMail Plan wcurrent weights Z-score standardization zero