楼主: zytka
5398 18

[下载]Data Mining Methods and Models [推广有奖]

  • 0关注
  • 0粉丝

高中生

42%

还不是VIP/贵宾

-

威望
0
论坛币
854 个
通用积分
0
学术水平
1 点
热心指数
1 点
信用等级
1 点
经验
6360 点
帖子
13
精华
0
在线时间
34 小时
注册时间
2009-1-2
最后登录
2021-3-11

相似文件 换一批

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币
Wiley-IEEE Press
2006-01-30
ISBN:0471666564
344 pages
PDF--- 6 MB

Data Mining Methods and Models:
* Applies a "white box" methodology, emphasizing an understanding of the model structures underlying the softwareWalks the reader through the various algorithms and provides examples of the operation of the algorithms on actual large data sets, including a detailed case study, "Modeling Response to Direct-Mail Marketing"
* Tests the reader's level of understanding of the concepts and methodologies, with over 110 chapter exercises
* Demonstrates the Clementine data mining software suite, WEKA open source data mining software, SPSS statistical software, and Minitab statistical software
* Includes a companion Web site, www.dataminingconsultant.com, where the data sets used in the book may be downloaded, along with a comprehensive set of data mining resources. Faculty adopters of the book have access to an array of helpful resources, including solutions to all exercises, a PowerPoint(r) presentation of each chapter, sample data mining course projects and accompanying data sets, and multiple-choice chapter quizzes.
With its emphasis on learning by doing, this is an excellent textbook for students in business, computer science, and statistics, as well as a problem-solving reference for data analysts and professionals in the field.
An Instructor's Manual presenting detailed solutions to all the problems in the book is available onlne.


PREFACE xi
1 DIMENSION REDUCTION METHODS 1
   Need for Dimension Reduction in Data Mining 1
   Principal Components Analysis 2
   Applying Principal Components Analysis to the Houses Data Set 5
   How Many Components Should We Extract? 9
   Profiling the Principal Components 13
   Communalities 15
   Validation of the Principal Components 17
   Factor Analysis 18
   Applying Factor Analysis to the Adult Data Set 18
   Factor Rotation 20
   User-Defined Composites 23
   Example of a User-Defined Composite 24
   Summary 25
   References 28
   Exercises 28
2 REGRESSION MODELING 33
   Example of Simple Linear Regression 34
   Least-Squares Estimates 36
   Coefficient of Determination 39
   Standard Error of the Estimate 43
   Correlation Coefficient 45
   ANOVA Table 46
   Outliers, High Leverage Points, and Influential Observations 48
   Regression Model 55
   Inference in Regression 57
   t-Test for the Relationship Between x and y 58
   Confidence Interval for the Slope of the Regression Line 60
   Confidence Interval for the Mean Value of y Given x 60
   Prediction Interval for a Randomly Chosen Value of y Given x 61
   Verifying the Regression Assumptions 63
   Example: Baseball Data Set 68
   Example: California Data Set 74
   Transformations to Achieve Linearity 79
   Box–Cox Transformations 83
   Summary 84
   References 86
   Exercises 86
vii
viii CONTENTS
3 MULTIPLE REGRESSION AND MODEL BUILDING 93
   Example of Multiple Regression 93
   Multiple Regression Model 99
   Inference in Multiple Regression 100
   t-Test for the Relationship Between y and xi 101
   F-Test for the Significance of the Overall Regression Model 102
   Confidence Interval for a Particular Coefficient 104
   Confidence Interval for the Mean Value of y Given x1, x2, . . ., xm 105
   Prediction Interval for a Randomly Chosen Value of y Given x1, x2, . . ., xm 105
   Regression with Categorical Predictors 105
   Adjusting R2: Penalizing Models for Including Predictors That Are
   Not Useful 113
   Sequential Sums of Squares 115
   Multicollinearity 116
   Variable Selection Methods 123
   Partial F-Test 123
   Forward Selection Procedure 125
   Backward Elimination Procedure 125
   Stepwise Procedure 126
   Best Subsets Procedure 126
   All-Possible-Subsets Procedure 126
   Application of the Variable Selection Methods 127
   Forward Selection Procedure Applied to the Cereals Data Set 127
   Backward Elimination Procedure Applied to the Cereals Data Set 129
   Stepwise Selection Procedure Applied to the Cereals Data Set 131
   Best Subsets Procedure Applied to the Cereals Data Set 131
   Mallows’ Cp Statistic 131
   Variable Selection Criteria 135
   Using the Principal Components as Predictors 142
   Summary 147
   References 149
   Exercises 149
4 LOGISTIC REGRESSION 155
   Simple Example of Logistic Regression 156
   Maximum Likelihood Estimation 158
   Interpreting Logistic Regression Output 159
   Inference: Are the Predictors Significant? 160
   Interpreting a Logistic Regression Model 162
   Interpreting a Model for a Dichotomous Predictor 163
   Interpreting a Model for a Polychotomous Predictor 166
   Interpreting a Model for a Continuous Predictor 170
   Assumption of Linearity 174
   Zero-Cell Problem 177
   Multiple Logistic Regression 179
   Introducing Higher-Order Terms to Handle Nonlinearity 183
   Validating the Logistic Regression Model 189
   WEKA: Hands-on Analysis Using Logistic Regression 194
   Summary 197
   References 199
   Exercises 199
5 NAIVE BAYES ESTIMATION AND BAYESIAN NETWORKS 204
   Bayesian Approach 204
   Maximum a Posteriori Classification 206
   Posterior Odds Ratio 210
   Balancing the Data 212
   Na˙ıve Bayes Classification 215
   Numeric Predictors 219
   WEKA: Hands-on Analysis Using Naive Bayes 223
   Bayesian Belief Networks 227
   Clothing Purchase Example 227
   Using the Bayesian Network to Find Probabilities 229
   WEKA: Hands-On Analysis Using the Bayes Net Classifier 232
   Summary 234
   References 236
   Exercises 237
6 GENETIC ALGORITHMS 240
   Introduction to Genetic Algorithms 240
   Basic Framework of a Genetic Algorithm 241
   Simple Example of a Genetic Algorithm at Work 243
   Modifications and Enhancements: Selection 245
   Modifications and Enhancements: Crossover 247
   Multipoint Crossover 247
   Uniform Crossover 247
   Genetic Algorithms for Real-Valued Variables 248
   Single Arithmetic Crossover 248
   Simple Arithmetic Crossover 248
   Whole Arithmetic Crossover 249
   Discrete Crossover 249
   Normally Distributed Mutation 249
   Using Genetic Algorithms to Train a Neural Network 249
   WEKA: Hands-on Analysis Using Genetic Algorithms 252
   Summary 261
   References 262
   Exercises 263
7 CASE STUDY: MODELING RESPONSE TO DIRECT MAIL MARKETING 265
   Cross-Industry Standard Process for Data Mining 265
   Business Understanding Phase 267
   Direct Mail Marketing Response Problem 267
   Building the Cost/Benefit Table 267
   Data Understanding and Data Preparation Phases 270
  Clothing Store Data Set 270
  Transformations to Achieve Normality or Symmetry 272
  Standardization and Flag Variables 276
  x CONTENTS
   Deriving New Variables 277
   Exploring the Relationships Between the Predictors and the Response 278
  Investigating the Correlation Structure Among the Predictors 286
  Modeling and Evaluation Phases 289
  Principal Components Analysis 292
  Cluster Analysis: BIRCH Clustering Algorithm 294
  Balancing the Training Data Set 298
  Establishing the Baseline Model Performance 299
  Model Collection A: Using the Principal Components 300
  Overbalancing as a Surrogate for Misclassification Costs 302
  Combining Models: Voting 304
  Model Collection B: Non-PCA Models 306
  Combining Models Using the Mean Response Probabilities 308
  Summary 312
  References 316
INDEX 317
二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

关键词:Data Mining Methods models Method model 下载 Methods models Mining Data

Data.Mining.rar

4.92 MB

需要: 3 个论坛币  [购买]

本附件包括:

  • figure2_5.JPG

本帖被以下文库推荐

沙发
zhongzihong 发表于 2009-8-12 08:53:31 |只看作者 |坛友微信交流群
太贵
本文来自: 人大经济论坛 详细出处参考:http://www.pinggu.org/bbs/viewth ... &from^^uid=629211
曾经错过

使用道具

藤椅
henry1225 发表于 2009-10-14 18:03:39 |只看作者 |坛友微信交流群
买了……3个币

使用道具

板凳
hr1230 发表于 2009-10-15 15:22:23 |只看作者 |坛友微信交流群
顶一下,先
aaa

使用道具

报纸
chin 发表于 2009-12-4 13:23:28 |只看作者 |坛友微信交流群
这本书楼主卖的最便宜
赞叹!

使用道具

地板
alixjr 发表于 2009-12-8 11:54:57 |只看作者 |坛友微信交流群
this is a great book

使用道具

7
yangponingsui 发表于 2010-8-3 11:04:48 |只看作者 |坛友微信交流群
http://www.pinggu.org/bbs/viewth ... %2BMethods%2BModels
卖1论坛币,有些却卖10论坛币,真是不比不知道,一比吓一跳

使用道具

8
yangponingsui 发表于 2010-8-3 11:15:38 |只看作者 |坛友微信交流群
还有免费的,我下了个免费的,呵呵

使用道具

9
m8843620 发表于 2011-5-27 12:37:22 |只看作者 |坛友微信交流群
謝謝樓主的分享

使用道具

10
cg7101 发表于 2011-5-28 01:55:03 |只看作者 |坛友微信交流群
thanks for your doing

使用道具

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
加好友,备注cda
拉您进交流群

京ICP备16021002-2号 京B2-20170662号 京公网安备 11010802022788号 论坛法律顾问:王进律师 知识产权保护声明   免责及隐私声明

GMT+8, 2024-5-11 02:11