楼主: cshan
7851 10

2013_AppliedPredictiveModeling_Springer [推广有奖]

  • 0关注
  • 0粉丝

已卖:262份资源

硕士生

55%

还不是VIP/贵宾

-

威望
0
论坛币
3646 个
通用积分
0.2938
学术水平
6 点
热心指数
7 点
信用等级
6 点
经验
862 点
帖子
60
精华
0
在线时间
282 小时
注册时间
2008-7-20
最后登录
2023-8-19

楼主
cshan 发表于 2013-7-1 06:01:14 |AI写论文

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币
2013_AppliedPredictiveModeling_Springer.zip (10.03 MB, 需要: 3 个论坛币)
This text is intended for a broad audience as both an introduction to predictive models as well as a guide to applying them. It provides an intuitive explanations of the techniques while an emphasis on problem-solving with real data across a wide variety of applications will aid practitioners who wish to extend their expertise. Readers should have knowledge of basic statistical ideas, such as correlation and linear regression analysis.

Table of Contents

Preface

Chapter 1 Introduction

Prediction Versus Interpretation, Key Ingredients of Predictive Models; Terminology; Example Data Sets and Typical Data Scenarios; Overview; Notation (15 pages, 3 figures)


Part I: General Strategies

Chapter 2 A Short Tour of the Predictive Modeling Process

Case Study: Predicting Fuel Economy; Themes; Summary (8 pages, 6 figures, R packages used)

Chapter 3 Data Pre-Processing

Case Study: Cell Segmentation in High-Content Screening; Data Transformations for Individual Predictors; Data Transformations for Multiple Predictors; Dealing with Missing Values; Removing Variables; Adding Variables; Binning Variables; Computing; Exercises (32 pages, 11 figures, R packages used)

Chapter 4 Over-Fitting and Model Tuning

The Problem of Over-Fitting; Model Tuning; Data Splitting; Resampling Techniques; Case Study: Credit Scoring; Choosing Final Tuning Parameters; Data Splitting Recommendations; Choosing Between Models; Computing; Exercises (29 pages, 13 figures, R packages used)


Part II: Regression Models

Chapter 5 Measuring Performance in Regression Models

Quantitative Measures of Performance; The Variance-Bias Tradeoff; Computing (4 pages, 3 figures)

Chapter 6 Linear Regression and Its Cousins

Case Study: Quantitative Structure-Activity Relationship Modeling; Linear Regression; Partial Least Squares; Penalized Models; Computing; Exercises (37 pages, 20 figures, R packages used)

Chapter 7 Non-Linear Regression Models

Neural Networks; Multivariate Adaptive Regression Splines; Support Vector Machines; K-Nearest Neighbors; Computing; Exercises (28 pages, 10 figures, R packages used)

Chapter 8 Regression Trees and Rule-Based Models

Basic Regression Trees; Regression Model Trees; Rule-Based Models; Bagged Trees; Random Forests; Boosting; Cubist; Computing; Exercises (46 pages, 24 figures, R packages used)

Chapter 9 A Summary of Solubility Models

(3 pages, 3 figures)

Chapter 10 Case Study: Compressive Strength of Concrete Mixtures

Model Building Strategy; Model Performance; Optimizing Compressive Strength; Computing (12 pages, 5 figures, R packages used)


Part III: Classification Models

Chapter 11 Measuring Performance in Classification Models

Class Predictions; Evaluating Predicted Classes; Evaluating Class Probabilities; Computing (20 pages, 9 figures, R packages used)

Chapter 12 Discriminant Analysis and Other Linear Classification Models

Case Study; Logistic Regression; Linear Discriminant Analysis; Partial Least Squares Discriminant Analysis; Penalized Models; Nearest Shrunken Centroids; Computing; Exercises (52 pages, 20 figures, R packages used)

Chapter 13 Non-Linear Classification Models

Nonlinear Discriminant Analysis; Neural Networks; Flexible Discriminant Analysis; Support Vector Machines; K-Nearest Neighbors; Naive Bayes; Computing; Exercises (38 pages, 16 figures, R packages used)

Chapter 14 Classification Trees and Rule-Based Models

Basic Regression Trees; Rule-Based Models; Bagged Trees; Random Forests; Boosting; C5.0; Wrap-Up; Computing (46 pages, 15 figures, R packages used)

Chapter 15 A Summary of Grant Application Models

(3 pages, 2 figures)

Chapter 16 Remedies for Severe Class Imbalance

Case Study: Predicting Caravan Policy Ownership; The Effect of Class Imbalance; Model Tuning; Alternate Cutoffs; Adjusting Prior Probabilities; Unequal Case Weights; Sampling Methods; Cost-Sensitive Training; Computing; Exercises (24 pages, 7 figures, R packages used)

Chapter 17 Case Study: Job Scheduling

Data Splitting and Model Strategy; Results; Computing (13 pages, 6 figures, R packages used)


Part IV: Other Considerations

Chapter 18 Measuring Predictor Importance

Numeric Outcomes; Categorical Outcomes; Other Approaches; Computing; Exercises (24 pages, 10 figures, R packages used)

Chapter 19 An Introduction to Feature Selection

Consequences of Using Non-Informative Predictors; Approaches for Reducing the Number of Predictors; Wrappers Methods; Filter Methods; Selection Bias; Misuse of Feature Selection; Case Study: Predicting Cognitive Impairment; Computing; Exercises (34 pages, 7 figures, R packages used)

Chapter 20 Factors That Can Affect Model Performance

Type III Errors; Measurment Error in the Outcome; Measurement Error in the Predictors; Discretizing Continuous Outcomes; When Should You Trust Your Model’s Prediction?; The Impact of a Large Sample; Computing; Exercises (26 pages, 12 figures, R packages used)


Appendix

These are included in the sample pages on Spinger's website.

Appendix A A Summary of Various Models

Appendix B An Introduction to R

Startup and Getting Help; Packages; Creating Objects; Data Types and Basic Structures; Working with Rectangular Data Sets; Objects and Classes; R Functions; The Three Faces of =; The AppliedPredictiveModeling Package; The caret Package; Software Used in This Text (16 pages, 1 figure, R packages used)

Appendix C Interesting Websites


References

Index


二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

关键词:Predictive Modeling Springer Applied predict techniques knowledge audience emphasis intended

已有 1 人评分学术水平 热心指数 信用等级 收起 理由
leonkd + 1 + 1 + 1 奖励积极上传好的资料

总评分: 学术水平 + 1  热心指数 + 1  信用等级 + 1   查看全部评分

本帖被以下文库推荐

沙发
leonkd(真实交易用户) 在职认证  发表于 2013-7-4 08:32:41
没有第一章

藤椅
cshan(未真实交易用户) 发表于 2013-7-4 14:23:32
leonkd 发表于 2013-7-4 08:32
没有第一章
ch01.pdf (242.87 KB)   Uploaded Chapter 1 here. Thanks for your point.

板凳
mw89(真实交易用户) 发表于 2013-8-9 02:30:55
This is what I am looking for. Thanks added chapter 1.

报纸
dxystata(真实交易用户) 发表于 2013-9-19 16:55:16
谢谢分享!

地板
jgchen1966(真实交易用户) 发表于 2013-10-12 15:02:54
此书,不错,一年前始研究作者开发的R 包及其相关内容,现要合成如此众多的学习机器。本书不足是:没有
Ensemble Methods.
如下面十个机器,在进行5*3 CV 参数TUNING 后的最优model 的绩效:


Call:
summary.resamples(object = resamps)

Models: gbmM, svmRadialM, svmPolyM, rfM, cforestM, blackboostM, gamboostM, glmboostM, glmnetM, hddaM
Number of resamples: 15

ROC
              Min. 1st Qu. Median   Mean 3rd Qu.   Max. NA's
gbmM        0.8118  0.8915 0.9241 0.9131  0.9471 0.9647    0
svmRadialM  0.9160  0.9353 0.9643 0.9582  0.9756 1.0000    0
svmPolyM    0.8319  0.8965 0.9451 0.9308  0.9793 1.0000    0
rfM         0.7922  0.9085 0.9294 0.9222  0.9569 0.9922    0
cforestM    0.7412  0.8487 0.8950 0.8861  0.9196 0.9804    0
blackboostM 0.6529  0.7864 0.8176 0.8219  0.8710 0.9196    0
gamboostM   0.7843  0.8675 0.9018 0.8963  0.9289 0.9922    0
glmboostM   0.7689  0.8113 0.8549 0.8512  0.8948 0.9216    0
glmnetM     0.8235  0.8657 0.8863 0.8896  0.9196 0.9554    0
hddaM       0.8137  0.8560 0.8863 0.8864  0.9216 0.9688    0
鹑居鷇食,鸟行无彰

7
jgchen1966(真实交易用户) 发表于 2013-10-12 15:03:16
Sens
              Min. 1st Qu. Median   Mean 3rd Qu.   Max. NA's
gbmM        0.7500  0.8180 0.8824 0.8728  0.9412 1.0000    0
svmRadialM  0.7059  0.8529 0.9375 0.9015  0.9412 1.0000    0
svmPolyM    0.7059  0.8235 0.9375 0.8860  0.9412 1.0000    0
rfM         0.7647  0.8824 0.9375 0.9049  0.9412 1.0000    0
cforestM    0.7500  0.8235 0.9375 0.8966  0.9412 1.0000    0
blackboostM 0.5882  0.7279 0.8235 0.7985  0.8824 0.9412    0
gamboostM   0.6471  0.8180 0.8824 0.8490  0.9099 1.0000    0
glmboostM   0.5294  0.6471 0.7647 0.7387  0.8180 0.8824    0
glmnetM     0.5882  0.7647 0.8750 0.8380  0.9099 0.9412    0
hddaM       0.4706  0.6176 0.7059 0.6995  0.7886 0.8235    0
鹑居鷇食,鸟行无彰

8
jgchen1966(真实交易用户) 发表于 2013-10-12 15:03:50
Spec
              Min. 1st Qu. Median   Mean 3rd Qu.   Max. NA's
gbmM        0.4667  0.7000 0.7857 0.7771  0.8667 0.9333    0
svmRadialM  0.6000  0.7595 0.8571 0.8263  0.9286 0.9333    0
svmPolyM    0.6667  0.7238 0.8667 0.8448  0.9310 1.0000    0
rfM         0.4667  0.7143 0.8000 0.7432  0.8571 0.8667    0
cforestM    0.3333  0.6000 0.6429 0.6575  0.7333 0.8667    0
blackboostM 0.2000  0.5333 0.6000 0.6203  0.7595 0.8000    0
gamboostM   0.6000  0.7262 0.8571 0.7984  0.8667 1.0000    0
glmboostM   0.5333  0.7143 0.7333 0.7438  0.8286 0.9333    0
glmnetM     0.6000  0.7238 0.8000 0.7800  0.8619 0.8667    0
hddaM       0.7333  0.8571 0.9333 0.9044  1.0000 1.0000    0
鹑居鷇食,鸟行无彰

9
jgchen1966(真实交易用户) 发表于 2013-10-12 15:06:41
数据是mlbench 中的Sonar 的随机选2/3 进行培训
用何种model 来最终对test 数据集作预测???
鹑居鷇食,鸟行无彰

10
zhengjie0521(真实交易用户) 发表于 2014-6-12 09:08:26
好像多一点币,就可以下载这本书啦,哎!

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
加好友,备注jltj
拉您入交流群
GMT+8, 2025-12-24 12:48