Original poster: neuroexplorer

[Study Sharing] Information Value Statistics

The Information Value (IV) statistic is a popular screener for selecting predictor variables for binary logistic regression. Many textbooks on credit scoring give familiar, but perhaps mysterious, guidelines for deciding whether the IV of a predictor X is high enough to use in modeling. For example, these texts say that IV > 0.3 shows X to be a strong predictor. These guidelines must be considered in the context of binning. A common practice in preparing a predictor X is to bin the levels of X to remove outliers and reveal a trend, but IV decreases as the levels of X are collapsed. This paper has two goals: (1) provide a method for collapsing the levels of X that maximizes IV at each iteration, and (2) show how the guidelines (e.g. IV > 0.3) relate to other measures of predictive power. All data processing was performed using Base SAS®.
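
For readers who want to see the idea in code, here is a minimal sketch in Python (not the paper's Base SAS code, which is not reproduced in this thread). It assumes the usual credit-scoring definition of IV, the sum over bins of (g_i - b_i) * ln(g_i / b_i), where g_i and b_i are the shares of each outcome class that fall in bin i. The function names, the sample counts, and the choice to merge only neighbouring bins are illustrative assumptions; the paper's collapsing method may differ.

```python
import math


def information_value(goods, bads):
    """IV from per-bin counts of the two outcome classes (every count must be > 0)."""
    total_g, total_b = sum(goods), sum(bads)
    iv = 0.0
    for g, b in zip(goods, bads):
        pg, pb = g / total_g, b / total_b  # share of each class falling in this bin
        iv += (pg - pb) * math.log(pg / pb)
    return iv


def best_adjacent_merge(goods, bads):
    """Try each merge of two neighbouring bins; keep the one that leaves the highest IV."""
    best = None
    for i in range(len(goods) - 1):
        g = goods[:i] + [goods[i] + goods[i + 1]] + goods[i + 2:]
        b = bads[:i] + [bads[i] + bads[i + 1]] + bads[i + 2:]
        iv = information_value(g, b)
        if best is None or iv > best[0]:
            best = (iv, g, b)
    return best


# Hypothetical counts for a predictor X with four ordered levels.
goods = [100, 200, 300, 400]
bads = [40, 30, 20, 10]

iv_before = information_value(goods, bads)
iv_after, goods_merged, bads_merged = best_adjacent_merge(goods, bads)
print(f"IV with 4 bins: {iv_before:.4f}")
print(f"IV after the best single merge: {iv_after:.4f}")
```

Running one merge step shows the effect the abstract describes: collapsing levels cannot raise IV, so choosing the merge that keeps IV highest limits the loss at each iteration. Repeating `best_adjacent_merge` on its own output would collapse X one level at a time.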

Information Value Statistics.pdf (556.64 KB)





Reply 1
西门高, posted 2016-2-19 12:36:34
Thanks for sharing.


Reply 2
aqaq22, posted 2016-2-19 15:13:21
Thanks for sharing.


Reply 3
neuroexplorer, posted 2016-2-20 01:00:04
IV (information value) is a good way to select relevant variables.


Reply 4
67890, posted 2016-2-20 03:26:26
Good reference. By any chance, do you have the author's macro code? He has several articles and SAS macros.
