楼主: neuroexplorer
3552 5

[学习分享] Information Value Statistics [推广有奖]

  • 5关注
  • 23粉丝

已卖:5901份资源

学科带头人

79%

还不是VIP/贵宾

-

威望
0
论坛币
29250 个
通用积分
850.5514
学术水平
58 点
热心指数
75 点
信用等级
63 点
经验
176544 点
帖子
3215
精华
0
在线时间
1416 小时
注册时间
2013-7-21
最后登录
2025-10-2

楼主
neuroexplorer 发表于 2016-2-19 12:25:51 |AI写论文

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币
The Information Value (IV) statistic is a popular screener for selecting predictor variables for binary logistic regression. Familiar, but perhaps mysterious, guidelines for deciding if the IV of a predictor X is high enough to use in modeling are given in many  textbooks on credit scoring. For example, these texts say that IV > 0.3 shows X to be a strong predictor. These guidelines must be considered in the context of binning. A common practice in preparing a predictor X is to bin the levels of X to remove outliers and reveal a trend. But IV decreases as the levels of X are collapsed. This paper has two goals: (1) Provide a method for collapsing the levels of X which maximizes IV at each iteration and (2) show how the guidelines (e.g. IV > 0.3) relate to other measures of predictive power. All data processing was performed using Base SAS®.

Information Value Statistics.pdf (556.64 KB)




二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

关键词:information Statistics Informatio formation statistic guidelines practice perhaps example popular

沙发
西门高 发表于 2016-2-19 12:36:34
谢谢分享

藤椅
aqaq22 发表于 2016-2-19 15:13:21
谢谢分享

板凳
neuroexplorer 发表于 2016-2-20 01:00:04
IV (information value is a good way to select variables that are relevant.

报纸
67890 发表于 2016-2-20 03:26:26
Good reference. By any chance do you have his Macro code? He has several articles and SAS macros.

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
加好友,备注cda
拉您进交流群
GMT+8, 2025-12-9 03:22