楼主: 420948492
3883 8

clementine软件操作问题 [推广有奖]

  • 2关注
  • 37粉丝

版主

已卖:108份资源

院士

48%

还不是VIP/贵宾

-

威望
1
论坛币
724 个
通用积分
18.8346
学术水平
80 点
热心指数
89 点
信用等级
62 点
经验
13471 点
帖子
3689
精华
3
在线时间
2983 小时
注册时间
2007-10-16
最后登录
2025-1-14

楼主
420948492 发表于 2009-9-12 23:53:13 |AI写论文
100论坛币
在clementine12中,每个模型的节点都有都有variable importance选项,请问计算的原理是什么,不同的模型,比如决策树和神经网络模型,计算原理是否相同?

最佳答案

freedj 查看完整内容

http://clementine-blog.beauregar ... ariable-importance/ 这里有谈到,计算原理没说清楚,不过应该是一样的,至少结果是可比较的。网页要翻墙才能打开 我就不翻译了 How Clementine 12 calculates variable importance13Nov08 A long-awaited feature in Clementine 12 is that all, or almost all, modelling algorithms generate a summary listing the relative importance of the variables. In version 11, a handful ...
关键词:clementine clementin Clement 软件操作 CLE 软件 clementine
有人的地方就有江湖

沙发
freedj 发表于 2009-9-12 23:53:14
http://clementine-blog.beauregar ... ariable-importance/
这里有谈到,计算原理没说清楚,不过应该是一样的,至少结果是可比较的。网页要翻墙才能打开
我就不翻译了
How Clementine 12 calculates variable importance13Nov08
A long-awaited feature in Clementine 12 is that all, or almost all, modelling algorithms generate a summary listing the relative importance of the variables.  In version 11, a handful of algorithms ranked variables in order of importance, each using a different technique.  For instance, you could work out which variables in a regression were important by browsing the coefficients.  Neural networks generated a chart by a means that now escapes me.

Version 12 standardises how variable importance is calculated, so the importance charts of different models can be compared, and models that did not previously generate “native” variable importance can be evaluated with the new technique.  According to information received, the following algorithms all use the same  calculation:

C5.0
C&RT
QUEST
CHAID
Regression
Logistic
Discriminant
GenLin
SVM
Bayesian Networks
How does it work?  It uses factor prioritisation: that is, which factor (input variable) leads to the greatest reduction in the variance of the output, when the value of that input variable is known?  Which leads to the second-greatest?
The maths behind the calculation is quite involved.  For me the most useful piece of knowledge is that all of the algorithms above use an identical means of determining variable importance, so the results can be directly compared.  No word yet on whether neural networks use the new calculation.

On a practical note, for some algorithms generation of variable importance is disabled by default because it can take a long time to calculate.  If you want it for SVM, logistic regression, or the binary classifier, you need to turn it on before building the model.  You might want to use feature selection prior to modelling in these cases, to reduce the number of low-impact variables being entered into the models.

藤椅
420948492 发表于 2009-9-13 22:51:46
好,我看一下
有人的地方就有江湖

板凳
ycl0536 发表于 2009-9-15 15:01:26
上面的网站怎么打不开

报纸
freedj 发表于 2009-9-15 15:50:25
嗯,国内打不开的,被屏蔽了。要翻墙

地板
ycl0536 发表于 2009-9-15 15:53:22
楼上的,该怎么翻墙啊

7
freedj 发表于 2009-9-15 16:06:47
怎么翻墙?随便搜一下,就有很多资料的
有很多软件可以用,自由门,无界,UltraVPN
我现在用的是最后一个,安装了去注册个用户就可以,很方便

8
ycl0536 发表于 2009-9-15 16:17:03
你太有才了,,这样岂不是可以看好多花花绿绿的网站了

9
420948492 发表于 2009-9-15 17:49:22
呵呵,用自由门吧,很好用的,但千万别做坏事,{:2_38:}
有人的地方就有江湖

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
加好友,备注cda
拉您进交流群
GMT+8, 2026-1-3 02:16