楼主: minihawk
4857 3

[问答] 回归分析的局限性怎么处理,急! [推广有奖]

  • 0关注
  • 1粉丝

已卖:1份资源

本科生

6%

还不是VIP/贵宾

-

威望
0
论坛币
251 个
通用积分
0
学术水平
2 点
热心指数
2 点
信用等级
1 点
经验
1381 点
帖子
47
精华
0
在线时间
45 小时
注册时间
2013-1-5
最后登录
2014-8-21

楼主
minihawk 发表于 2013-4-8 15:14:22 |AI写论文
100论坛币
求大侠帮忙!要研究一个总体指标满意度与各变量的关系,现打算用回归分析来做,但是根据前人的经验,满意度是存在一定的右偏的,非完全正态的。请问,有没有办法通过增加虚拟变量或者其他的什么方法来处理,使其能够满足回归分析的条件,还是说既然是右偏的就只能放弃用回归分析来做的思路?急,恳请大家知道的帮个忙。

关键词:怎么处理 回归分析 局限性 什么方法 SOSO 回归分析 局限性

沙发
mssr 发表于 2013-4-8 21:35:40
1. Taking the square root helps normalize a distribution skewed right.

When one applies a square root transformation, the square root of every value is taken. However, as one cannot take the square root of a negative number, if there are negative values for a variable, a constant must be added to move the minimum value of the distribution above 0, preferably to 1.00 (the rationale for this assertion is explained below). Another important point is that numbers of 1.00 and above behave differently than numbers between 0.00 and 0.99. The square root of numbers above 1.00 always become smaller, 1.00 and 0.00 remain constant, and number between 0.00 and 1.00 become larger (the square root of 4 is 2, but the square root of 0.40 is 0.63). Thus, if you apply a square root to a continuous variable that contains values between 0 and 1 as well as above 1, you are treating some numbers differently than others, which is probably not desirable in most cases.

2. Taking the logarithm of X also helps normalize a right-skewed distribution.

Logarithmic transformations are actually a class of transformations, rather than a single transformation. In brief, a logarithm is the power (exponent) to which a base number must be raised in order to get the original number. Any given number can be expressed as y to the x power in an infinite number of ways. For example, if we were talking about base 10, 1 is 100, 100 is 102, 16 is 101.2, and so on. Thus, log10(100)=2 and log10(16)=1.2. However, base 10 is not the only option for log transformations. Another common option is the Natural Logarithm, where the constant e (2.7182818) is the base. In this case the natural log 100 is 4.605. As the logarithm of any negative number or number less than 1 is undefined, if a variable contains values less than 1.0, a constant must be added to move the minimum value of the distribution, preferably to 1.00.

藤椅
minihawk 发表于 2013-4-9 11:24:07
不好意思啊 本人英文真心差  看不懂啊 麻烦能用中文说明下吗,谢了

板凳
zkymath 在职认证  发表于 2013-4-9 12:56:54
猜测是提示你利用变换 sqrt(y)+平移     或者    log(y)搞搞,不一定非得线性回归啊。把满意度分级再利用Logistic回归不知道成不成啊

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
加好友,备注cda
拉您进交流群
GMT+8, 2025-12-24 12:29