lgfit<-glm(Churn~.,family = binomial(link ='logit'),data=traindata)
summary(lgfit) # 四个变量无系数,说它们异常,实际上数据并没问题,估计是算法包本身的问题
Call:
glm(formula = Churn ~ ., family = binomial(link = "logit"), data = traindata)
Coefficients: (4 not defined because of singularities)
Estimate Std. Error z value Pr(>|z|)
(Intercept) 3.529e+07 5.037e+07 0.701 0.483505
Gender女 1.338e-01 8.495e-02 1.575 0.115153
HandsetASAD90 5.560e+00 2.788e-01 19.942 < 2e-16 ***
Usage_Band中使用率 -8.563e-01 2.082e-01 -4.113 3.90e-05 ***
International_mins 1.496e-02 3.884e-03 3.850 0.000118 ***
...
National_calls NA NA NA NA
National_mins NA NA NA NA
All_calls_mins NA NA NA NA
Nat_call_cost 1.133e-02 2.560e-02 0.443 0.657953
Weekend_mins_Fluctuation 1.050e-03 2.406e-02 0.044 0.965196
...
Mins_charge NA NA NA NA
actual_call_cost 2.481e-02 2.773e-02 0.895 0.371031
Total_call_cost 3.366e+05 4.797e+05 0.702 0.482842
Total_Cost -3.366e+05 4.797e+05 -0.702 0.482842
call_cost_per_min -3.940e-02 6.678e-02 -0.590 0.555170
average_cost_min -1.450e+00 5.689e-01 -2.548 0.010829 *
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
traindata.rar
(1.5 MB)
本附件包括:- traindata.csv


雷达卡





京公网安备 11010802022788号







