楼主: hunxuexiaomeinv
910 4

[其他] A problem about my logistic model. [推广有奖]

  • 0关注
  • 0粉丝

大专生

81%

还不是VIP/贵宾

-

威望
0
论坛币
708 个
通用积分
0
学术水平
0 点
热心指数
0 点
信用等级
0 点
经验
3453 点
帖子
31
精华
0
在线时间
73 小时
注册时间
2010-7-27
最后登录
2022-4-27

楼主
hunxuexiaomeinv 发表于 2015-10-16 05:07:34 |AI写论文

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币

Hi,

I have made a logistic model for a datasetwith a binary response variable.

The result is not good, I think.

The prediction error is 10%. However allthe predict response variables are the same (all are “YES”). The predictpossibilities are different, but they are all larger than 0.5, and I use 0.5 asa cutoff to decide the result, so all the response variables are “YES”.

Also I calculate the R2, whichis 0.05; it’s so small. And the Hoemer Lemeshow Test shows the p-value is<0.0001, which is bad enough.

I think the logistic model is not a goodchoice here. I want to know what I should do next.

Trying some other models? Could you give mesome choices?

Or I need to do something based on logisticmodel?

Could anyone give me any suggestion aboutmy problem?

Thanks.


二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

关键词:logistic problem logisti ogistic logist problem about

yh

沙发
夏目贵志 发表于 2015-10-17 07:42:21
You will have to be way more specific than that. What is it that you are trying to study? What is the dependent variable? What are the independent variables?

There are a lot of things that are not clear from your statements. For example, you said that all the predictions are 1 but there is only a 10% "prediction error". Does this mean that your dependent variable is 1 in most of your observations? That may create a problem.

Do not worry about R squares. They don't usually mean much in such models. Look at, say, a two way table of prediction vs actual. Or use ROC curve.

Try probit model too.

藤椅
hunxuexiaomeinv 发表于 2015-10-20 01:25:54
夏目贵志 发表于 2015-10-17 07:42
You will have to be way more specific than that. What is it that you are trying to study? What is th ...
I cannot paste the data.
The factors are some engineering factors like alloy composition and other process variables like water pressure, temperature. They are numerical and dependent variable is a binary variable with 1 "PASS", 0"FAIL". And accually most of these binary variables are PASS. That's why only 10% prediction error.
The problem here is that I am not sure if the model indeed works here.
I have tried probit model, which is similar with logit model. The CV-MSE is the same.
Do I need to try some penalty in logistic model?
I cannot find what the problem here is; then I cannot find the way to solve it.
Could you give me some suggestion?
Thanks

板凳
夏目贵志 发表于 2015-10-20 08:19:37
hunxuexiaomeinv 发表于 2015-10-20 01:25
I cannot paste the data.
The factors are some engineering factors like alloy composition and othe ...
I'd say if it is experimental data, you really need to think hard about the underlying data generating process before imposing the assumptions of any model. It could be that linearity is just a bad assumption. If so, it does not matter if you are using logit or probit. I think it is time to talk to your advisor about it. I'd say most people here work with economics/business, not some natural science.

报纸
hunxuexiaomeinv 发表于 2015-10-20 15:00:41
夏目贵志 发表于 2015-10-20 08:19
I'd say if it is experimental data, you really need to think hard about the underlying data genera ...
Thanks, anyway.
I want to try machine learing methods to do classification.
Thanks for your answer.

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
加好友,备注jltj
拉您入交流群
GMT+8, 2026-2-7 21:23