【独家发布】【kindle】R Data Analysis Cookbook - More Than 80 Recipes to Help You Delive [推广有奖]

51楼

Lisrelchen 发表于 2016-7-25 22:25:30

Linear Regression using R

Linear Regression using R
1. Load the caret package:
> library(caret)
2. Read the data:
> auto <- read.csv("auto-mpg.csv")
3. Convert the categorical variable cylinders into a factor with appropriate renaming of
the levels:
> auto$cylinders <- factor(auto$cylinders,
levels = c(3,4,5,6,8), labels = c("3cyl", "4cyl", "5cyl",
"6cyl", "8cyl"))
4. Create partitions:
> set.seed(1000)
> t.idx <- createDataPartition(auto$mpg, p = 0.7,
list = FALSE)
5. See the names of the variables in the data frame:
> names(auto)
6. Build the linear regression model:
> mod <- lm(mpg ~ ., data = auto[t.idx, -c(1,8,9)])
7. View the basic results (your results may differ because of random sampling
differences in creating the partitions):
> mod
8. View more detailed results:
> summary(mod)
9. Generate predictions for the test data:
> pred <- predict(mod, auto[-t.idx, -c(1,8,9)])
10. Compute the RMS error on the test data (your results can differ):
> sqrt(mean((pred - auto[-t.idx, 2])^2))
[1] 4.333631
11. View diagnostic plots of the model:
> par(mfrow = c(2,2))
> plot(mod)
> par(mfrow = c(1,1))

复制代码

加关注串个门加好友发消息 0关注 463 粉丝巨擘 Nicolle 当前离线阅读权限 255 威望 16 级论坛币 12403139 个通用积分 1638.9258 学术水平 3305 点热心指数 3329 点信用等级 3095 点经验 476993 点帖子 23839 精华 91 在线时间 9878 小时注册时间 2005-4-23 最后登录 2022-3-6 雷达卡	52楼 Nicolle 发表于 2016-7-25 23:48:28 Regression Trees using R 提示: 作者被禁止或删除内容自动屏蔽

	回复举报

加关注串个门加好友发消息 0关注 463 粉丝巨擘 Nicolle 当前离线阅读权限 255 威望 16 级论坛币 12403139 个通用积分 1638.9258 学术水平 3305 点热心指数 3329 点信用等级 3095 点经验 476993 点帖子 23839 精华 91 在线时间 9878 小时注册时间 2005-4-23 最后登录 2022-3-6 雷达卡	53楼 Nicolle 发表于 2016-7-25 23:54:49 提示: 作者被禁止或删除内容自动屏蔽

	回复举报