A Beginner’s Guide to Neural Networks with R

1关注
185
粉丝

版主

已卖：2994份资源

泰斗

1%

还不是VIP/贵宾

-

TA的文库 其他...

计量文库

0%

威望: 7 级
论坛币: 84105 个
通用积分: 31671.0967
学术水平: 1454 点
热心指数: 1573 点
信用等级: 1364 点
经验: 384134 点
帖子: 9629
精华: 66
在线时间: 5508 小时
注册时间: 2007-5-21
最后登录: 2025-7-8

楼主

oliyiyi 发表于 2016-8-13 18:36:15 |AI写论文

是否 +2 论坛币

k人参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群

赵安豆老师微信：zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

立即领取

感谢您参与论坛问题回答

经管之家送您两个论坛币！

+2 论坛币

In this article we will learn how Neural Networks work and how to implement them with the R programming language! We will see how we can easily create Neural Networks with R and even visualize them. Basic understanding of R is necessary to understand this article.

By Jose Portilla, Udemy Data Science Instructor.

I'm Jose Portilla and teach thousands of students on Udemy about Data Science and Programming and I also conduct in-person programming and data science training. Check out the end of the article for discount coupons on my courses!

Neural Networks

Neural Networks are a machine learning framework that attempts to mimic the learning pattern of natural biological neural networks. Biological neural networks have interconnected neurons with dendrites that receive inputs, then based on these inputs they produce an output signal through an axon to another neuron. We will try to mimic this process through the use of Artificial Neural Networks (ANN), which we will just refer to as neural networks from now on. The process of creating a neural network begins with the most basic form, a single perceptron.

The Perceptron

Let's start our discussion by talking about the Perceptron! A perceptron has one or more inputs, a bias, an activation function, and a single output. The perceptron receives inputs, multiplies them by some weight, and then passes them into an activation function to produce an output. There are many possible activation functions to choose from, such as the logistic function, a trigonometric function, a step function etc. We also make sure to add a bias to the perceptron, this avoids issues where all inputs could be equal to zero (meaning no multiplicative weight would have an effect). Check out the diagram below for a visualization of a perceptron:

Once we have the output we can compare it to a known label and adjust the weights accordingly (the weights usually start off with random initialization values). We keep repeating this process until we have reached a maximum number of allowed iterations, or an acceptable error rate.

To create a neural network, we simply begin to add layers of perceptrons together, creating a multi-layer perceptron model of a neural network. You'll have an input layer which directly takes in your feature inputs and an output layer which will create the resulting outputs. Any layers in between are known as hidden layers because they don't directly "see" the feature inputs or outputs. For a visualization of this check out the diagram below (source: Wikipedia).

Let's move on to actually creating a neural network in R!

Data

We'll use ISLR's built in College Data Set which has several features of a college and a categorical column indicating whether or not the School is Public or Private.

#install.packages('ISLR')
library(ISLR)
print(head(College,2))

复制代码

Private Apps Accept Enroll Top10perc Top25percAbilene Christian University Yes 1660 1232 721 23 52Adelphi University Yes 2186 1924 512 16 29 F.Undergrad P.Undergrad Outstate Room.Board BooksAbilene Christian University 2885 537 7440 3300 450Adelphi University 2683 1227 12280 6450 750 Personal PhD Terminal S.F.Ratio perc.alumni ExpendAbilene Christian University 2200 70 78 18.1 12 7041Adelphi University 1500 29 30 12.2 16 10527 Grad.RateAbilene Christian University 60Adelphi University 56

Data Preprocessing

It is important to normalize data before training a neural network on it. The neural network may have difficulty converging before the maximum number of iterations allowed if the data is not normalized. There are a lot of different methods for normalization of data. We will use the built-in scale() function in R to easily accomplish this task.

Usually it is better to scale the data from 0 to 1, or -1 to 1. We can specify the center and scale as additional arguments in the scale() function. For example:

# Create Vector of Column Max and Min Values
maxs <- apply(College[,2:18], 2, max)
mins <- apply(College[,2:18], 2, min)
# Use scale() and convert the resulting matrix to a data frame
scaled.data <- as.data.frame(scale(College[,2:18],center = mins, scale = maxs - mins))
# Check out results
print(head(scaled.data,2))

复制代码

Apps Accept EnrollAbilene Christian University 0.03288692646 0.04417701272 0.10791253736Adelphi University 0.04384229271 0.07053088583 0.07503539405 Top10perc Top25perc F.UndergradAbilene Christian University 0.2315789474 0.4725274725 0.08716353479Adelphi University 0.1578947368 0.2197802198 0.08075165058 P.Undergrad Outstate Room.BoardAbilene Christian University 0.02454774445 0.2634297521 0.2395964691Adelphi University 0.05614838562 0.5134297521 0.7361286255 Books Personal PhDAbilene Christian University 0.1577540107 0.2977099237 0.6526315789Adelphi University 0.2914438503 0.1908396947 0.2210526316 Terminal S.F.Ratio perc.alumniAbilene Christian University 0.71052631579 0.4182305630 0.1875Adelphi University 0.07894736842 0.2600536193 0.2500 Expend Grad.RateAbilene Christian University 0.0726714046 0.4629629630Adelphi University 0.1383867137 0.4259259259

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

分享0 收藏3 回帖

关键词：Networks beginner network beginn Neural understand necessary thousands training article

相关帖子

已有 2 人评分	经验	学术水平	热心指数	收起理由
耕耘使者		+ 3	+ 3	对论坛有贡献
william9225	+ 20			精彩帖子

总评分: 经验 + 20 学术水平 + 3 热心指数 + 3 查看全部评分

缺少币币的网友请访问有奖回帖集合：
https://bbs.pinggu.org/thread-3990750-1-1.html

沙发

oliyiyi 发表于 2016-8-13 18:37:44

Train and Test Split

Let us now split our data into a training set and a test set. We will run our neural entwork on the training set and then see how well it performed on the test set.

We will use the caTools to randomly split the data into a training set and test set.

# Convert Private column from Yes/No to 1/0
Private = as.numeric(College$Private)-1
data = cbind(Private,scaled.data)
library(caTools)
set.seed(101)
# Create Split (any column is fine)
split = sample.split(data$Private, SplitRatio = 0.70)
# Split based off of split Boolean Vector
train = subset(data, split == TRUE)
test = subset(data, split == FALSE)

复制代码

Neural Network Function

Before we actually call the neuralnetwork() function we need to create a formula to insert into the machine learning model. The neuralnetwork() function won't accept the typical decimal R format for a formula involving all features (e.g. y ~.). However, we can use a simple script to create the expanded formula and save us some typing:

feats <- names(scaled.data)
# Concatenate strings
f <- paste(feats,collapse=' + ')
f <- paste('Private ~',f)
# Convert to formula
f <- as.formula(f)
f

复制代码

Private ~ Apps + Accept + Enroll + Top10perc + Top25perc + F.Undergrad + P.Undergrad + Outstate + Room.Board + Books + Personal + PhD + Terminal + S.F.Ratio + perc.alumni + Expend + Grad.Rate

#install.packages('neuralnet')
library(neuralnet)
nn <- neuralnet(f,data,hidden=c(10,10,10),linear.output=FALSE)

复制代码

Predictions and Evaluations

Now let's see how well we performed! We use the compute() function with the test data (jsut the features) to create predicted values. This returns a list from which we can call net.result off of.

# Compute Predictions off Test Set
predicted.nn.values <- compute(nn,test[2:18])
# Check out net.result
print(head(predicted.nn.values$net.result))

复制代码

[,1]Adrian College 1.0000000000Alfred University 1.0000000000Allegheny College 1.0000000000Allentown Coll. of St. Francis de Sales 0.9999999415Alma College 0.9999999960Amherst College 0.9994219945

Notice we still have results between 0 and 1 that are more like probabilities of belonging to each class. We'll use sapply() to round these off to either 0 or 1 class so we can evaluate them against the test labels.

predicted.nn.values$net.result <- sapply(predicted.nn.values$net.result,round,digits=0)
Now let's create a simple confusion matrix:

复制代码

table(test$Private,predicted.nn.values$net.result)

复制代码

0 1 0 62 2 1 0 169

Visualizing the Neural Net

We can visualize the Neural Network by using the plot(nn) command. The black lines represent the weighted vectors between the neurons. The blue line represents the bias added. Unfortunately, even though the model is clearly a very powerful predictor, it is not easy to directly interpret the weights. This means that we usually have to treat Neural Network models more like black boxes.

Hopefully you've enjoyed this brief discussion on Neural Networks! Try playing around with the number of hidden layers and neurons and see how they effect the results!

Want to learn more? You can check out my Data Science and Machine Learning Bootcamp with R course on Udemy! Get it for 50% off at this link: https://www.udemy.com/data-science-and-machine-learning-bootcamp-with-r/?couponCode=KDNUGGETS

If you are looking for corporate in-person training, feel free to contact me at: training AT pieriandata.com

Bio: Jose Portilla is a Data Science consultant and trainer who currently teaches online courses on Udemy. He also conducts training as the Head of Data Science for Pierian Data Inc.

缺少币币的网友请访问有奖回帖集合：
https://bbs.pinggu.org/thread-3990750-1-1.html

藤椅

william9225

发表于 2016-8-13 22:50:58 来自手机

谢谢分享

已有 1 人评分	经验	收起理由
oliyiyi	+ 10	精彩帖子

总评分: 经验 + 10 查看全部评分

板凳

Kamize

发表于 2016-9-2 22:51:48 来自手机

oliyiyi 发表于 2016-8-13 18:36
In this article we will learn how Neural Networks work and how to implement them with the R programm ...

谢谢分享

已有 1 人评分	论坛币	收起理由
oliyiyi	+ 5	精彩帖子

总评分: 论坛币 + 5 查看全部评分

报纸

20115326

发表于 2016-10-28 10:12:46

学习一下，哈哈

已有 1 人评分	论坛币	收起理由
oliyiyi	+ 5	精彩帖子

总评分: 论坛币 + 5 查看全部评分

A Beginner’s Guide to Neural Networks with R [推广有奖]

经管之家送您一份

经管之家联合CDA

感谢您参与论坛问题回答

扫码加我拉你入群

相关帖子

浏览过的帖子

浏览过的版块

初级学术勋章

初级热心勋章

初级信用勋章

中级信用勋章

中级学术勋章

中级热心勋章

高级热心勋章

高级学术勋章

高级信用勋章

特级热心勋章

特级学术勋章

特级信用勋章

本版微信群

A Beginner’s Guide to Neural Networks with R [推广有奖]

经管之家送您一份

经管之家联合CDA

感谢您参与论坛问题回答

扫码加我 拉你入群

相关帖子

浏览过的帖子

浏览过的版块

初级学术勋章

初级热心勋章

初级信用勋章

中级信用勋章

中级学术勋章

中级热心勋章

高级热心勋章

高级学术勋章

高级信用勋章

特级热心勋章

特级学术勋章

特级信用勋章

本版微信群

扫码加我拉你入群