楼主: Shazam
1955 1

What's the difference between deep learning and multilevel analysis? [推广有奖]

  • 0关注
  • 2粉丝

硕士生

40%

还不是VIP/贵宾

-

TA的文库  其他...

C++

Culture(文化学)

C#(Newoccidental)

威望
0
论坛币
1847 个
通用积分
2.0750
学术水平
6 点
热心指数
6 点
信用等级
5 点
经验
1118 点
帖子
144
精华
0
在线时间
4 小时
注册时间
2006-5-10
最后登录
2016-12-4

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币

Is "deep learning" just another term for multilevel/hierarchical modeling?

I'm much more familiar with the latter than the former, but from what I can tell, the primary difference is not in their definition, but how they are used and evaluated within their application domain.

It looks like the number of nodes in a typical "deep learning" application is larger and uses a generic hierarchical form, whereas applications of multilevel modeling typically uses a hierarchical relationships that mimic the generative process being modeled. Using a generic hierarchy in an applied statistics (hierarchical modeling) domain would be regarded as an "incorrect" model of the phenomena, whereas modeling a domain-specific hierarchy might be regarded as subverting the objective of making a generic deep learning learning machine.

Are these two things really the same machinery under two different names, used in two different ways?


二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

关键词:Multilevel difference Learning Analysis earning difference learning between

沙发
Shazam 发表于 2014-4-17 03:58:44 |只看作者 |坛友微信交流群
Similarity
Fundamentally both types of algorithms were developed to answer one general question in machine learning applications:

Given predictors (factors) x1,x2,…,xp - how to incorporate the interactions between this factors in order to increase the performance?

One way is to simply introduce new predictors: xp+1=x1x2,xp+2=x1x3,… But this proves to be bad idea due to huge number of parameters and very specific type of interactions.

Both Multilevel modelling and Deep Learning algorithms answer this question by introducing much smarter model of interactions. And from this point of view they are very similar.

Difference
Now let me try to give my understanding on what is the great conceptual difference between them. In order to give some explanation, let's see the assumptions that we make in each of the models:

Multilevel modelling:1 layers that reflect the data structure can be represented as a Bayesian Hierarchical Network. This network is fixed and usually comes from domain applications.

Deep Learning:2 the data were generated by the interactions of many factors. The structure of interactions is not known, but can be represented as a layered factorisation: higher-level interactions are obtained by transforming lower-level representations.

The fundamental difference comes from the phrase "the structure of interactions is not known" in Deep Learning. We can assume some priors on the type of interaction, but yet the algorithm defines all the interactions during the learning procedure. On the other hand, we have to define the structure of interactions for Multilevel modelling (we learn only vary the parameters of the model afterwards).

Examples
For example, let's assume we are given three factors x1,x2,x3 and we define {x1} and {x2,x3} as different layers.

In the Multilevel modelling regression, for example, we will get the interactions x1x2 and x1x3, but we will never get the interaction x2x3. Of course, partly the results will be affected by the correlation of the errors, but this is not that important for the example.

In Deep learning, for example in multilayered Restricted Boltzmann machines (RBM) with two hidden layers and linear activation function, we will have all the possible polinomial interactions with the degree less or equal than three.

Common advantages and disadvantages
Multilevel modelling

  • (-) need to define the structure of interactions
  • (+) results are usually easier to interpret
  • (+) can apply statistics methods (evaluate confidence intervals, check hypotheses)

Deep learning

  • (-) requires huge amount of data to train (and time for training as well)
  • (-) results are usually impossible to interpret (provided as a black box)
  • (+) no expert knowledge required
  • (+) once well-trained, usually outperforms most other general methods (not application specific)

Hope it will help!
Dmitry Laptev

使用道具

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
加好友,备注jltj
拉您入交流群

京ICP备16021002-2号 京B2-20170662号 京公网安备 11010802022788号 论坛法律顾问:王进律师 知识产权保护声明   免责及隐私声明

GMT+8, 2024-5-1 16:02