楼主: SASCHEN
3033 12

[熱門話題]Data Science: The End of Statistics? [推广有奖]

  • 0关注
  • 0粉丝

硕士生

8%

还不是VIP/贵宾

-

TA的文库  其他...

Social Media Mining

深度學習(DEEP LEARNING)

HTML

威望
0
论坛币
2296 个
通用积分
3.5500
学术水平
4 点
热心指数
5 点
信用等级
4 点
经验
912 点
帖子
161
精华
0
在线时间
43 小时
注册时间
2005-9-25
最后登录
2022-10-29

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币

Data Science: The End of Statistics?




    This question was recently posted by Larry Wasserman on the Normal Deviate blog (see extract below). Larry is a statistics and machine learning professor at Carnegie Mellon University.

Here is my answer:

Data science is more than statistics: it also encompasses computer science and business concepts, and it's far more than a set of techniques and principles. I could imagine a data scientist not having a degree - this is not possible for a statistician. But the core of the issue, in my opinion, is explained below.

  • I am one of the guys who contributes to the adoption of the keyword data science. Ironically, I'm a pure statistician (Ph.D. in statistics, 1993 - computational statistics) although I changed a lot since 1993, I'm now an entrepreneur. The reason I tried hard to move away from being called statistician to being called something (anything) else, is because of the American Statistical Association: they killed the keyword statistician as well as limiting career prospects to future statisticians, by making it almost narrowly and exclusively associated with the pharmaceutical industry and small data (where most of its revenue comes from). They missed the boat - on purpose, I believe - of the new statistical revolution that came along with big data over the last 15 years.
  • Statisticians should be very familiar with computer science, big data and software: 10 billion rows with 10,000 variables should not scare a true statistician. On the cloud (or on even on my laptop as streaming data), it gets processed real fast. First step is data reduction, but even if you must keep all observations and variables, it still is feasible. And good computer scientists also produce confidence intervals - you don't need to be statistician for that, just use the First AnalyticBridge Theorem (if you are curious, check out the Second AnalyticBridge Theorem). The distinction between computer scientist and statistician is getting thinner and more fuzzy over the years. The things you did not learn at school (in statistical classes), you can still learn it online.

This diagram misses a few key concepts - including business and domain knowledge

Here's the article:

As I see newspapers and blogs filled with talk of “Data Science” and “Big Data” I find myself filled with a mixture of optimism and dread. Optimism, because it means statistics is finally a sexy field. Dread, because statistics is being left on the sidelines.

The very fact that people can talk about data science without even realizing there is a field already devoted to the analysis of data — a field called statistics — is alarming. I like what Karl Broman says:

When physicists do mathematics, they don’t say they’re doing “number science”. They’re doing math.

If you’re analyzing data, you’re doing statistics. You can call it data science or informatics or analytics or whatever, but it’s still statistics.

Well put.

Maybe I am just pessimistic and am just imagining that statistics is getting left out. Perhaps, but I don’t think so. It’s my impression that the attention and resources are going mainly to Computer Science. Not that I have anything against CS of course, but it is a tragedy if Statistics gets left out of this data revolution.

Two questions come to mind:


  • Why do statisticians find themselves left out?
  • What can we do about it?




二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

关键词:Data Science Statistics statistic Science Statist principles techniques computer learning recently

已有 2 人评分经验 学术水平 热心指数 信用等级 收起 理由
狂热的爱好者 + 60 + 1 + 1 + 1 精彩帖子
离歌レ笑 + 3 + 3 + 3 精彩帖子

总评分: 经验 + 60  学术水平 + 4  热心指数 + 4  信用等级 + 4   查看全部评分

使用道具

藤椅
Lisrelchen 发表于 2014-8-19 05:55:09 |只看作者 |坛友微信交流群

使用道具

板凳
Nicolle 学生认证  发表于 2014-8-19 05:58:39 |只看作者 |坛友微信交流群
提示: 作者被禁止或删除 内容自动屏蔽

使用道具

报纸
meng山楂树 发表于 2014-8-19 06:27:27 |只看作者 |坛友微信交流群
good!!

使用道具

地板
sqy 发表于 2014-8-19 08:49:40 |只看作者 |坛友微信交流群

使用道具

7
eaglestar 在职认证  发表于 2014-8-19 09:01:42 |只看作者 |坛友微信交流群
Welcome to Big Data TIme

使用道具

8
songlinjllive 发表于 2014-8-19 09:25:05 来自手机 |只看作者 |坛友微信交流群
sqy 发表于 2014-8-19 08:49
老江湖说历史

使用道具

9
RDJIN 发表于 2014-8-19 09:37:46 |只看作者 |坛友微信交流群
WHO KNOWS?

使用道具

10
fankaiqing 在职认证  发表于 2014-8-19 09:56:16 |只看作者 |坛友微信交流群
这是不是俺们学统计的悲哀呀

使用道具

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
加好友,备注jltj
拉您入交流群

京ICP备16021002-2号 京B2-20170662号 京公网安备 11010802022788号 论坛法律顾问:王进律师 知识产权保护声明   免责及隐私声明

GMT+8, 2024-4-27 05:34