楼主: oliyiyi
1479 0

The Single Most Important Skill for a Data Scientist [推广有奖]

版主

已卖:2995份资源

泰斗

1%

还不是VIP/贵宾

-

TA的文库  其他...

计量文库

威望
7
论坛币
66190 个
通用积分
31671.1867
学术水平
1454 点
热心指数
1573 点
信用等级
1364 点
经验
384134 点
帖子
9629
精华
66
在线时间
5508 小时
注册时间
2007-5-21
最后登录
2025-7-8

初级学术勋章 初级热心勋章 初级信用勋章 中级信用勋章 中级学术勋章 中级热心勋章 高级热心勋章 高级学术勋章 高级信用勋章 特级热心勋章 特级学术勋章 特级信用勋章

楼主
oliyiyi 发表于 2015-6-29 20:07:53 |AI写论文

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币
(This article was first published on Mango Solutions, and kindly contributed to R-bloggers)

By Richard Pugh, Commercial Director

I love my job.  Seriously.  I was enjoying it before Hal Varian made it sexy, but since then, and the data science explosion, everything has kicked into an even higher gear.  Why do I love it?  Because fundamentally, our job as data scientists is to help people make better decisions based on the information we have at hand.

As Mango Solutions continues to grow faster that you can say “what do you mean we need to look at offices again”, I find myself talking to more and more graduates about the skills needs to be a good data scientist, and pitfalls to avoid (mostly because I’ve stumbled into every pitfall at some point, so just about know where they are .. well, the ones I know about so far, of course).

When someone suggested writing this blog post, and gave me a title starting with “the single most important …” my initial reaction was to run away quickly.  Because surely, stating “the single most important …” in front of anything leaves you open to a Monty-Python-Spanish-Inquisition back down at some point along the line.  But in the end I agreed … so here goes …

In my opinion the single most important skill for a data scientist is not:

  • Knowing the difference between a GLM and a GAM
  • Understanding which R package is best to use for a particular task
  • Being able to extract data from twitter and merge it with your relational database
  • Creating a really smart plot that simultaneously communicates a message clearly and looks really sexy

No, in my opinion, the single most important skill for a data scientist is … Empathy.

Why “empathy”?  Because if we’re going to drive decisions with analytics, we need to appreciate the number of different personalities involved, what they are trying to achieve, what constraints they work under etc.

For example, a data scientist may end up interacting with:

  • The business user, who just wants to make more informed decisions, possibly in a very short time frame.
  • The IT contact, who has possibly never heard of the funky analytic technology you’re about to mention, and has to fill in 100 forms just to get a new server commissioned.
  • The marketing person, who wants to make sure you know that the colour of your graph needs to be #333380, not #3D3D99!
  • The internal statistician, who perhaps doesn’t understand this funky gradient boosted regression trees approach of which you speak, but is going to end up supporting this analytic solution.

Being able to interact with these people and take their aims and concerns into account when you’re designing analytic solutions is essential to make sure you create something fit for purpose in a positive way.

Even when you’re not interacting with the team above, empathy is still something that should be at the front of your mind as a data scientist.  For example:

  • When I’m writing some code to extract data, is this a “one off” thing, or had I better write it in a more generic style, parameterise column names etc?
  • Who is going to support the code I’m writing?  Maybe I should steer clear of that “holy crap how clever am I” short line of code that does a million things and replace it with a few well-documented lines of simpler code?
  • How do I best present the insight back to the user?  In a visual style perhaps?  Then let’s make sure they can clearly see the message past the funky interactive embedded scatter/ring/pie(!) graph I’m making
  • Once they’ve understood the message, what will my business users’ next question be?  Maybe I should anticipate that and make it easy to answer that question too?
  • Having fit a cool model, does the end user really want to see a p-value?  Or do they just want to know what decision to make?

So, that’s it.  In my opinion, the single most important skill for a data scientist is “Empathy”.

… and Fear! The two most important things are Empathy and Fear …

… and Surprise!  Empathy and Fear and Surprise …

… and … I’ll come in again.


二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

关键词:Scientist IMPORTANT import SINGLE Skill everything published article Because science

已有 1 人评分论坛币 收起 理由
Nicolle + 20 精彩帖子

总评分: 论坛币 + 20   查看全部评分

本帖被以下文库推荐

缺少币币的网友请访问有奖回帖集合
https://bbs.pinggu.org/thread-3990750-1-1.html

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
加好友,备注jltj
拉您入交流群
GMT+8, 2026-1-8 09:19