Original poster: oliyiyi

The most desired skill in data science

oliyiyi posted on 2019-5-7 21:50:24

By Kaiser Fung, Founder, Principal Analytics Prep

This skills gap is typically described as the lack of “critical thinking.”

It’s hard to imagine that someone with a STEM degree lacks critical thinking, so let’s unpack what this means.

Critical thinking in data science can be broken down into two aspects. First is the ability to develop the question. In practice, this involves extensive interviewing with users of the data science or analytics results to truly grasp the problems that need to be solved. Many practitioners, including several speakers at a conference I attended recently, note that users are frequently unable to express the problem properly.

That’s not quite how I’d describe it. Developing the question requires collaboration between the data scientists, who know much more about the data and the analytical tools, and the business owners, who know much more about the business goals and metrics. This collaboration leads to shared knowledge and symbiotic problem solving.

The second aspect of critical thinking is the ability to question the data. Experienced analysts never dump raw data into a piece of software just to see what comes out. Experience tells us what adjustments might be necessary to remove potentially distracting or misleading features in the data.

STEM training is particularly lacking in these two aspects of critical thinking. A typical problem in a math, science or engineering class includes (a) a well-posed question and (b) nicely shaped data, and the student’s challenge is to figure out which formula or method can use the provided data to answer the specified question. There is no need to develop the question further; in fact, any student trying to change the question will be penalized! There is also no need to question the data. If the data had to be questioned, the problem would have no single correct answer, which doesn’t fit well with traditional academic STEM training. (By contrast, social science graduates are better trained to handle complexity and incomplete data.)

In a recent blog post, I showed how a data analyst can use critical thinking to question the data and avoid making embarrassing erroneous conclusions. Analysts at the National Highway Traffic Safety Administration (NHTSA) failed to notice gaping holes in the data submitted by Tesla when they endorsed Tesla’s claim that the auto-pilot feature would reduce crash rates by 40 percent. An independent consultant succeeded in getting the data released, and noticed a large number of blank entries. When the missing values were imputed using a standard method (mean imputation), the reported benefit of auto-pilot vanished entirely.
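
As a rough illustration of why blank entries deserve scrutiny, the sketch below compares two ways of handling them in a small, made-up list of mileage readings: silently treating blanks as zero, and filling them with the mean of the observed values (mean imputation). The data, variable names, and helper functions are all hypothetical and are not drawn from the NHTSA or Tesla files; the only point is that the choice changes the result.

```python
from statistics import mean


def blank_share(values):
    """Fraction of entries that are blank (None)."""
    return sum(v is None for v in values) / len(values)


def treat_blank_as_zero(values):
    """Naive handling: a blank is silently read as 0."""
    return [0.0 if v is None else v for v in values]


def mean_impute(values):
    """Mean imputation: fill each blank with the mean of the observed values."""
    observed = [v for v in values if v is not None]
    if not observed:
        raise ValueError("no observed values to impute from")
    fill = mean(observed)
    return [fill if v is None else v for v in values]


# Hypothetical mileage readings (thousands of miles); None marks a blank entry.
miles = [12.0, None, 8.0, None, 10.0]

print("share of blanks:    ", blank_share(miles))
print("mean, blanks as 0:  ", mean(treat_blank_as_zero(miles)))
print("mean, mean-imputed: ", mean(mean_impute(miles)))
```

With two of the five entries blank, the zero-filled average is 6.0 while the mean-imputed average is 10.0, so checking the share of blanks before running any analysis is a cheap safeguard.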

Bridging this skills gap was a key goal of mine when I started Principal Analytics Prep. We accomplish this by fostering in-class, hands-on learning with practitioners who have years of practical work experience, and by recruiting students with diverse backgrounds so that both science and social science approaches to problem solving are considered.

In Part 2, I provide materials to help you prepare for case interviews that hiring managers use to test critical thinking.

This post was originally published at Kaiser Fung’s blog (https://www.principalanalyticsprep.com/news/critical-thinking-the-most-desired-skill-in-data-science) and is slightly modified.

Bio: Kaiser Fung is the founder of Principal Analytics Prep, a leading data analytics bootcamp; best-selling author of Numbers Rule Your World; and the author of Junk Charts (https://www.junkcharts.com), the popular data visualization blog.
