楼主: ReneeBK
911 0

The 42 V’s of Big Data and Data Science [推广有奖]

  • 1关注
  • 62粉丝

VIP

学术权威

14%

还不是VIP/贵宾

-

TA的文库  其他...

R资源总汇

Panel Data Analysis

Experimental Design

威望
1
论坛币
49407 个
通用积分
51.8704
学术水平
370 点
热心指数
273 点
信用等级
335 点
经验
57815 点
帖子
4006
精华
21
在线时间
582 小时
注册时间
2005-5-8
最后登录
2023-11-26

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币

Understanding and effectively communicating a concept often requires first building a simple mental model. Consider, for example, how we teach the physical laws to students: it helps to walk with algebra before you can run with calculus. This kind of model trades correctness (shaving off "unnecessary" detail) for an increased ability to grasp the larger picture.

In 2001, Gartner (perhaps) accidentally abetted an avalanche of aliteration with an article that forecast trends in the industry, gathering them under the headings Data Volume, Data Velocity, and DataVariety. Of course inflation continues its inexorable march, and about a decade later we had the 4 V's of Big Data, then 7 V's, and then 10 V's.

But it's 2017 now, and we now operate in an ever more sophisticated world of analytics. To keep up with the times, we present our updated 2017 list: The 42 V's of Big Data and Data Science.

  • Vagueness: The meaning of found data is often very unclear, regardless of how much data is available.
  • Validity: Rigor in analysis (e.g., Target Shuffling) is essential for valid predictions.
  • Valor: In the face of big data, we must gamely tackle the big problems.
  • Value: Data science continues to provide ever-increasing value for users as more data becomes available and new techniques are developed.
  • Vane: Data science can point in the direction of correct decision making.
  • Vanilla: Even the simplest models, constructed with rigor, can provide value.
  • Vantage: Big data allows us a privileged view of complex systems.
  • Variability: Data science often models variable data sources. Models deployed into production can encounter especially wild data.
  • Variety: In data science, we work with many data formats (flat files, relational databases, graph networks) and varying levels of data completeness.
  • Varifocal: Big data and data science together allow us to see both the forest and the trees.
  • Varmint: As big data gets bigger, so can software bugs!
  • Varnish: How end-users interact with our work matters, and polish counts.
  • Vastness: With the advent of the internet of things, the "bigness" of big data is accelerating.
  • Vaticination: Predictive analytics provides the ability to forecast. (Of course, these forecasts can be more or less accurate depending on rigor and the complexity of the problem. The future is pesky and never conforms to our March Madness brackets.)
  • Vault: With many data science applications based on large and often sensitive data sets, data security is increasingly important.
  • Veer: With the rise of agile data science, we should be able to navigate the customer's needs and change directions quickly when called upon.
  • Veil: Data science provides the capability to peer behind the curtain and examine the effects of latent variables in the data.
  • Velocity: Not only is the volume of data ever increasing, but the rate of data generation (from the internet of things, social media, etc.) is increasing as well.
  • Venue: Data science work takes place in different locations and under different arrangements: Locally, on customer workstations, and in the cloud.
  • Veracity: Reproducibility is essential for accurate analysis.
  • Verdict: As an increasing number of people are affected by models' decisions, Veracity and Validity become ever more important.
  • Versed: Data scientists often need to know a little about a great many things: mathematics, statistics, programming, databases, etc.
  • Version Control: You're using it, right?
  • Vet: Data science allows us to vet our assumptions, augmenting intuition with evidence.
  • Vexed: Some of the excitement around data science is based on its potential to shed light on large, complicated problems.
  • Viability: It is difficult to build robust models, and it's harder still to build systems that will beviable in production.
  • Vibrant: A thriving data science community is vital, and it provides insights, ideas, and support in all of our endeavors.
  • Victual: Big data — the food that fuels data science.
  • Viral: How does data spread among other users and applications?
  • Virtuosity: If data scientists need to know a little about many things, we should also grow to know a lot about one thing.
  • Viscosity: Related to Velocity; how difficult is the data to work with?
  • Visibility: Data science provides visibility into complex big data problems.
  • Visualization: Often the only way customers interact with models.
  • Vivify: Data science has the potential to animate all manner of decision making and business processes, from advertising to fraud detection.
  • Vocabulary: Data science provides a vocabulary for addressing a variety of problems. Different modeling approaches tackle different problem domains, and different validation techniques harden these approaches in different applications.
  • Vogue: "Machine Learning" which becomes "Artificial Intelligence", which becomes...?
  • Voice: Data science provides the ability to speak with knowledge (though not all knowledge, of course) on a diverse range of topics.
  • Volatility: Especially in production systems, one has to prepare for data volatility. Data that should "never" be missing suddenly disappears, numbers suddenly contain characters!
  • Volume: More people use data-collecting devices as more devices become internet-enabled. The volume of data is increasing at a staggering rate.
  • Voodoo: Data science and big data aren't voodoo, but how can we convince potential customers of data science's value to deliver results with real-world impact?
  • Voyage: May we always keep learning as we tackle the problems that data science provides.
  • Vulpine: Nate Silver would like you to be a fox, please.

二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

关键词:Data Science Big data Science Data SCIE industry building physical perhaps ability

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
加好友,备注jltj
拉您入交流群

京ICP备16021002-2号 京B2-20170662号 京公网安备 11010802022788号 论坛法律顾问:王进律师 知识产权保护声明   免责及隐私声明

GMT+8, 2024-5-1 02:04