Original poster: jasonwu24

[Book Introduction] [Wiley 2017 New Book] The Data Science Handbook

The Data Science Handbook
  • Title: The Data Science Handbook
  • Author: Field Cady
  • Length: 416 pages
  • Edition: 1
  • Language: English
  • Publisher: Wiley
  • Publication Date: 2017-02-28
  • ISBN-10: 1119092949
  • ISBN-13: 9781119092940

A comprehensive overview of data science covering the analytics, programming, and business skills necessary to master the discipline

Wiley.The.Data.Science.Handbook.2017.pdf (5.86 MB, requires 5 forum coins)

Finding a good data scientist has been likened to hunting for a unicorn: the required combination of technical skills is simply very hard to find in one person. In addition, good data science is not just rote application of trainable skill sets; it requires the ability to think flexibly about all these areas and understand the connections between them. This book provides a crash course in data science, combining all the necessary skills into a unified discipline.

Unlike many analytics books, computer science and software engineering are given extensive coverage since they play such a central role in the daily work of a data scientist. The author also describes classic machine learning algorithms, from their mathematical foundations to real-world applications. Visualization tools are reviewed, and their central importance in data science is highlighted. Classical statistics is addressed to help readers think critically about the interpretation of data and its common pitfalls. The clear communication of technical results, which is perhaps the most undertrained of data science skills, is given its own chapter, and all topics are explained in the context of solving real-world data problems. The book also features:

  • Extensive sample code and tutorials using Python™ along with its technical libraries
  • Core technologies of “Big Data,” including their strengths and limitations and how they can be used to solve real-world problems
  • Coverage of the practical realities of the tools, keeping theory to a minimum; however, when theory is presented, it is done in an intuitive way to encourage critical thinking and creativity
  • A wide variety of case studies from industry
  • Practical advice on the realities of being a data scientist today, including the overall workflow, where time is spent, the types of datasets worked on, and the skill sets needed

The Data Science Handbook is an ideal resource for data analysis methodology and big data software tools. The book is appropriate for people who want to practice data science, but lack the required skill sets. This includes software professionals who need to better understand analytics and statisticians who need to understand software. Modern data science is a unified discipline, and it is presented as such. This book is also an appropriate reference for researchers and entry-level graduate students who need to learn real-world analytics and expand their skill set.

FIELD CADY is Principal Data Scientist at Maana, Inc., where he applies Big Data tools to solve industrial problems. He has a BS in Physics from Stanford University, an MS in Applied Mathematics from the University of Washington, and an MS in Computer Science from Carnegie Mellon University.

Table of Contents

Chapter 1 Introduction: Becoming a Unicorn

Part I The Stuff You’ll Always Use
Chapter 2 The Data Science Road Map
Chapter 3 Programming Languages
Chapter 4 Data Munging: String Manipulation, Regular Expressions, and Data Cleaning
Chapter 5 Visualizations and Simple Metrics
Chapter 6 Machine Learning Overview
Chapter 7 Interlude: Feature Extraction Ideas
Chapter 8 Machine Learning Classification
Chapter 9 Technical Communication and Documentation

Part II Stuff You Still Need to Know
Chapter 10 Unsupervised Learning: Clustering and Dimensionality Reduction
Chapter 11 Regression
Chapter 12 Data Encodings and File Formats
Chapter 13 Big Data
Chapter 14 Databases
Chapter 15 Software Engineering Best Practices
Chapter 16 Natural Language Processing
Chapter 17 Time Series Analysis
Chapter 18 Probability
Chapter 19 Statistics
Chapter 20 Programming Language Concepts
Chapter 21 Performance and Computer Memory

Part III Specialized or Advanced Topics
Chapter 22 Computer Memory and Data Structures
Chapter 23 Maximum Likelihood Estimation and Optimization
Chapter 24 Advanced Classifiers
Chapter 25 Stochastic Modeling
Chapter 25a Parting Words: Your Future as a Data Scientist





Keywords: Data Science handbook Science Wiley Data discipline necessary technical overview business


2
richardgu26, posted 2017-1-30 14:37:02
Thanks to the original poster for sharing!

3
line_us, posted 2017-1-30 15:06:56
Support for sharing this.

4
飞天玄舞6, posted 2017-2-1 17:30:53
Good book!

5
pisco008, posted 2017-2-4 11:52:58
Thanks for sharing.

6
csmpaul, posted 2017-2-16 07:29:48
Thanks to the original poster for sharing this.

7
铁锷未残 (verified student), posted 2017-3-17 14:17:01
Thanks for sharing.

8
maxiaoan (verified professional), posted 2017-3-17 18:54:46 from mobile
Thanks for sharing.

9
jerry22880 (verified professional), posted 2017-4-15 10:05:43

10
johyw, posted 2017-4-19 05:58:17
A new book, thanks!
