楼主: 牛尾巴
7329 64

【2015新书】 Clean Data - Data Science Strategies for Tackling Dirty Data   [推广有奖]

已卖:10458份资源

泰斗

38%

还不是VIP/贵宾

-

TA的文库  其他...

最新e书

2018新书

2017新书

威望
8
论坛币
630078 个
通用积分
57023.8551
学术水平
12700 点
热心指数
12976 点
信用等级
12465 点
经验
569184 点
帖子
9169
精华
66
在线时间
13174 小时
注册时间
2008-2-13
最后登录
2025-9-22

特级学术勋章 特级热心勋章 特级信用勋章 高级学术勋章 高级热心勋章 高级信用勋章

楼主
牛尾巴 发表于 2015-9-14 21:27:16 |AI写论文

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币
图书名称:Clean Data - Data Science Strategies for Tackling Dirty Data
作者:
Megan Squire
出版社:Packt Publishing
页数:267
出版时间:
September 2015                           
语言:English

格式:pdf
内容简介:
Key Features Grow your data science expertise by filling your toolbox with proven strategies for a wide variety of cleaning challengesFamiliarize yourself with the crucial data cleaning processes, and share your own clean data sets with othersComplete real-world projects using data from Twitter and Stack OverflowBook Description
Is much of your time spent doing tedious tasks such as cleaning dirty data, accounting for lost data, and preparing data to be used by others? If so, then having the right tools makes a critical difference, and will be a great investment as you grow your data science expertise.

The book starts by highlighting the importance of data cleaning in data science, and will show you how to reap rewards from reforming your cleaning process. Next, you will cement your knowledge of the basic concepts that the rest of the book relies on: file formats, data types, and character encodings. You will also learn how to extract and clean data stored in RDBMS, web files, and PDF documents, through practical examples.

At the end of the book, you will be given a chance to tackle a couple of real-world projects.
What you will learnUnderstand the role of data cleaning in the overall data science processLearn the basics of file formats, data types, and character encodings to clean data properlyMaster critical features of the spreadsheet and text editor for organizing and manipulating dataConvert data from one common format to another, including JSON, CSV, and some special-purpose formatsImplement three different strategies for parsing and cleaning data found in HTML files on the WebReveal the mysteries of PDF documents and learn how to pull out just the data you wantDevelop a range of solutions for detecting and cleaning bad data stored in an RDBMSCreate your own clean data sets that can be packaged, licensed, and shared with othersUse the tools from this book to complete two real-world projects using data from Twitter and Stack OverflowAbout the Author
Megan Squire is a professor of computing sciences at Elon University. She has been collecting and cleaning dirty data for two decades. She is also the leader of FLOSSmole.org, a research project to collect data and analyze it in order to learn how free, libre, and open source software is made.

回复免费:

本帖隐藏的内容

Clean Data - Data Science Strategies for Tackling Dirty Data.rar (5.52 MB) 本附件包括:
  • Clean Data - Data Science Strategies for Tackling Dirty Data.pdf



二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

关键词:Data Science Strategies tackling Science Strateg yourself English science 出版社 share

已有 1 人评分经验 学术水平 热心指数 信用等级 收起 理由
kychan + 100 + 1 + 1 + 1 精彩帖子

总评分: 经验 + 100  学术水平 + 1  热心指数 + 1  信用等级 + 1   查看全部评分

本帖被以下文库推荐

沙发
econ8008 发表于 2015-9-14 21:32:03
看看看看

藤椅
auirzxp 学生认证  发表于 2015-9-14 21:36:12
提示: 作者被禁止或删除 内容自动屏蔽

板凳
feng026 发表于 2015-9-14 21:52:32
Great, Thanks

报纸
Crsky7 发表于 2015-9-14 21:59:35
Clean Data - Data Science Strategies for Tackling Dirty Data

地板
ekscheng 发表于 2015-9-14 22:18:40

7
Enthuse 发表于 2015-9-14 22:19:25
thanks ..

8
xinewo 发表于 2015-9-15 07:11:54
look look

9
rrjj101022 发表于 2015-9-15 07:20:11
谢谢分享~~~

10
11205010029 发表于 2015-9-15 08:48:43
感谢分享

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
加好友,备注jltj
拉您入交流群
GMT+8, 2025-12-20 21:10