
[Other] Data Analytics with Hadoop: An Introduction for Data Scientists


Original poster
igs816 (employment verified) posted on 2017-3-12 11:52:30


Data Analytics with Hadoop: An Introduction for Data Scientists  
Benjamin Bengfort, Jenny Kim
ISBN: 1491913703 | 2016 | PDF | 288 pages | 7 MB
                
Ready to use statistical and machine-learning techniques across large data sets? This practical guide shows you why the Hadoop ecosystem is perfect for the job. Instead of the deployment, operations, or software development usually associated with distributed computing, you’ll focus on the particular analyses you can build, the data warehousing techniques that Hadoop provides, and the higher-order data workflows this framework can produce.

Data scientists and analysts will learn how to perform a wide range of techniques, from writing MapReduce and Spark applications with Python to using advanced modeling and data management with Spark MLlib, Hive, and HBase. You’ll also learn about the analytical processes and data systems available to build and empower data products that can handle, and actually require, huge amounts of data.

Understand core concepts behind Hadoop and cluster computing
Use design patterns and parallel analytical algorithms to create distributed data analysis jobs
Learn about data management, mining, and warehousing in a distributed context using Apache Hive and HBase
Use Sqoop and Apache Flume to ingest data from relational databases
Program complex Hadoop and Spark applications with Apache Pig and Spark DataFrames
Perform machine learning techniques such as classification, clustering, and collaborative filtering with Spark’s MLlib

Hidden content of this post

Data Analytics with Hadoop - An Introduction for Data Scientists.pdf (6.62 MB, requires 10 forum coins)





2
Nicolle (verified trading user, student verified) posted on 2017-3-12 12:00:32
Note: the author has been banned or deleted; the content is automatically hidden.

3
auirzxp (unverified trading user, student verified) posted on 2017-3-12 12:05:17
Note: the author has been banned or deleted; the content is automatically hidden.

4
w-long (verified trading user) posted on 2017-3-12 12:46:32 from mobile
Data Analytics with Hadoop

5
MouJack007 (unverified trading user) posted on 2017-3-12 15:55:25
Thanks to the original poster for sharing!

6
MouJack007 (unverified trading user) posted on 2017-3-12 15:55:46

7
钱学森64 (unverified trading user) posted on 2017-3-12 16:33:21
Thanks for sharing.

8
ekscheng (unverified trading user) posted on 2017-3-12 17:13:55

9
终结天狼 (verified trading user, employment verified) posted on 2017-3-12 17:28:49
Data Analytics with Hadoop: An Introduction for Data Scientists

10
franky_sas (unverified trading user) posted on 2017-3-12 17:38:27
