Original poster: xpf7622

[Assignment] Spark: The Definitive Guide: Big data processing made simple

Book Description
Learn how to use, deploy, and maintain Apache Spark with this comprehensive guide, written by the creators of this open-source cluster-computing framework. With an emphasis on improvements and new features in Spark 2.0, authors Bill Chambers and Matei Zaharia break down Spark topics into distinct sections, each with unique goals.

You’ll explore the basic operations and common functions of Spark’s structured APIs, as well as Structured Streaming, a new high-level API for building end-to-end streaming applications. Developers and system administrators will learn the fundamentals of monitoring, tuning, and debugging Spark, and explore machine learning techniques and scenarios for employing MLlib, Spark’s scalable machine learning library.

  • Get a gentle overview of big data and Spark
  • Learn about DataFrames, SQL, and Datasets—Spark’s core APIs—through worked examples
  • Dive into Spark’s low-level APIs, RDDs, and execution of SQL and DataFrames
  • Understand how Spark runs on a cluster
  • Debug, monitor, and tune Spark clusters and applications
  • Learn the power of Spark’s Structured Streaming and MLlib for machine learning tasks
  • Explore the wider Spark ecosystem, including SparkR and Graph Analysis
  • Examine Spark deployment, including coverage of Spark in the Cloud

Contents
Chapter 1. A Gentle Introduction to Spark
Chapter 2. Structured API Overview
Chapter 3. Basic Structured Operations
Chapter 4. Working with Different Types of Data
Chapter 5. Aggregations
Chapter 6. Joins
Chapter 7. Data Sources
Chapter 8. Spark SQL
Chapter 9. Datasets
Chapter 10. Low Level API Overview
Chapter 11. Basic RDD Operations
Chapter 12. Advanced RDDs Operations
Chapter 13. Distributed Variables
Chapter 14. Advanced Analytics and Machine Learning
Chapter 15. Preprocessing and Feature Engineering
Chapter 16. Preprocessing
Chapter 17. Classification
Chapter 18. Regression
Chapter 19. Recommendation
Chapter 20. Clustering
Chapter 21. Graph Analysis
Chapter 22. Deep Learning

Download

OReilly Spark The Definitive Guide 1491912219 Early Release.pdf (4.46 MB, requires 15 forum coins)


Reply #1
晓七, posted on 2017-6-18 04:57:57
Thanks for sharing.
