楼主: igs816
8691 31

Learning PySpark by Tomasz Drabas [推广有奖]

已卖:261246份资源

泰斗

6%

还不是VIP/贵宾

-

威望
9
论坛币
1762873 个
通用积分
20526.5467
学术水平
2754 点
热心指数
3477 点
信用等级
2565 点
经验
485149 点
帖子
5457
精华
52
在线时间
3910 小时
注册时间
2007-8-6
最后登录
2026-1-1

高级学术勋章 特级学术勋章 高级信用勋章 特级信用勋章 高级热心勋章 特级热心勋章

楼主
igs816 在职认证  发表于 2017-3-4 11:46:39 |AI写论文

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币
GBbMa2GtgvcsD8I9S2YR0fACTwFrbl5S.jpg

Learning PySpark by Tomasz Drabas
English | 27 Feb. 2017 | ISBN: 1786463709 | 274 Pages | EPUB/PDF (conv) | 21.67 MB
Build data-intensive applications locally and deploy at scale using the combined powers of Python and Spark 2.0.

                

About This Book

Learn why and how you can efficiently use Python to process data and build machine learning models in Apache Spark 2.0
Develop and deploy efficient, scalable real-time Spark solutions
Take your understanding of using Spark with Python to the next level with this jump start guide

Who This Book Is For

If you are a Python developer who wants to learn about the Apache Spark 2.0 ecosystem, this book is for you. A firm understanding of Python is expected to get the best out of the book. Familiarity with Spark would be useful, but is not mandatory.

What You Will Learn

Learn about Apache Spark and the Spark 2.0 architecture
Build and interact with Spark DataFrames using Spark SQL
Learn how to solve graph and deep learning problems using GraphFrames and TensorFrames respectively
Read, transform, and understand data and use it to train machine learning models
Build machine learning models with MLlib and ML
Learn how to submit your applications programmatically using spark-submit
Deploy locally built                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      applications to a cluster

In Detail

Apache Spark is an open source framework for efficient cluster computing with a strong interface for data parallelism and fault tolerance. This book will show you how to leverage the power of Python and put it to use in the Spark ecosystem. You will start by getting a firm understanding of the Spark 2.0 architecture and how to set up a Python environment for Spark.

You will get familiar with the modules available in PySpark. You will learn how to abstract data with RDDs and DataFrames and understand the streaming capabilities of PySpark. Also, you will get a thorough overview of machine learning capabilities of PySpark using ML and MLlib, graph processing using GraphFrames, and polyglot persistence using Blaze. Finally, you will learn how to deploy your applications to the cloud using the spark-submit command.

By the end of this book, you will have established a firm understanding of the Spark Python API and how it can be used to build data-intensive applications.

Style and approach

This book takes a very comprehensive, step-by-step approach so you understand how the Spark ecosystem can be used with Python to develop efficient, scalable solutions. Every chapter is standalone and written in a very easy-to-understand manner, with a focus on both the hows and the whys of each concept.

本帖隐藏的内容

Learning PySpark..rar (19.62 MB, 需要: 10 个论坛币) 本附件包括:
  • Learning PySpark.pdf
  • Learning PySpark.epub


二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝


本帖被以下文库推荐

沙发
Nicolle(真实交易用户) 学生认证  发表于 2017-3-4 13:27:09
提示: 作者被禁止或删除 内容自动屏蔽

藤椅
20115326(真实交易用户) 学生认证  发表于 2017-3-4 17:26:07
好书,学习了

板凳
lwell20(真实交易用户) 发表于 2017-3-4 18:19:43

报纸
啸傲江弧(未真实交易用户) 发表于 2017-3-5 07:14:44
Thank you for sharing!

地板
w-long(真实交易用户) 发表于 2017-3-5 08:52:29 来自手机
Learning PySpark

7
MouJack007(未真实交易用户) 发表于 2017-3-5 13:34:29
谢谢楼主分享!

8
MouJack007(未真实交易用户) 发表于 2017-3-5 13:34:45

9
白虎(真实交易用户) 发表于 2017-3-5 13:39:47 来自手机
Python Spark

10
qingxunz(真实交易用户) 发表于 2017-3-6 03:32:52
thanks

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
加好友,备注cda
拉您进交流群
GMT+8, 2026-1-2 12:51