请选择 进入手机版 | 继续访问电脑版
楼主: igs816
5927 31

Learning PySpark by Tomasz Drabas [推广有奖]

泰斗

5%

还不是VIP/贵宾

-

威望
9
论坛币
2694289 个
通用积分
18515.3563
学术水平
2743 点
热心指数
3466 点
信用等级
2559 点
经验
484572 点
帖子
5413
精华
52
在线时间
3585 小时
注册时间
2007-8-6
最后登录
2024-4-16

高级学术勋章 特级学术勋章 高级信用勋章 特级信用勋章 高级热心勋章 特级热心勋章

igs816 在职认证  发表于 2017-3-4 11:46:39 |显示全部楼层 |坛友微信交流群
相似文件 换一批

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币
GBbMa2GtgvcsD8I9S2YR0fACTwFrbl5S.jpg


Learning PySpark by Tomasz Drabas
English | 27 Feb. 2017 | ISBN: 1786463709 | 274 Pages | EPUB/PDF (conv) | 21.67 MB
Build data-intensive applications locally and deploy at scale using the combined powers of Python and Spark 2.0.

                

About This Book

Learn why and how you can efficiently use Python to process data and build machine learning models in Apache Spark 2.0
Develop and deploy efficient, scalable real-time Spark solutions
Take your understanding of using Spark with Python to the next level with this jump start guide

Who This Book Is For

If you are a Python developer who wants to learn about the Apache Spark 2.0 ecosystem, this book is for you. A firm understanding of Python is expected to get the best out of the book. Familiarity with Spark would be useful, but is not mandatory.

What You Will Learn

Learn about Apache Spark and the Spark 2.0 architecture
Build and interact with Spark DataFrames using Spark SQL
Learn how to solve graph and deep learning problems using GraphFrames and TensorFrames respectively
Read, transform, and understand data and use it to train machine learning models
Build machine learning models with MLlib and ML
Learn how to submit your applications programmatically using spark-submit
Deploy locally built                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      applications to a cluster

In Detail

Apache Spark is an open source framework for efficient cluster computing with a strong interface for data parallelism and fault tolerance. This book will show you how to leverage the power of Python and put it to use in the Spark ecosystem. You will start by getting a firm understanding of the Spark 2.0 architecture and how to set up a Python environment for Spark.

You will get familiar with the modules available in PySpark. You will learn how to abstract data with RDDs and DataFrames and understand the streaming capabilities of PySpark. Also, you will get a thorough overview of machine learning capabilities of PySpark using ML and MLlib, graph processing using GraphFrames, and polyglot persistence using Blaze. Finally, you will learn how to deploy your applications to the cloud using the spark-submit command.

By the end of this book, you will have established a firm understanding of the Spark Python API and how it can be used to build data-intensive applications.

Style and approach

This book takes a very comprehensive, step-by-step approach so you understand how the Spark ecosystem can be used with Python to develop efficient, scalable solutions. Every chapter is standalone and written in a very easy-to-understand manner, with a focus on both the hows and the whys of each concept.

本帖隐藏的内容

Learning PySpark..rar (19.62 MB, 需要: 10 个论坛币) 本附件包括:
  • Learning PySpark.pdf
  • Learning PySpark.epub


二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝


本帖被以下文库推荐

Nicolle 学生认证  发表于 2017-3-4 13:27:09 |显示全部楼层 |坛友微信交流群
提示: 作者被禁止或删除 内容自动屏蔽

使用道具

20115326 学生认证  发表于 2017-3-4 17:26:07 |显示全部楼层 |坛友微信交流群
好书,学习了

使用道具

lwell20 发表于 2017-3-4 18:19:43 |显示全部楼层 |坛友微信交流群

使用道具

Thank you for sharing!

使用道具

w-long 发表于 2017-3-5 08:52:29 来自手机 |显示全部楼层 |坛友微信交流群
Learning PySpark

使用道具

MouJack007 发表于 2017-3-5 13:34:29 |显示全部楼层 |坛友微信交流群
谢谢楼主分享!

使用道具

MouJack007 发表于 2017-3-5 13:34:45 |显示全部楼层 |坛友微信交流群

使用道具

白虎 发表于 2017-3-5 13:39:47 来自手机 |显示全部楼层 |坛友微信交流群
Python Spark

使用道具

qingxunz 发表于 2017-3-6 03:32:52 |显示全部楼层 |坛友微信交流群
thanks

使用道具

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
加好友,备注cda
拉您进交流群

京ICP备16021002-2号 京B2-20170662号 京公网安备 11010802022788号 论坛法律顾问:王进律师 知识产权保护声明   免责及隐私声明

GMT+8, 2024-4-16 22:11