Original poster: jasonwu24

[Book Introduction] [2017 New Book] Taming Big Data with Apache Spark and Python – Hands On!

Original post
jasonwu24 (employment verified) posted on 2017-7-11 16:19:57

  • Title: Taming Big Data with Apache Spark and Python – Hands On!
  • Author: Frank Kane
  • Length: 81 pages
  • Edition: 1
  • Language: English
  • Publisher: Packt Publishing
  • Publication Date: 2017-07-06
  • ISBN-10: 1787287947
  • ISBN-13: 9781787287945



Packt.Taming.Big.Data.with.Apache.Spark.and.Python.1787287947.pdf (5.97 MB, requires 5 forum coins)
Packt.Taming.Big.Data.with.Apache.Spark.and.Python.1787287947_Code.zip (950.09 KB)


Key Features
  • Understand how Spark can be distributed across computing clusters
  • Develop and run Spark jobs efficiently using Python
  • A hands-on tutorial with over 15 real-world examples teaching you Big Data processing with Spark

Book Description
Apache Spark has emerged as the next big thing in the Big Data domain - rising from an up-and-coming technology to an established superstar in just a matter of years. Spark allows you to quickly extract actionable insights from large amounts of data in real time. This book is your companion for learning Apache Spark in a hands-on manner. Start by understanding how to set up Spark on a single system or on a cluster. From analyzing large data sets with Spark RDDs to developing and running effective Spark jobs quickly using Python, this course covers the core workflow end to end. Packed with over 15 interactive, fun-filled examples relevant to the real world, the course will help you understand the Spark ecosystem and implement production-grade, real-time Spark projects.


What you will learn
  • Learn how to frame Big Data problems as Spark problems
  • Install and run Apache Spark on your computer or on a cluster
  • Analyze large data sets across many CPUs using Spark's Resilient Distributed Datasets
  • Implement machine learning on Spark using the MLlib library
  • Process continuous streams of data in real time using the Spark Streaming module
  • Perform complex network analysis using Spark's GraphX library
  • Use Amazon's Elastic MapReduce service to run your Spark jobs on a cluster
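
For readers unfamiliar with the RDD style mentioned in the list above, here is a minimal sketch of the map-then-reduce-by-key pattern that Spark jobs are often built on. It is not taken from the book: it uses plain Python as a stand-in so it runs without a Spark installation, and the helper names `map_reduce_by_key` and `word_pairs` are hypothetical. In PySpark the same idea is usually expressed with the RDD methods `flatMap` and `reduceByKey`.

```python
from collections import defaultdict
from functools import reduce


def map_reduce_by_key(records, mapper, reducer):
    """Plain-Python stand-in (not Spark API) for a flatMap + reduceByKey job.

    mapper turns one record into a list of (key, value) pairs;
    reducer combines two values that share the same key.
    """
    grouped = defaultdict(list)
    for record in records:
        for key, value in mapper(record):
            grouped[key].append(value)
    # Fold the values collected for each key into a single result.
    return {key: reduce(reducer, values) for key, values in grouped.items()}


def word_pairs(line):
    """Emit a (word, 1) pair for every whitespace-separated word."""
    return [(word.lower(), 1) for word in line.split()]


# Illustrative input only; a real job would read from a distributed data set.
lines = ["spark makes big data simple", "big data with spark and python"]
counts = map_reduce_by_key(lines, word_pairs, lambda a, b: a + b)
print(counts)
```

On a cluster, Spark would partition the input and run the mapper and reducer in parallel across workers; the sketch only shows the shape of the computation on a single machine.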

Table of Contents
Chapter 1. Getting Started with Spark
Chapter 2. Spark Basics and Spark Examples
Chapter 3. Advanced Examples of Spark Programs
Chapter 4. Running Spark on a Cluster
Chapter 5. SparkSQL, DataFrames, and DataSets
Chapter 6. Other Spark Technologies and Libraries
Chapter 7. Where to Go From Here? – Learning More About Spark and Data Science

Keywords: Apache Spark, Big Data, taming, Apache, Python

Reply 1
西门高 (no verified transactions) posted on 2017-7-11 16:25:28
Thanks for sharing

Reply 2
sukiyou2000 (no verified transactions) posted on 2017-7-12 08:27:46
Many thanks for sharing!
