Fast Data Processing with Spark - Second Edition covers how to write distributed programs with Spark. The book will guide you through every step required to write effective distributed programs from setting up your cluster and interactively exploring the API to developing analytics applications and tuning them for your purposes.
Table of Contents
Chapter 1: Installing Spark and Setting up your Cluster
Chapter 2: Using the Spark Shell
Chapter 3: Building and Running a Spark Application
Chapter 4: Creating a SparkContext
Chapter 5: Loading and Saving Data in Spark
Chapter 6: Manipulating your RDD
Chapter 7: Spark SQL
Chapter 8: Spark with Big Data
Chapter 9: Machine Learning Using Spark MLlib
Chapter 10: Testing
Chapter 11: Tips and Tricks
- Fast_Data_Processing_with_Spark_2nd_Edition.pdf