Book Description
If you've successfully used Apache Spark to solve medium sized-problems, but still struggle to realize the "Spark promise" of unparalleled performance on big data, this book is for you. High Performance Spark shows you how take advantage of Spark at scale, so you can grow beyond the novice-level. It's ideal for software engineers, data engineers, developers, and system administrators working with large-scale data applications.
Learn how to make Spark jobs run faster; Productionize exploratory data science with Spark; Handle even larger data sets with Spark; Reduce pipeline running times for faster insights.
Book Details
Publisher: O'Reilly Media
By: Holden Karau, Rachel Warren
ISBN: 978-1-49194-320-5
Year: 2016
Pages: 175
Language: English
File size: 5.8 MB
File format: PDF