Author:Andreas François Vermeulen, Ankur Gupta, David Kjerrumgaard, Scott Shaw
Isbn:1484202724
Year:2016
Pages:265
Language:English
File size:9.3 MB
File format:PDF
Category:Programming
Book Description:
Dive into the world of SQL on Hadoop and get the most out of your Hive data warehouses. This book is your go-to resource for using Hive: authors Scott Shaw, Ankur Gupta, David Kjerrumgaard, and Andreas Francois Vermeulen take you through learning HiveQL, the SQL-like language specific to Hive, to analyze, export, and massage the data stored across your Hadoop environment. From deploying Hive on your hardware or virtual machine and setting up its initial configuration to learning how Hive interacts with Hadoop, MapReduce, Tez and other big data technologies,Practical Hive gives you a detailed treatment of the software.
In addition, this book discusses the value of open source software, Hive performance tuning, and how to leverage semi-structured and unstructured data.
What You Will Learn
Install and configure Hive for new and existing datasetsPerform DDL operationsExecute efficient DML operationsUse tables, partitions, buckets, and user-defined functionsDiscover performance tuning tips and Hive best practices