SQL on Hadoop – Analyzing Big Data with Hive
Baidu Netdisk download link:
http://pan.baidu.com/s/1pJ80qd9
Password: 7li1
This course teaches the Hive query language and how to apply it to common Big Data problems. It begins with an introduction to distributed computing, Hadoop, and MapReduce fundamentals, then covers Apache Hive in detail, including the latest features released with Hive 0.11. Hive provides an SQL-like query language that can be used with Hadoop and with NoSQL databases such as HBase and Cassandra. For developers and analysts alike, the course tackles a few big questions about big data: Why does this technology exist, and why do I need it? How can I get the best out of it using something familiar like SQL? How does it all fit together in an ever-evolving ecosystem? The course also presents challenges you might face when solving real production problems and shows how Hive makes those tasks easier.
│ sql-hadoop-analyzing-big-data-hive.zip
│
├───01. Introduction to Hadoop
│ 01. Introduction.wmv
│ 02. Motivation for Hadoop.wmv
│ 03. Distributed Computing Challenges.wmv
│ 04. Hadoop File System (HDFS).wmv
│ 05. MapReduce.wmv
│ 06. Word Count Example.wmv
│ 07. Demo Basic Hadoop Commands and Environment Setup.wmv
│ 08. Summary.wmv
│
├───02. Introduction to Hive
│ 01. Introduction.wmv
│ 02. Hive Motivation.wmv
│ 03. Hive Architecture.wmv
│ 04. Hive Principles – Schema on Read.wmv
│ 05. Hive Principles – The Hive Warehouse.wmv
│ 06. Hive Query Language Basics – SELECT and Sub Queries.wmv
│ 07. Creating Databases and Tables with HiveQL.wmv
│ 08. Demo Working with Hive Tables and Loading Data into Warehouse.wmv
│ 09. Loading Data – Hive Managed and External Tables.wmv
│ 10. Demo External Tables and Create Table Alternatives.wmv
│ 11. Summary.wmv
│
├───03. Hive Query Language
│ 01. Introduction.wmv
│ 02. Data Types.wmv
│ 03. Type Conversions.wmv
│ 04. Managed Partitioned Tables.wmv
│ 05. External Partitioned Tables.wmv
│ 06. Demo Table Partitioning.wmv
│ 07. Multi Inserts and Dynamic Partition Inserts.wmv
│ 08. Demo Loading Data Use Case.wmv
│ 09. Data Retrieval – Group By and Functions.wmv
│ 10. Sorting and Controlling Data Flow.wmv
│ 11. The CLI and Variable Substitution.wmv
│ 12. Summary.wmv
│
├───04. Advanced HiveQL
│ 01. Introduction.wmv
│ 02. Bucketing.wmv
│ 03. Bucket and Block Sampling.wmv
│ 04. Joins.wmv
│ 05. Joins in Depth and Join Optimizations.wmv
│ 06. Map-side Joins for Bucketed Tables.wmv
│ 07. Distributed Cache.wmv
│ 08. UDTFs, Explode and Lateral View.wmv
│ 09. Demo Extending Hive – Creating Your own UDF.wmv
│ 10. Demo Extending Hive – Compiling and Testing Custom UDF.wmv
│ 11. Extending Hive – Custom UDF Recap.wmv
│ 12. Demo Hive Initialization File.wmv
│ 13. Accessing The Distributed Cache.wmv
│ 14. Hadoop Streaming and Transform().wmv
│ 15. Windowing and Analytics Functions.wmv
│ 16. Demo Putting it All Together Using Transform.wmv
│ 17. Demo Analytics Functions.wmv
│ 18. Demo Ranking Functions.wmv
│ 19. Summary.wmv
│
└───05. Storage and The Eco-System
  01. Create Table Statement – File Formats and SerDes.wmv
  02. HCatalog.wmv
  03. Sqoop.wmv
  04. DistCP.wmv
  05. Hadoop Eco-System Projects.wmv
  06. References and Resources.wmv
  07. Summary.wmv