Analyzing Big Data with SQL
![](https://cloud-hox.com/wp-content/uploads/2024/01/abd.jpg)
What Will You Learn?
- Understand the basics of SELECT statements
- Understand how and why to filter results
- Explore grouping and aggregation to answer analytic questions
- Work with sorting and limiting results
Course Content
Module 1: Orientation to SQL on Big Data
- Welcome to the Course
- Review and Preparation
- Using the Hue Query Editors
- Running SQL Utility Statements
- Running SQL SELECT Statements
- Understanding Different SQL Interfaces
- Overview of Beeline and Impala Shell
- Using Beeline
- Using Impala Shell
5 readings
- Instructions for Downloading and Installing the Exercise Environment
- Troubleshooting the VM
- (Optional) What about Spark SQL?
- Expectations for Learners
- (Optional) Using Other SQL Engines
2 quizzes
- Week 1 Core Quiz
- Week 1 Honors Quiz
1 discussion prompt
- Your Expectations
Module 2: SQL SELECT Essentials
- Introduction
- SQL SELECT Building Blocks
- Introduction to the SELECT List
- Expressions and Operators
- Data Types
- Column Aliases
- Built-In Functions
- Data Type Conversion
- The DISTINCT Keyword
- Introduction to the FROM Clause
- Identifiers
- Formatting SELECT Statements
- Using Beeline in Non-Interactive Mode
- Using Impala Shell in Non-Interactive Mode
- Formatting the Output of Beeline and Impala Shell
- Saving Hive and Impala Query Results to a File
4 readings
- Order of Operations
- Division and Modulo Operators
- Common String Functions
- Case (In)Sensitivity in SQL
2 quizzes
- Week 2 Core Quiz
- Week 2 Honors Quiz
1 discussion prompt
- Data Analysis and You
Module 3: Filtering Data
- Introduction
- About the Datasets
- Introduction to the WHERE Clause
- Using Expressions in the WHERE Clause
- Comparison Operators
- Data Types and Precision
- Logical Operators
- Other Relational Operators
- Understanding Missing Values
- Handling Missing Values
- Conditional Functions
- Using Variables with Beeline and Impala Shell
- Calling Beeline and Impala Shell from Scripts
- Querying Hive and Impala in Scripts and Applications
6 readings
- Data Reference
- (Optional) Unicode Characters
- Working with Literal Strings
- Missing Values with Logical Operators
- Missing Values in String Columns
- (Optional Exercise) Change VM Desktop Color
2 quizzes
- Week 3 Core Quiz
- Week 3 Honors Quiz
1 discussion prompt
- Filtering Examples
Module 4: Grouping and Aggregating Data
- Introduction
- Introduction to Aggregation
- Common Aggregate Functions
- Using Aggregate Functions in the SELECT Statement
- Introduction to the GROUP BY Clause
- Choosing an Aggregate Function and Grouping Column
- Grouping Expressions
- Grouping and Aggregation, Together and Separately
- NULL Values in Grouping and Aggregation
- The COUNT Function
- Tips for Applying Grouping and Aggregation
- Filtering on Aggregates
- The HAVING Clause
- Understanding Hive and Impala Version Differences
- Understanding Hue Version Differences
6 readings
- COUNT(*) and SUM(1)
- Interpreting Aggregates: Populations and Samples
- The least and greatest Functions
- Why Aggregate Expressions Ignore NULL Values
- (Optional) Shortcuts for Grouping
- How Grouping and Aggregation Can Mislead
2 quizzes
- Week 4 Core Quiz
- Week 4 Honors Quiz
2 discussion prompts
- Grouping and Aggregation Examples
- The Analytic Journey
Module 5: Sorting and Limiting Data
- Introduction
- Introduction to the ORDER BY Clause
- Controlling Sort Order
- Ordering Expressions
- Missing Values in Ordered Results
- Using ORDER BY with Hive and Impala
- Introduction to the LIMIT Clause
- When to Use the LIMIT Clause
- Using LIMIT with ORDER BY
- Using LIMIT for Pagination
- Review
- How to Effectively Use the Hive and Impala Documentation
- Tips for Using the Hive Documentation
- Tips for Using the Impala Documentation
1 reading
- Ordering by String Columns
2 quizzes
- Week 5 Core Quiz
- Week 5 Honors Quiz
1 discussion prompt
- Questions?
Module 6: Combining Data
- Introduction
- Combining Query Results with the UNION Operator
- Using ORDER BY and LIMIT with UNION
- Introduction to Joins
- Join Syntax
- Inner Joins
- Outer Joins
- Conclusion
- Handling NULL Values in Join Key Columns
- Non-Equijoins
- Cross Joins
- Left Semi-Joins
5 readings
- Missing or Truncated Values from Type Conversion
- Using UNION to Combine Three or More Results
- Alternative Join Syntax
- Joining Three or More Tables
- Specifying Two or More Join Conditions
2 quizzes
- Week 6 Core Quiz
- Week 6 Honors Quiz
1 peer review
- Analyzing Big Data with SQL
1 discussion prompt
- Get Excited!