Introduction to Big Data
Categories: Big Data Specialization, Data Programs
What Will You Learn?
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate
Course Content
Module 1: Welcome
-
Welcome to the Big Data Specialization
00:00 -
Tell us about yourself and learn about your classmates
00:00
2 readings
-
By the end of this course you will be able to…
00:00 -
Optional: Watch this fun video about the San Diego Supercomputer Center!
00:00
1 discussion prompt
-
Let’s Discuss: Why are you taking this class?
00:00
Module 2: Big Data Why and Where?
-
What launched the Big Data era?
00:00 -
Applications: What makes big data valuable
00:00 -
Example: Saving lives with Big Data
00:00 -
Example: Using Big Data to Help Patients
00:00 -
A Sentiment Analysis Success Story: Meltwater helping Danone
00:00 -
Getting Started: Where Does Big Data Come From?
00:00 -
Machine-Generated Data: It’s Everywhere and There’s a Lot!
00:00 -
Machine-Generated Data: Advantages
00:00 -
Big Data Generated By People: The Unstructured Challenge
00:00 -
Big Data Generated By People: How Is It Being Used?
00:00 -
Organization-Generated Data: Structured but often siloed
00:00 -
Organization-Generated Data: Benefits Come From Combining With Other Data Types
00:00 -
The Key: Integrating Diverse Data
00:00
13 readings
-
Did you know?: 25 facts about big data
00:00 -
Slides: What Launched the Big Data Era?
00:00 -
Slides: Applications: What Makes Big Data Valuable?
00:00 -
Slides: Saving Lives With Big Data
00:00 -
Slides: Using Big Data to Help Patients
00:00 -
Extra Resources
00:00 -
Slides: Machine-Generated Data: It’s Everywhere and There’s a Lot!
00:00 -
Slides: Machine-Generated Data: Advantages
00:00 -
Slides: Big Data Generated By People: The Unstructured Challenge
00:00 -
Slides: Big Data Generated By People: How is it Being Used?
00:00 -
Slides: Organization-Generated Big Data: Structured But Often Siloed
00:00 -
Slides: Organizaton-Generated Big Data: Benefits
00:00 -
Slides: The Key – Integrating Diverse Data
00:00
1 quiz
-
Why Big Data and Where Did it Come From?
00:00
2 discussion prompts
-
Let’s Discuss: What application area interests you?
00:00 -
Let’s discuss: Who are you providing data to?
00:00
Module 3: Characteristics of Big Data and Dimensions of Scalability
-
Getting Started: Characteristics Of Big Data
00:00 -
Characteristics of Big Data – Volume
00:00 -
Characteristics of Big Data – Variety
00:00 -
Characteristics of Big Data – Velocity
00:00 -
Characteristics of Big Data – Veracity
00:00 -
Characteristics of Big Data – Valence
00:00 -
The Sixth V: Value
00:00
9 readings
-
What does astronomical scale mean?
00:00 -
A Small Definition of Big Data
00:00 -
Slides: Getting Started – Characteristics of Big Data
00:00 -
Slides: Characteristics of Big Data – Volume
00:00 -
Slides: Characteristics of Big Data – Variety
00:00 -
Slides: Characteristics of Big Data – Velocity
00:00 -
Slides: Characteristics of Big Data – Veracity
00:00 -
Slides: Characteristics of Big Data – Value
00:00 -
Slides: Characteristics of Big Data – Valence
00:00
1 quiz
-
V for the V’s of Big Data
00:00
2 discussion prompts
-
Practice: Writing Big Data questions
00:00 -
Let’s Discuss: Improving the Flamingo Game
00:00
Module 4: Data Science : Getting value out of Big Data
-
Data Science: Getting Value out of Big Data
00:00 -
Building a Big Data Strategy
00:00 -
How does big data science happen?: Five Components of Data Science
00:00 -
Asking the Right Questions
00:00 -
Steps in the Data Science Process
00:00 -
Step 1: Acquiring Data
00:00 -
Step 2-A: Exploring Data
00:00 -
Step 2-B: Pre-Processing Data
00:00 -
Step 3: Analyzing Data
00:00 -
Step 4: Communicating Results
00:00 -
Step 5: Turning Insights into Action
00:00
12 readings
-
Five P’s of Data Science
00:00 -
Slides: Getting Value Out of Big Data
00:00 -
Slides: Building a Big Data Strategy
00:00 -
Slides: The Five P’s of Data Science
00:00 -
Slides: Asking the Right Questions
00:00 -
Slides: Steps in the Data Science Process
00:00 -
Slides: Step 1 – Acquiring Data
00:00 -
Slides: Step 2A-Exploring Data
00:00 -
Slides: Step 2B-Preprocessing Data
00:00 -
Slides: Step 3-Data Analysis
00:00 -
Slides: Step 4-Communicating Results
00:00 -
Slides: Step 5-Turning Insights Into Action
00:00
1 quiz
-
Data Science 101
00:00
2 discussion prompts
-
Let’s Discuss: Thinking more deeply about the Ps
00:00 -
Let’s Discuss: Building a Team
00:00
Module 5: Foundations for Big Data Systems and Programming
-
Getting Started: Why worry about foundations?
00:00 -
What is a Distributed File System?
00:00 -
Scalable Computing over the Internet
00:00 -
Programming Models for Big Data
00:00
4 readings
-
Slides: Getting Started-Why Worry About Foundations?
00:00 -
Slides: What is a Distributed File System?
00:00 -
Slides: Scalable Computing Over the Internet
00:00 -
Slides: Programming Models for Big Data
00:00
1 quiz
-
Foundations for Big Data
00:00
Module 6: Systems: Getting Started with Hadoop
-
Hadoop: Why, Where and Who?
00:00 -
The Hadoop Ecosystem: Welcome to the zoo!
00:00 -
The Hadoop Distributed File System: A Storage System for Big Data
00:00 -
YARN: A Resource Manager for Hadoop
00:00 -
MapReduce: Simple Programming for Big Results
00:00 -
When to Reconsider Hadoop?
00:00 -
Cloud Computing: An Important Big Data Enabler
00:00 -
Cloud Service Models: An Exploration of Choices
00:00 -
Value From Hadoop and Pre-built Hadoop Images
00:00 -
Copy your data into the Hadoop Distributed File System (HDFS)
00:00 -
Run the WordCount program
00:00
8 readings
-
MapReduce in the Pasta Sauce Example
00:00 -
Slides for Getting Started With Hadoop
00:00 -
Downloading and Installing the Cloudera VM Instructions (Mac)
00:00 -
Downloading and Installing the Cloudera VM Instructions (Windows)
00:00 -
FAQ
00:00 -
Copy your data into the Hadoop Distributed File System (HDFS) Instructions
00:00 -
Run the WordCount program Instructions
00:00 -
How do I figure out how to run Hadoop MapReduce programs?
00:00
2 quizzes
-
Intro to Hadoop
00:00 -
Running Hadoop MapReduce Programs Quiz
00:00
1 peer review
-
Understand by Doing: MapReduce
00:00
1 discussion prompt
-
Let’s Discuss: Map Reduce in your life
00:00
Student Ratings & Reviews
No Review Yet