Introduction to Big Data with Spark and Hadoop

What Will You Learn?

  • Explain the impact of big data, including use cases, tools, and processing methods.
  • Describe Apache Hadoop architecture, ecosystem, practices, and user-related applications, including Hive, HDFS, HBase, Spark, and MapReduce.
  • Apply Spark programming basics, including parallel programming with DataFrames, Datasets, and Spark SQL.
  • Use Spark’s RDDs and Datasets, optimize Spark SQL using Catalyst and Tungsten, and use Spark’s development and runtime environment options.

Course Content

Module 1: What is Big Data?

  • Course Introduction
  • What is Big Data?
  • Impact of Big Data
  • Parallel Processing, Scaling, and Data Parallelism
  • Big Data Tools and Ecosystem
  • Open Source and Big Data
  • Beyond the Hype
  • Big Data Use Cases

1 reading

2 assignments

1 plugin

Module 2: Introduction to the Hadoop Ecosystem

1 reading

2 assignments

3 app items

2 plugins

Module 3: Apache Spark

1 reading

2 assignments

1 app item

2 plugins

Module 4: DataFrames and Spark SQL

1 reading

2 assignments

2 app items

4 plugins

Module 5: Development and Runtime Environment Options

2 readings

3 assignments

2 app items

4 plugins

Module 6: Monitoring and Tuning

1 reading

2 assignments

1 app item

3 plugins

Module 7: Final Project and Assessment

Student Ratings & Reviews

No Review Yet