Apache Spark With Examples for Big Data Analytics

Features Includes:
  • Self-paced with Life Time Access
  • Certificate on Completion
  • Access on Android and iOS App

Course Preview Video

  • Categories

    All Development

  • Duration


  • 1 Students Enrolled


This course covers all the fundamentals you need to write complex Spark applications. By the end of this course you will get in-depth knowledge on Spark core,Spark SQL,Spark Streaming.

This course is divided into 9 modules

  • Dive Into Scala - Understand the basics of Scala that are required for programming Spark applications.Learn about the basic constructs of Scala such as variable types, control structures, collections,and more.
  • OOPS and Functional Programming in Scala - Learn about object oriented programming and functional programming techniques in Scala
  • Introduction to Apache Spark - Learn Spark Architecture,Spark Components and spark use-cases
  • Spark Basics - Learn how to configure/run spark in eclipse/intellij
  • Working with RDDs in Spark - Learn what is Resilient Distributed Dataset,Different types of actions and transformations which can be applied on RDDs
  • Aggregating Data with Pair RDDs - Learn how Pair RDD is different from RDD,Different types of actions and transformations which can be applied on Pair RDDs
  • Advanced Spark Concepts - Learn how Spark uses Broadcast variables and Accumulators to perform calculations,how persistence and partitioning helps to achieve performance
  • Spark SQL and Data Frames - Understand the difference between Dataframe and Dataset
  • Spark Streaming - Learn how to analyse massive amount of dataset on the fly

All the concepts are explained using hands-on examples.This course covers 10+ hands-on big data examples such as

  • Explore player data from 2014 world cup
  • Aggregate data from ebay online auction data
  • Understand different data points from Adhaar data
  • Develop application to analyse funds received by Indian startup
  • Explore the price trend by looking at the real estate data in California
  • Help retailer to find out valid and invalid purchase transactions of chain of stores in Bangalore
  • Write Spark program find out count of stores in each US region from USA states & Store locations data
  • Develop Spark Streaming application to perform Twitter Sentiment Analysis

Basic knowledge
  • Basic programming skills
  • A computer running Windows, OSX or Linux
  • The software needed for this course is freely available and detailed steps to install and configure software is include in the course

What will you learn
  • Get clear understanding of the limitations of MapReduce and role of Spark in overcoming these limitations
  • Understand fundamentals of Scala Programming Language and it’s features
  • Expertise in using RDD for creating applications in Spark
  • Mastering SQL queries using SparkSQL
  • Gain thorough understanding of Spark Streaming features
Course Curriculum
Number of Lectures: 41 Total Duration: 03:49:27

No Review Yet