Big Data Hadoop | Demo Video | Outline | Duration: 45 Hours | Classroom & Online Training

Big Data & Hadoop Course Outline

Big Data & Hadoop

  • What is Big Data
  • Sources of Big Data
  • IBM Definition for Big Data
  • Definition of Hadoop
  • History of Hadoop
  • Features of Hadoop
  • Hadoop Eco-System
  • Other Hadoop related products of Apache

Hadoop Distributed File System

  • Distributed File System
  • Definition of HDFS
  • Where not to use HDFS
  • HDFS Concepts
  • Hadoop Architecture
  • NameNode, DataNode & Secondary NameNode (SNN)
  • HDFS Federation
  • HDFS High Availability
  • Hadoop IO Operations (Read & Write)
  • HDFS Rack Awareness
  • Hadoop Modes
  • Hadoop Configuration
  • Linux & Hadoop Commands
  • HDFS Interview Questions and Real-Time Use Cases

MapReduce

  • What is MapReduce & Key-Value Concepts
  • Traditional Solution
  • MapReduce Solution
  • Input & Output of M/R
  • MapReduce Phases
  • Word Count Flowchart
  • Advantages of MapReduce
  • Input Split in M/R
  • Box Classes (Writable Wrapper Types) in Hadoop
  • Execution of Word Count Program
  • Combiner
  • Partitioner
  • MapReduce Joins
  • Distributed Cache
  • Counters
  • MapReduce Formats (Input & Output)
  • Secondary Sort in MapReduce
  • MapReduce Interview Questions and Real-Time Use Cases

YARN

  • Challenges in Hadoop 1.x
  • Hadoop 2.x Features
  • Apache YARN
  • Hadoop 2.x Eco-system
  • Hadoop 2.x High Availability
  • Anatomy of YARN Application Run
  • Run a MapReduce application on YARN

Hive

  • Applications of Hive
  • Advantages & Disadvantages of Hive
  • Hive Metastore
  • Hive Architecture
  • Hive Concepts
  • Hive Data Types
  • Demonstration of Database Commands
  • Hive Tables
  • Demonstration of Create, Rename, Alter & Drop
  • Partitions in Hive
  • Bucketing in Hive
  • Hive Joins
  • Complex Data Types
  • Demonstration of External Table
  • SubQueries
  • Views
  • User Defined Functions (UDFs)

Pig

  • Need for Pig
  • Pig versus MapReduce
  • Where to use Pig
  • Where NOT to use Pig
  • What is Pig
  • Applications of Pig
  • Pig Installation
  • Execution Types
  • Running Pig programs
  • Pig data types
  • RDBMS Vs Pig
  • Comments in Pig
  • Case Sensitivity in Pig
  • Logical and Physical Plan
  • Pig Operators
  • Pig Built-in Functions
  • Diagnostic Operators in Pig
  • Special Joins in Pig
  • Pig UDFs
  • Pig Best Practices
  • Pig Interview Questions and Real-Time Use Cases

Sqoop

  • Introduction and Installation
  • Sqoop Tools
  • Sqoop Connectors
  • Creating a DB and table in MySQL
  • Loading the MySQL DB
  • Sqoop Import Process
  • Import Hive Data
  • Sqoop Export Process
  • Sqoop Compressions
  • Sqoop Interview Questions and Real-Time Use Cases

Scala

  • Why Scala
  • Functional programming & First Scala Program
  • Data Types
  • Variable
  • Conditional expressions
  • Scala Pattern Matching Example
  • Classes
  • Objects
  • Scala Constructors
  • Access Modifiers
  • Scala Method Overloading and Overriding Functions
  • Scala this and final Keywords
  • Scala Inheritance
  • Scala Abstract Classes and Traits
  • Scala Collections
  • Scala Exception Handling
  • Scala File Handling
  • Scala Multithreading
  • Scala Programs
  • Scala Interview Questions

Spark

  • MapReduce vs Spark
  • Apache Spark - By Definition
  • Features of Spark
  • Spark Deployment
  • Spark Core & Components
  • Spark Context & Invoking
  • Resilient Distributed Datasets (RDDs)
  • RDD Operations
  • RDD Persistence
  • Lazy Evaluation & Lineage Graph
  • Spark SQL
  • SchemaRDD, DataFrame & Datasets
  • Linking with SparkSQL
  • Initializing Spark SQL
  • Creating a DataFrame
  • Transformations, Actions, Laziness
  • Spark Streaming
  • Kafka with Spark Streaming
  • Flume with Spark Streaming
  • Spark Interview Questions and Real-Time Use Cases
  • Real-Time Project Discussion
