Fundamentals of Hadoop

  1. Big Data
    • What is Big Data
    • History
    • Real life use cases
  2. Distributed Systems
    • What are Distributed Systems
    • Real life use cases
    • Transparencies in Distributed Systems
  3. Hadoop
    • Evolution of Hadoop
    • Motivation and Need for Hadoop
  4. HDFS
    • Motivation for Hadoop
    • Architecture
    • Interacting with HDFS
  5. MapReduce
    • MapReduce concepts with examples
    • Code walkthrough - WordCount example
    • Compile & Run your first MR application
    • Modify WC example to OddEven
    • Write MR application for ‘Sum of square of OddEvenPrime’
    • MR Dataflow - Shuffle & Sort
    • MR example with multiple values per output key
    • Quiz with MapReduce examples - Aadhar usecase
  6. YARN
    • Hadoop1.x vs. Hadoop2.x
    • YARN architecture, MR1 vs. YARN
    • Failure Handling - NN, RM, NM, Container, Speculative execution
    • Anatomy of MR Job
  7. Misc
    • Setting up Hadoop Cluster
    • Running on Big Data
    • Advanced MR Application hands-on

Back to Home Page