Fundamentals of Hadoop
- Big Data
  - What is Big Data
  - History
  - Real life use cases
- Distributed Systems
  - What are Distributed Systems
  - Real life use cases
  - Transparencies in Distributed Systems
- Hadoop
  - Evolution of Hadoop
  - Motivation and Need for Hadoop
- HDFS
  - Motivation for HDFS
  - Architecture
  - Interacting with HDFS
- MapReduce
  - MapReduce concepts with examples
  - Code walkthrough - WordCount example
  - Compile & run your first MR application
  - Modify WC example to OddEven
  - Write MR application for ‘Sum of squares of OddEvenPrime’
  - MR Dataflow - Shuffle & Sort
  - MR example with multiple values per output key
  - Quiz with MapReduce examples - Aadhaar use case
- YARN
  - Hadoop 1.x vs. Hadoop 2.x
  - YARN architecture, MR1 vs. YARN
  - Failure Handling - NN, RM, NM, Container, Speculative execution
  - Anatomy of MR Job
- Misc
  - Setting up Hadoop Cluster
  - Running on Big Data
  - Advanced MR Application hands-on