About Course

Course Information

Data is an integral part of every organization, whether small or large, and maintaining it in a proper form has become more difficult.

The Hadoop framework helps with this process, and professionals who know Hadoop well are preferred for this task.

Hadoop Administration is one of the specializations within the Hadoop framework.

The Hadoop Administration online training by Online IT Guru teaches Hadoop concepts, starting with the basics of Apache Hadoop and the Hadoop cluster.

The course then covers Hadoop Architecture, Hadoop Installation, Hadoop Security, and Hadoop Culture in greater depth. Students also gain an understanding of HBase administration.

Along with this, the instructors at Online IT Guru help candidates gain a deep knowledge of HDFS and learn to:

  1. Plan and deploy a Hadoop cluster
  2. Manage, maintain, monitor, and troubleshoot a Hadoop cluster
  3. Understand Oozie and HCatalog/Hive

Course Content

Please Download for Detailed Course Content

Core topics of HADOOP ADMIN Online Course

Hadoop 1.x.x

- Introduction to Hadoop
- Parallel computing vs distributed computing
- How to install Hadoop on your system
- How to install a Hadoop cluster on multiple machines
- Introduction to Hadoop daemons: NameNode, DataNode, JobTracker, TaskTracker
- Exploring HDFS (Hadoop Distributed File System)
- Exploring the Apache HDFS web UI
- NameNode architecture (FS Image, replica placement)
- Secondary NameNode architecture
- DataNode architecture


YARN (Hadoop 2.x.x)

- Introduction to YARN (Hadoop 2.x.x)
- Hadoop 1 vs Hadoop 2
- Hadoop 2 installation
- Copying data from the local file system to HDFS
- Executing a Hadoop job on YARN
- Exploring the HDFS/YARN/Job history UI
- Hands-on exercise


Hadoop Administrative Tasks

- Routine administrative procedures
- Understanding dfsadmin and mradmin
- Block Scanner, HDFS Balancer
- Health check and safe mode
- Monitoring and debugging on a Hadoop cluster
- NameNode backup and recovery
- DataNode commissioning/decommissioning
- ACL (Access Control List)
- Upgrading Hadoop


MapReduce Architecture

- Exploring JobTracker/TaskTracker
- How to run a MapReduce job
- Exploring Mapper/Reducer/Combiner
- Shuffle: sort and partition
- Input/output formats
- Exploring the Apache MapReduce web UI
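
To give a feel for the Mapper, Shuffle (sort and partition), and Reducer topics listed above, here is a minimal sketch of a word count in plain Python. It only simulates the phases in a single process and does not use Hadoop or its API. All function names (`mapper`, `partition`, `shuffle`, `reducer`, `run_job`) and the byte-sum partitioning rule are assumptions made for illustration. The Combiner step is left out to keep the sketch short.

```python
from collections import defaultdict


def mapper(line):
    """Map phase: emit a (word, 1) pair for every word in the line."""
    for word in line.lower().split():
        yield word, 1


def partition(key, num_reducers):
    """Pick a reducer for a key (illustrative rule: sum of UTF-8 bytes)."""
    return sum(key.encode("utf-8")) % num_reducers


def shuffle(pairs, num_reducers):
    """Shuffle phase: partition pairs, group values by key, sort the keys."""
    partitions = [defaultdict(list) for _ in range(num_reducers)]
    for key, value in pairs:
        partitions[partition(key, num_reducers)][key].append(value)
    return [sorted(p.items()) for p in partitions]


def reducer(key, values):
    """Reduce phase: add up all the counts collected for one word."""
    return key, sum(values)


def run_job(lines, num_reducers=2):
    """Run map, shuffle, and reduce in sequence and return the word counts."""
    pairs = [pair for line in lines for pair in mapper(line)]
    output = {}
    for part in shuffle(pairs, num_reducers):
        for key, values in part:
            word, total = reducer(key, values)
            output[word] = total
    return output


if __name__ == "__main__":
    print(run_job(["big data big cluster", "data node"]))
```

On a real cluster, each phase would run as separate tasks on different machines, and the framework would handle the shuffle between them.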


Hadoop Developer Tasks

- Hadoop Eclipse integration
- Reading and writing data using Java
- How to write a MapReduce job
- Mapper/Reducer in detail
- Searching in HDFS
- Sorting in HDFS



HBase

- Introduction to HBase
- Installation of HBase on your system
- Exploring HBase Master and RegionServers
- Exploring ZooKeeper
- Column families and qualifiers
- Basic HBase shell commands
- Hands-on exercise



Hive

- Introduction to Hive
- HBase vs Hive
- Installation of Hive on your system
- HQL (Hive Query Language)
- Basic Hive commands
- Hands-on exercise



Pig

- Introduction to Pig
- Installation of Pig on your system
- Basic Pig commands
- Hands-on exercise



Sqoop

- Introduction to Sqoop
- Installation of Sqoop on your system
- Import/export data between an RDBMS and HDFS
- Import/export data between an RDBMS and HBase
- Import/export data between an RDBMS and Hive
- Hands-on exercise


Mini Project / POC (Proof of Concept)

- Facebook-Hive POC
- Uses of Hadoop/Hive at Facebook
- Static and dynamic partitioning
- UDFs (user-defined functions)
- Project use cases
- Hands-on exercise

Demo Video

Watch HADOOP ADMIN Demo Video

About Trainer

Trainer Information