RM3,800.00 RM3,610.00

Date: Aug 13, 2018 – Aug 17, 2018
Duration: 5 Days, 9:00 am – 5:00 pm
Place: i-Riang Academy Petaling Jaya


Categories: ,


This course provides participants an expertise in all the steps necessary to operate and maintain a Hadoop cluster, i.e. from Planning, Installation and Configuration through load balancing, Security and Tuning providing hands-on preparation for the real-world challenges faced by Hadoop Administrators.


Our intake is on Oct 22, 2018 – Oct 26, 2018.

Get Brochure here.

Topics Covered

Day 1 – Introduction To Bigdata  And Hadoop

In this module, participants will understand what is big data and Apache Hadoop. Participants will also learn how Hadoop solves the big data problems, about Hadoop cluster architecture, its core components & ecosystem, Hadoop data loading & reading mechanism and role of a Hadoop cluster administrator

  • Big data basics,
  • Limitations of existing solutions,
  • Hadoop architecture,
  • Hadoop components and ecosystem, Data loading & reading from HDFS, replication rules,
  • Rack awareness theory,
  • Hadoop cluster administrator: Roles and responsibilities.

Day 2 – Hadoop Cluster Administration

In this, module participants will understand the working of the secondary name node, working with Hadoop distributed cluster, enabling rack awareness, maintenance mode of Hadoop cluster, adding or removing nodes to participating cluster in an ad-hoc and recommended way, understand the MapReduce programming model in the context of Hadoop administrator and schedules.

  • Understanding secondary Namenode
  • Working with Hadoop distributed cluster
  • Decommissioning or commissioning of nodes
  • Understanding MapReduce
  • Understanding schedulers and enabling them

Day 3 – Hadoop Cluster Maintenance Monitoring & Hadoop 1.x Vs 2.x

In this module, participants will understand the day to day cluster administration tasks, balancing data in a cluster, protecting data by enabling trash, attempting a manual failover, creating backup within or across clusters, safeguarding participants metadata and doing metadata recovery or manual failover of NameNode recovery, learning how to restrict the usage of HDFS in terms of count and volume of data, and more.

participants also will learn more about the new features of Hadoop 2.0, HDFS High Availability, YARN framework and job execution flow, MRv2, federation, limitations of Hadoop 1.x and setting up Hadoop 2.0 Cluster setup in pseudo-distributed and distributed mode. 

  • Key admin commands like Balancer, Trash,
  • Import Check Point, Distcp,
  • data backup and recovery
  • Enabling trash, namespace count quota or space quota,
  • Manual failover or metadata recovery.
  • Limitations of Hadoop 1.x,
  • Features of Hadoop 2.0,
  • YARN framework, MRv2, YARN ecosystem
  • Hadoop high availability and federation,
  • Hadoop 2.0 Cluster setup.

Day 4 – Hadoop Cluster 2.x Installation and Configuration & Hadoop Eco Systems

In this module, participants will gather insights around cluster planning and management; learn about the various aspects one needs to remember while planning a setup of a new cluster, capacity sizing, understanding recommendations and comparing different distributions of Hadoop, understanding workload and usage patterns and some examples from the world of big data.

participants also will learn to, importing data from RDBMS into HDFS USING sqoop, Apache Sqoop, Oozie, Hive and HBase are used and working on the components.

  • Planning a Hadoop 2.0 cluster,
  • Cluster sizing, hardware, network, and software considerations,
  • Popular Hadoop distributions,
  • Workload and usage patterns,
  • Industry recommendations,
  • Configuring Hadoop 2 with high availability and upgrading to Hadoop 2.

Day 5 – Hadoop Distributions (Cloudera)

About Trainer

G.S.RAMAN M.Sc., M.C.A., M.E (Ph.D.)
Consultant / Trainer

GS Raman is a very passionate educationist, trainer, motivational speaker and an ardent Hadoop and cloud evangelist. Having 20 plus years of experience, he strongly believes in experiential learning methodologies and multiple intelligence skills based teaching-learning approach. GS Raman has trained 5000 plus engineering students & faculty across India and the APAC region on various skills through 100‘s of faculty development workshop’s and student development workshops. Being from educational and research background for many years, GS Raman has written many research papers on various big data and cloud-related topics and published in numerous conferences and journals such as IEEE and Springer. He also holds various industry certifications to further strengthen his skills and knowledge in the area of big data, cloud and NoSQL databases.

Among his certifications are:
 Oracle Certified Professional – OCP 9i
 Oracle Certified Professional – OCP10g
 Sun Certified System Administrator for Solaris10 – SCSA
 Sun Certified Network Administrator for Solaris10 – SCSA
 EMC Proven Professional in Information Storage & Management
 VMware certified CLOUD Associate
 VMware certified Associate-Data Center Virtualization
 Amazon certified cloud Architect.
 Cloudera Certified Administrator Apache Hadoop