Hadoop Training and Certification Workshop
This 3-day training program is geared to give developers hands-on working knowledge for harnessing the power of Hadoop in their organizations. Hadoop is a software framework that supports data-intensive distributed applications. Hadoop empowers applications to work with thousands of nodes and petabytes of data without exposing the complexity of clustering to the end user.
This intense course assumes no prior knowledge of Hadoop or BigData Concepts. It begins with first giving an overview of MapReduce and the Hadoop ecosystem and then works its wayto hands-on exploration with datasets and live clusters. The course goes over some common configuration mechanisms, tools and debugging.
There will be good amount of hands on exercises on Hadoop during training.
Software developers interested in learning Hadoop, distributed systems concepts and Map Reduce.
Minimal exposure to database or datawarehouse concepts. Some familiarity with launching scripts, SQL and Java is preferred but not essential.
Want to know more !!
Call us for further details. 080-65683622
Our outline follows The Hadoop Definitive Guide which is a good reading companion for the course. We pepper the hands-on training with slideshows & videos from real world implementations.
Introduction to Apache Hadoop and its Ecosystem
The Motivation for Hadoop
Hadoop: Basic Concepts
Basic Programming with the Hadoop Core API
Writing MapReduce Program (basic)
Practical Development Tips and Techniques
Data Input and Output
Problem Solving with MapReduce
Common MapReduce Algorithms
The Hadoop Ecosystem
Integrating Hadoop into the Enterprise Workflow
Machine Learning and Mahout (Basics)
An Introduction to Hive and Pig
An Introduction to Oozie
Course Conclusion and Appendices
Graph Manipulation in MapReduce