Courses

Hadoop 101

Effort: 5 hours
Level: Beginner
Badge: Hadoop 101

About the course

Learn the basics of Apache Hadoop, a free, open-source, Java-based programming framework, and find out why it was invented.

Learn about Hadoop's architecture and core components, such as MapReduce and the Hadoop Distributed File System (HDFS).

Moving Data into Hadoop

Effort: 5 hours
Level: Beginner

About the course

This course gives an overview of Oozie and how it controls Hadoop jobs. It begins with the components required to code a workflow, along with optional components such as case statements, forks, and joins. It then covers using the Oozie coordinator to schedule a workflow.

MapReduce and YARN

Effort: 5 hours
Level: Beginner

About the course

Apache Hadoop is one of the most popular tools for big data processing, and many companies have run it in production for several years. Though Hadoop is considered a reliable, scalable, and cost-effective solution, a large community of developers is constantly improving it. As a result, version 2.0 offers several major new features, including Yet Another Resource Negotiator (YARN), HDFS Federation, and high availability, which make a Hadoop cluster more efficient, powerful, and reliable.