Duration: 1 Day
In this course, you will receive an overview of Apache Hadoop and discover how it can help meet your business goals. You will cover the different Apache Hadoop technologies, including MapReduce, Hadoop Distributed File System (HDFS), Hive, Pig, HBase, Sqoop, Flume, and Hue, and you will learn how these fit into your existing technology environment.
What You Will Learn
- Your business goals and Hadoop
- Fitting Hadoop into your existing environment
Audience
- Architects
- Technical managers
- CTOs
- Engineering managers
Prerequistes
Course Outline
1. Why Hadoop?
- Motivation for Hadoop
- Use Cases and Case Studies about Hadoop
2. Hadoop Ecosystem
- MapReduce and HDFS
- Hive
- Pig
- HBase
- Sqoop
- Flume
- Hue
- Cloudera's Distribution of Hadoop (CDH)
3. Hadoop into Your Architecture
- Augment Your Existing Environment
- Relational Databases
- SANs
- OLAP Systems and More
4. Managing a Hadoop Cluster
- People Resources Required
- Physical Resources Required
- Cost to Organization
- Scale for Growth
5. Apache Open-Source Model and Cloudera's Role
Course Labs