Lesson 1 - Introduction to Hadoop
Learning objectives
- Understand what Hadoop is
- Understand what Big Data is
- Learn about other open source software related to Hadoop
- Understand how Big Data solutions can work on the Cloud
Instructions
- Review all the videos provided
- Complete the lab
Videos
- Lesson 1 - Part 1: Video (5:28)
- Lesson 1 - Part 1: Transcript (PDF)
- Lesson 1 - Part 2: Video (6:23)
- Lesson 1 - Part 2: Transcript (PDF)
Hands-on lab - Creating your own Hadoop cluster
- We will use IBM InfoSphere BigInsights (BigInsights) software to work with Hadoop.
- You will need to have a free IBM ID in order to access the image.
- BigInsights is available in different editions; this course uses the Quick Start Edition, which is free and has no limits on usage time or data size.
- Lesson 1 - Exercise 1: Instructions - BigInsights Image Quick Start Guide (PDF)
- Download the 64-bit VMware image
- Download and install the free VMware Player to run the VMware image. (If you do not already have VMware Player, click the link; the player is available for download from the page that opens.)
Lesson 2 - Hadoop architecture
Learning objectives
- Understand the main Hadoop components
- Learn how HDFS works
- List data access patterns for which HDFS is designed
- Describe how data is stored in an HDFS cluster
Instructions
- Review all the videos provided
- Complete the lab
Videos
- Lesson 2 - Part 1: Video (11:32)
- Lesson 2 - Part 1: Transcript (PDF)
- Lesson 2 - Part 2: Video (7:46)
- Lesson 2 - Part 2: Transcript (PDF)
- Lesson 2 - Part 3: Video (10:28)
- Lesson 2 - Part 3: Transcript (PDF)
- Lesson 2 - Part 4: Video - HDFS Command Line (10:43)
- Lesson 2 - Part 4: Transcript (PDF)
Hands-on lab - Hadoop Architecture
- Lesson 2 - Exercise 1: Instructions - Hadoop Architecture (PDF)
- Lesson 2 - Exercise 1 Solution: Video (11:59)
Lesson 3 - Hadoop Administration
Learning objectives
- Add and remove nodes from a cluster
- Verify the health of a cluster
- Start and stop a cluster's components
- Modify Hadoop configuration parameters
- Set up a rack topology
Instructions
- Review all the videos provided
- Complete the lab
Videos
- Lesson 3 - Part 1: Video (4:56)
- Lesson 3 - Part 1: Transcript (PDF)
- Lesson 3 - Part 2: Video (6:22)
- Lesson 3 - Part 2: Transcript (PDF)
Hands-on lab - Hadoop Administration
- Lesson 3 - Exercise 1: Instructions - Hadoop Administration (PDF)
- Lesson 3 - Exercise 1 Solution - Part 1: Video (4:39)
- Lesson 3 - Exercise 1 Solution - Part 2: Video (7:00)
- Lesson 3 - Exercise 1 Solution - Part 3: Video (4:20)
Lesson 4 - Hadoop Components
Learning objectives
- Describe the MapReduce philosophy
- Explain how Pig, Hive, and Jaql can be used in a Hadoop environment
- Describe how Flume and Sqoop can be used to move data into Hadoop
- Describe how Oozie is used to schedule and control Hadoop job execution
Instructions
- Review all the videos provided
Videos
- Lesson 4 - Part 1: Video (5:53)
- Lesson 4 - Part 1: Transcript (PDF)
- Lesson 4 - Part 2: Video (12:37)
- Lesson 4 - Part 2: Transcript (PDF)
- Lesson 4 - Part 3: Video (5:26)
- Lesson 4 - Part 3: Transcript (PDF)