Contact Us


Thank you for your interest in LearnQuest.

Your request is being processed and LearnQuest or a LearnQuest-Authorized Training Provider will be in touch with you shortly.


Thank you for your interest in Private Training.

We look forward to helping you develop the perfect training solution to help you meet your company's goals.

For immediate assistance, speak with one of our representatives using the chat module below. Otherwise, LearnQuest or a LearnQuest-Authorized Training Provider will be in touch with you shortly.


Thank you for your interest in LearnQuest!

Now, you will be able to stay up-to-date on our latest course offerings, promotions, and training discounts. Watch your inbox for upcoming special offers.


Date: xxx

Location: xxx

Time: xxx

Price: xxx

Please take a moment to fill out this form. We will get back to you as soon as possible.

All fields marked with an asterisk (*) are mandatory.

Cloudera Developer Training for MapReduce

Course content updated by LearnQuest
3,195 USD
4 Days
Classroom Training, Online Training
Cloudera Training
Prices reflect a 22.5% discount for IBM employees.
Prices shown are the special AWS Partner Prices.
Prices reflect the Capgemini employee discount.
Prices reflect the UPS employee discount.
Prices reflect the ??democompanyname?? employee discount.
GSA Private/Onsite Price: ??gsa-private-price??
For GSA pricing, please go to GSA Advantage.
This course is eligible for the IBM Full Access Training Pass
Enroll today and save 10% on this course. Use promo code CLOUD10 when registering.
Working on a laptop
Gain access to IBM’s library of digital, on-demand courses for one low annual subscription fee
IBM Full Access Training Pass
$500 off any IBM Full Access Training Pass Option
See Offer
Get a 30% Discount on IBM Self-Paced Courses
See Offer

Class Schedule

Delivery Formats

Sort results

Filter Classes

Guaranteed to Run





    Sorry, there are no public classes currently scheduled in your country.

    Please complete this form, and a Training Advisor will be in touch with you shortly to address your training needs.

View Global Schedule

Course Description


Cloudera Universityâ??s four-day developer training course delivers the key concepts and expertise participants need to create robust data processing applications using Apache Hadoop. From workflow implementation and working with APIs through writing MapReduce code and executing joins, Clouderaâ??s training course is the best preparation for the real-world challenges faced by Hadoop developers.



  • The internals of MapReduce and HDFS and how to write MapReduce code
  • Best practices for Hadoop development, debugging, and implementation of workflows and common algorithms
  • How to leverage Hive, Pig, Sqoop, Flume, Oozie, and other Hadoop ecosystem projects
  • Creating custom components such as WritableComparables and InputFormats to manage complex data types
  • Writing and executing joins to link data sets in MapReduce
  • Advanced Hadoop API topics required for real-world data analysis
  • Audience






      The Motivation For Hadoop

      • Problems with Traditional Large-Scale Systems
      • Introducing Hadoop
      • Hadoopable Problems

      Hadoop: Basic Concepts and HDFS

      • The Hadoop Project and Hadoop Components
      • The Hadoop Distributed File System

      Introduction to MapReduce

      • MapReduce Overview
      • Example: WordCount
      • Mappers
      • Reducers

      Hadoop Clusters and the Hadoop Ecosystem

      • Hadoop Cluster Overview
      • Hadoop Jobs and Tasks
      • Other Hadoop Ecosystem Components

      Writing a MapReduce Program in Java

      • Basic MapReduce API Concepts
      • Writing MapReduce Drivers, Mappers, and Reducers in Java
      • Speeding Up Hadoop Development by Using Eclipse
      • Differences Between the Old and New MapReduce APIs

      Writing a MapReduce Program Using Streaming

      • Writing Mappers and Reducers with the Streaming API

      Unit Testing MapReduce Programs

      • Unit Testing
      • The JUnit and MRUnit Testing Frameworks
      • Writing Unit Tests with MRUnit
      • Running Unit Tests

      Delving Deeper into the Hadoop API

      • Using the ToolRunner Class
      • Setting Up and Tearing Down Mappers and reducers
      • Decreasing the Amount of Intermediate
      • Data with Combiners
      • Accessing HDFS Programmatically
      • Using The Distributed Cache
      • Using the Hadoop APIâ??s Library of Mappers, Reducers, and Partitioners

      Practical Development Tips and Techniques

      • Strategies for Debugging MapReduce Code
      • Testing MapReduce Code Locally by Using LocalJobRunner
      • Writing and Viewing Log Files
      • Retrieving Job Information with Counters
      • Reusing Objects
      • Creating Map-Only MapReduce Jobs

      Partitioners and Reducers

      • How Partitioners and Reducers Work Together
      • Determining the Optimal Number of Reducers for a Job
      • Writing Customer Partitioners

      Data Input and Output

      • Creating Custom Writable and Writable
      • Comparable Implementations
      • Saving Binary Data Using SequenceFile and Avro Data Files
      • Issues to Consider When Using File Compression
      • Implementing Custom InputFormats and OutputFormats

      Common MapReduce Algorithms

      • Sorting and Searching Large Data Sets
      • Indexing Data
      • Computing Term Frequency â?? Inverse Document Frequency
      • Calculating Word Co-Occurrence
      • Performing Secondary Sort

      Joining Data Sets in MapReduce Jobs

      • Writing a Map-Side Join
      • Writing a Reduce-Side Join

      Integrating Hadoop into the Enterprise Workflow

      • Integrating Hadoop into an Existing Enterprise
      • Loading Data from an RDBMS into HDFS by Using Sqoop
      • Managing Real-Time Data Using Flume
      • Accessing HDFS from Legacy Systems with FuseDFS and HttpFS

      An Introduction to Hive, Imapala, and Pig

      • The Motivation for Hive, Impala, and Pig
      • Hive Overview
      • Impala Overview
      • Pig Overview
      • Choosing Between Hive, Impala, and Pig

      An Introduction to Oozie

      • Introduction to Oozie
      • Creating Oozie Workflows

      • HBase for Developers

        • Duration: 3 Days
        • Delivery Format: Classroom Training, Online Training
        • Price: 2,100.00 USD
      2020 Top 20 Training Industry Company - IT Training

      Need Help?

      Call us toll free at 877-206-0106 or e-mail us at

      Personalized Solutions

      Need a personalized solution for your training? Contact us, and one of our advisors will help you find the best solution to your training needs.

      Contact us

      Need Help?

      Do you have a question about the courses, instruction, or materials covered? Do you need help finding which course is best for you?

      Talk to us

      Self-Paced Training Info

      Learn at your own pace with anytime, anywhere training

      • Same in-demand topics as instructor-led public and private classes.
      • Standalone learning or supplemental reinforcement.
      • e-Learning content varies by course and technology.
      • View the Self-Paced version of this outline and what is included in the SPVC course.
      • Learn more about e-Learning

      Course Added To Shopping Cart







      Self-Paced Training Terms & Conditions


      Sorry, there are no classes that meet your criteria.

      Please contact us to schedule a class.
      Nothing yet
      here's the message from the cart

      To view the cart, you can click "View Cart" on the right side of the heading on each page
      Add to cart clicker.

      Purchase Information

      ??elearning-coursenumber?? ??coursename??
      View Cart

      Need more Information?

      Speak with our training specialists to continue your learning journey.


      Delivery Formats


      By submitting this form, I agree to LearnQuest's Terms and Conditions

      heres the new schedule
      This website uses third-party profiling cookies to provide services in line with the preferences you reveal while browsing the Website. By continuing to browse this Website, you consent to the use of these cookies. If you wish to object such processing, please read the instructions described in our Privacy Policy.
      Your use of this LearnQuest site affirms your consent to our use of session and persistent cookies to track how you use our website.