Cloudera Developer Training for Apache Spark

Course content updated by LearnQuest
Price: 2,595 USD
Duration: 3 Days
Course Code: DEV-SPARK-E1XC
Delivery Format: Classroom Training, Online Training
Cloudera Training

Course Description

Overview

Cloudera University's three-day training course for Apache Spark enables participants to build complete, unified big data applications that combine batch, streaming, and interactive analytics on all their data. With Spark, developers can write sophisticated parallel applications that support faster decisions, better decisions, and real-time actions across a wide variety of use cases, architectures, and industries.


 

Objectives

  • Using the Spark shell for interactive data analysis
  • The features of Spark's Resilient Distributed Datasets
  • How Spark runs on a cluster
  • How Spark parallelizes task execution
  • Writing Spark applications
  • Processing streaming data with Spark
     
    Topics

    Introduction to Spark

    • What is Spark?
    • Review: From Hadoop MapReduce to Spark
    • Review: HDFS
    • Review: YARN
    • Spark Overview

    Spark Basics

    • Using the Spark Shell
    • RDDs (Resilient Distributed Datasets)
    • Functional Programming in Spark

    Working with RDDs in Spark

    • Creating RDDs
    • Other General RDD Operations

    Aggregating Data with Pair RDDs

    • Key-Value Pair RDDs
    • Map-Reduce
    • Other Pair RDD Operations
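
As a rough illustration of the key-value pattern listed above, the following Spark-free Python sketch mimics a map step followed by a reduce-by-key step for a word count. The names `map_to_pairs`, `reduce_by_key`, and `word_count` are hypothetical helpers written for this sketch, not Spark API calls; in Spark the equivalent steps would run on a distributed pair RDD rather than on in-memory lists.

```python
from collections import defaultdict
from functools import reduce
from operator import add


def map_to_pairs(lines):
    """Map step: emit a (word, 1) pair for every word in every line."""
    return [(word.lower(), 1) for line in lines for word in line.split()]


def reduce_by_key(pairs, func):
    """Reduce step: group values by key, then fold each group with func."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return {key: reduce(func, values) for key, values in grouped.items()}


def word_count(lines):
    """Chain the map and reduce steps (hypothetical helper for this sketch)."""
    return reduce_by_key(map_to_pairs(lines), add)
```

Calling `word_count(["to be or", "not to be"])` counts each word across both lines. The same two-step shape (emit pairs, then combine values per key) is what the pair RDD topics in this module build on.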

    Writing and Deploying Spark Applications

    • Spark Applications vs. Spark Shell
    • Creating the SparkContext
    • Building a Spark Application (Scala and Java)
    • Running a Spark Application
    • The Spark Application Web UI
    • Hands-On Exercise: Write and Run a Spark Application
    • Configuring Spark Properties
    • Logging

    Parallel Processing

    • Review: Spark on a Cluster
    • RDD Partitions
    • Partitioning of File-based RDDs
    • HDFS and Data Locality
    • Executing Parallel Operations
    • Stages and Tasks

    Spark RDD Persistence

    • RDD Lineage
    • RDD Persistence Overview
    • Distributed Persistence

    Basic Spark Streaming

    • Spark Streaming Overview
    • Example: Streaming Request Count
    • DStreams
    • Developing Spark Streaming Applications

    Advanced Spark Streaming

    • Multi-Batch Operations
    • State Operations
    • Sliding Window Operations
    • Advanced Data Sources
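
To make the sliding window topic above more concrete, here is a minimal plain-Python sketch, assuming each incoming batch has already been reduced to a single count. The function `sliding_window_sums` and its parameters are hypothetical names for this illustration, not part of the Spark Streaming API; window length and slide interval are expressed in number of batches rather than in seconds.

```python
def sliding_window_sums(batch_counts, window_length, slide_interval):
    """Sum per-batch counts over the most recent window_length batches,
    emitting one result every slide_interval batches."""
    if window_length <= 0 or slide_interval <= 0:
        raise ValueError("window_length and slide_interval must be positive")
    results = []
    # Each window ends just after batch index end - 1 and reaches back
    # at most window_length batches; early windows may be shorter.
    for end in range(slide_interval, len(batch_counts) + 1, slide_interval):
        start = max(0, end - window_length)
        results.append(sum(batch_counts[start:end]))
    return results
```

With a window of three batches sliding every two batches, consecutive windows overlap by one batch, so the same batch can contribute to more than one result.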

    Common Patterns in Spark Data Processing

    • Common Spark Use Cases
    • Iterative Algorithms in Spark
    • Graph Processing and Analysis
    • Machine Learning
    • Example: k-means

    Improving Spark Performance

    • Shared Variables: Broadcast Variables
    • Shared Variables: Accumulators
    • Common Performance Issues
    • Diagnosing Performance Problems

    Spark SQL and DataFrames

    • Spark SQL and the SQL Context
    • Creating DataFrames
    • Transforming and Querying DataFrames
    • Saving DataFrames
    • DataFrames and RDDs
    • Comparing Spark SQL, Impala and Hive-on-Spark

    Conclusion


       

    Need Help?

    Call us toll free at 877-206-0106 or e-mail us at info@learnquest.com
