title

Date: xxx

Location: xxx

Time: xxx

Price: xxx

Please take a moment to fill out this form. We will get back to you as soon as possible.

All fields marked with an asterisk (*) are mandatory.

Machine Learning With Spark

Price

2,340 USD

Duration

4 Days

Course

DCSK-110

Available Formats

Classroom Training, Online Training

View Class Schedule Course Description Group Training Questions? Contact Us

AWS Training Pass

Take advantage of flexible training options with the AWS Training Pass and get Authorized AWS Training for a full year.

Learn More

Class Schedule

Delivery Formats

Sort & Filter

×

Sort results

Filter Classes

Guaranteed to Run

Modality

Location

Language

Date

Sorry, there are no public classes currently scheduled in your country.

Please complete this form, and a Training Advisor will be in touch with you shortly to address your training needs.

First Name* Last Name* Company Email* Phone Training Option

Comments Yes, I'd like to receive special offers from LearnQuest. I have read and agree to LearnQuest's Terms and Conditions and Privacy Policy, and I consent to have my submitted information stored.

View Global Schedule

Course Description

Overview

This Machine Learning with Spark course is designed to teach Machine Learning at Scale with the popular Apache Spark framework. This course is taught using Spark & Python.

For each machine learning concept, we first discuss the foundations, its applicability, and limitations. Then we explain the implementation and use, and specific use cases. This is achieved through a combination of about 50% lecture, 50% lab work.

Please note that this course does not cover the in-depth coverage of Math / Stats is behind Machine Learning.

Objectives

Upon completion of the Machine Learning with Spark course, students will be able to:

Learn popular machine learning algorithms, their applicability, and limitations
Practice the application of these methods in the Spark machine learning environment
Learn practical use cases and limitations of algorithms

Audience

Data Scientists and Software Engineers

Prerequisites

Working knowledge of Apache Spark.
If students are new to Apache Spark, we can offer one day of ‘Introduction to Spark’ training
Programming background
Familiarity with Python would be a plus, but not required
No machine learning knowledge is assumed

Topics

Section 1: Machine Learning (ML) Overview

Machine Learning landscape
Machine Learning applications
Understanding ML algorithms & models (supervised and unsupervised)

Section 2: ML in Python and Spark

Spark ML Overview
Introduction to Jupyter notebooks
- Lab: Working with Jupyter + Python + Spark
- Lab: Spark ML utilities

Section 3: Machine Learning Concepts

Statistics Primer
Covariance, Correlation, Covariance Matrix
Errors, Residuals
Overfitting / Underfitting
Cross-validation, bootstrapping
Confusion Matrix
ROC curve, Area Under Curve (AUC)
- Lab: Basic stats

Section 4: Feature Engineering (FE)

Preparing data for ML
Extracting features, enhancing data
Data cleanup
Visualizing Data
- Lab: data cleanup
- Lab: visualizing data

Section 5: Linear regression

Simple Linear Regression
Multiple Linear Regression
Running LR
Evaluating LR model performance
- Lab
Use case: House price estimates

Section 6: Logistic Regression

Understanding Logistic Regression
Calculating Logistic Regression
Evaluating model performance
- Lab
Use case: credit card application, college admissions

Section 7: Classification: SVM (Supervised Vector Machines)

SVM concepts and theory
SVM with kernel
- Lab
Use case: Customer churn data

Section 8: Classification

Theory behind trees
Classification and Regression Trees (CART)
Random Forest concepts
- Labs
Use case: predicting loan defaults, estimating election contributions

Section 9: Classification: Naive Bayes

Theory
- Lab
Use case: spam filtering

Section 10: Clustering (K-Means)

Theory behind K-Means
Running K-Means algorithm
Estimating the performance
- Lab
Use case: grouping cars data, grouping shopping data

Section 11: Principal Component Analysis (PCA)

Understanding PCA concepts
PCA applications
Running a PCA algorithm
Evaluating results
- Lab

Use case: analyzing retail shopping data Section 12: Recommendations (Collaborative filtering)

Recommender systems overview
Collaborative Filtering concepts
- Lab
Use case: movie recommendations, music recommendations

Section 13: Performance

Best practices for scaling and optimizing Apache Spark
Memory and processing optimization in Spark and how to take advantage of them
Effective transformations
Beyond JVM
Testing and validation
Machine Learning Performance

Section 14: Final workshop (time permitting)

Introduction to Python 3

PLPJ-145
- Duration: 4 Days
- Delivery Format: Classroom Training, Online Training
- Price: 2,340.00 USD
Introduction to Programming with Python

PLPJ-160
- Duration: 2 Days
- Delivery Format: Classroom Training, Online Training
- Price: 1,170.00 USD

Top 20 Training Industry Company - IT Training

Need Help?

Call us at 877-206-0106 or e-mail us at info@learnquest.com

Personalized Solutions

Need a personalized solution for your Training? Contact us, and one of our training advisors will help you find the best solution.

Contact Us

Need Help?

Do you have a question about the courses, instruction, or materials covered? Do you need help finding which course is best for you? We are here to help!

Talk to us

Self-Paced Training Info

Learn at your own pace with anytime, anywhere training

Same in-demand topics as instructor-led public and private classes.
Standalone learning or supplemental reinforcement.
e-Learning content varies by course and technology.
View the Self-Paced version of this outline and what is included in the SPVC course.
Learn more about e-Learning

Course Added To Shopping Cart

bla

Self-Paced Training Terms & Conditions

??spvc-wbt-warning??

Exam Terms & Conditions

??exam-warning??

??group-training-form-area??

??how-can-we-help-you-area??

??personalized-form-area??

??request-quote-area??

Purchase Information

??elearning-coursenumber?? ??coursename??

View Cart

title

Date: xxx

Location: xxx

Time: xxx

Price: xxx

Please take a moment to fill out this form. We will get back to you as soon as possible.

All fields marked with an asterisk (*) are mandatory.

First Name* Last Name* Company Email* Phone Country* How many people need group training?* Student email addresses (optional) Comments Yes, I'd like to receive special offers from LearnQuest. I have read and agree to LearnQuest's Terms and Conditions and Privacy Policy, and I consent to have my submitted information stored.

Thank you for your interest in LearnQuest.

Thank you for your interest in Private Training.

Thank you for your interest in LearnQuest!

title

Machine Learning With Spark

AWS Training Pass

Class Schedule

Sort results

Filter Classes

Guaranteed to Run

Modality

Location

Language

Date

The self-paced version of this course is also available through our IBM Learning Subscription

Course Description

Overview

Objectives

Audience

Prerequisites

Topics

Recognition

Introduction to Python 3

Introduction to Programming with Python

Need Help?

Personalized Solutions

Need Help?

Self-Paced Training Info

Course Added To Shopping Cart

Self-Paced Training Terms & Conditions

Exam Terms & Conditions

STOP! Before You Leave

Save 0% on this course!

Purchase Information

title

Need more Information?

title

Machine Learning With Spark

AWS Training Pass

Class Schedule

Sort results

Filter Classes

Guaranteed to Run

Modality

Location

Language

Date

Course Description

Overview

Objectives

Audience

Prerequisites

Topics

Recognition

Related Courses

Introduction to Python 3

Introduction to Programming with Python

Need Help?

Personalized Solutions

Need Help?

Self-Paced Training Info

Course Added To Shopping Cart

Self-Paced Training Terms & Conditions

Exam Terms & Conditions

STOP! Before You Leave

Save 0% on this course!

Purchase Information

title

Need more Information?