title

Date: xxx

Location: xxx

Time: xxx

Price: xxx

Please take a moment to fill out this form. We will get back to you as soon as possible.

All fields marked with an asterisk (*) are mandatory.

Certified Data Science Practitioner CDSP (DSP-210) Exam Voucher

Price

Contact LearnQuest

Duration

1 Day

Course

LQEX-CNX0020V

Available Formats

Exam Vouchers

View Class Schedule Course Description Group Training Questions? Contact Us

AWS Training Pass

Take advantage of flexible training options with the AWS Training Pass and get Authorized AWS Training for a full year.

Learn More

Class Schedule

Delivery Formats

Sort & Filter

×

Sort results

Filter Classes

Guaranteed to Run

Modality

Location

Language

Date

View Global Schedule

Course Description

Overview

The Certified Data Science Practitioner™ (CDSP) is an industry-validated certification which helps professionals differentiate themselves from other job candidates by demonstrating their ability to put data science concepts into practice. Data can reveal insights and inform—by guiding decisions and influencing day-to-day operations. This calls for a robust workforce of professionals who can analyze, understand, manipulate, and present data within an effective and repeatable process framework. This certification validates candidates’ ability to use data science principles to address business issues, use multiple techniques to prepare and analyze data, evaluate datasets to extract valuable insights, and design a machine learning approach. In addition, it will validate skills to design, finalize, present, implement, and monitor a model to address issues regardless of business sector.

Objectives

The exam will certify that the successful candidate has the knowledge, skills, and abilities required to frame and answer questions by collecting, wrangling, and exploring datasets and applying statistical models and artificial-intelligence algorithms to extract and communicate knowledge and insights.

Audience

The Certified Data Science Practitioner (CDSP) exam is designed for professionals across different industries seeking to demonstrate the ability to gain insights and build and operationalize predictive models from data.

Prerequisites

A working level knowledge of programming languages such as Python® and R
Proficiency with a querying language
Strong communication skills
Proficiency with statistics and linear algebra
Demonstrate responsibility based upon ethical implications when sharing data sources
Familiarity with data visualization

Introduction to Programming with Python®
Advanced Programming Techniques with Python®
Using Data Science Tools in Python®
R Programming for Data Science
DSBIZ™ (Exam DSZ-210)
DEBIZ™ (Exam DEB-110): Data Ethics for Business Professionals or Certified Ethical Emerging Technologist™ (CEET)

Topics

Domain 1.0 Defining the need to be addressed through the application of data science Objective 1.1 - Identify the project scope

Identify project specifications, including objectives (metrics/KPIs) and stakeholder requirements
Identify mandatory deliverables, optional deliverables
Determine project timeline
Identify project limitations (time, technical, resource, data, risks)

Objective 1.2 - Understand challenges

Understand terminology
- Milestone
- POC (Proof of concept)
- MVP (Minimal Viable Product)
Become aware of data privacy, security, and governance policies
- GDPR
- HIPPA
- California Privacy Act
Obtain permission/access to stakeholder data
Ensure appropriate voluntary disclosure and informed consent controls in place

Objective 1.3 - Classify a question into a known data science problem

Identify references relevant to the data science problem
- Optimization problem
- Forecasting problem
- Regression problem
- Classification problem
- Segmentation/Clustering problem
Identify data sources and type
- Structured/unstructured
- Image
- Text
- Numerical
- Categorical
Select modeling type
- Regression
- Classification
- Forecasting
- Clustering
- Optimization
- Recommender systems

Domain 2.0 Extracting, Transforming, and Loading Data Objective 2.1 - Gather data sets

Read Data
- Write a query for a SQL database
- Write a query for a NoSQL database
- Read data from/write data to cloud storage solutions
- AWS S3
- Google Storage Buckets
- Azure Data Lake
Become aware of first-, second-, and third-party data sources
- Understand data collection methods
- Understand data sharing agreements, where applicable
Explore third-party data availability
- Demographic data
- Bloomberg
Collect open-source data
- Use APIs to collect data
- Scrape the web
Generate data assets
- Dummy or test data
- Randomized data
- Anonymized data
- AI-generated synthetic data

Objective 2.2 - Clean data sets

Identify and eliminate irregularities in data (e.g., edge cases, outliers)
- Nulls
- Duplicates
- Corrupt values
Parse the data
Check for corrupted data
Correct the data format
Deduplicate data
Apply risk and bias mitigation techniques
- Understand common forms of ML bias
- Sampling bias
- Measurement bias
- Exclusion bias
- Observer bias
- Prejudicial bias
- Confirmation bias
- Bandwagoning
- Identify the sources of bias
- Sources of bias include data collection, data labeling, data transformation, data imputation, data selection, and data training methods
- Use exploratory data analysis to visualize and summarize the data, and detect outliers and anomalies
- Assess data quality by measuring and evaluating the completeness, correctness, consistency, and currency of data
- Use data auditing techniques to track and document the provenance, ownership, and usage of data, and applied data cleaning steps
- Mitigate the impact of bias
- Apply mitigation strategies such as data augmentation, sampling, normalization, encoding, validation
- Evaluate the outcomes of bias
- Use methods such as confusion matrix, ROC curve, AUC score, and fairness metrics
- Monitor and improve the data cleaning process
- Establish or adhere to data governance rules, standards, and policies for data and the data cleaning process

Objective 2.3 - Merge and load data sets

Join data from different sources
- Make sure a common key exists in all datasets
- Unique identifiers
Load data
- Load into DB
- Load into dataframe
- Export the cleaned dataset
- Load into visualization tool
Make an endpoint or API

Objective 2.4 - Apply problem-specific transformations to data sets

Apply word vectorization or word tokenization
- Word2vec
- TF-IDF
- Glove
Generate latent representations for image data

Domain 3.0 Performing exploratory data analysis Objective 3.1 - Examine data

Generate summary statistics
Examine feature types
Visualize distributions
Identify outliers
Find correlations
Identify target feature(s)

Objective 3.2 - Preprocess data

Identify missing values
Make decisions about missing values (e.g., imputing method, record removal)
Normalize, standardize, or scale data

Objective 3.3 - Carry out feature engineering

Apply encoding to categorical data
- One-hot encoding
- Target encoding
- Label encoding or Ordinal encoding
- Dummy encoding
- Effect encoding
- Binary encoding
- Base-N encoding
- Hash encoding
Split features
- Text manipulation
- Split
- Trim
- Reverse
- Manipulate data
- Split names
- Extract year from title
Convert dates to useful features
Apply feature reduction methods
- PCA
- Missing value ratio
- t-SNE
- Low-variance filter
- Random forest
- High-correlation filter
- Backward feature elimination
- SVD
- Forward feature selection
- False discovery rate
- Factor analysis
- Feature importance methods

Domain 4.0 Building models Objective 4.1 - Prepare data sets for modeling

Decide proportion of data set to use for training, testing, and (if applicable) validation
Split data to train, test, and (if applicable) validation sets, mitigating data leakage risk

Objective 4.2 - Train models

Define models to try
- Regression
- Linear regression
- Random forest
- XGBoost
- Classification
- Logistic regression
- Random forest classification
- XGBoost classifier
- naïve Bayes
- Forecasting
- ARIMA
- Clustering
- k-means
- Density-based methods
- Hierarchical clustering
- Train model or pre-train or adapt transformers
- Tune hyper-parameters, if applicable
- Cross-validation
- Grid search
- Gradient decent
- Bayesian optimization

Objective 4.3 - Evaluate models

Define evaluation metric
Compare model outputs
- Confusion matrix
- Learning curve
Select best-performing model
Store model for operational use
- MLflow
- Kubeflow

Domain 5.0 Testing models Objective 5.1 - Test hypotheses

Design A/B tests
- Experimental design
- Design use cases
- Test creation
- Statistics
Define success criteria for test
Evaluate test results

Domain 6.0 Operationalizing the pipeline Objective 6.1 - Deploy pipelines

Build streamlined pipeline (using dbt, Fivertran, or similar tools)
Implement confidentiality, integrity, and access control measures
Put model into production
- AWS SageMaker
- Azure ML
- Docker
- Kubernetes
Ensure model works operationally
Monitor pipeline for performance of model over time
- MLflow
- Kubeflow
- Datadog
Consider enterprise data strategy and data management architecture to facilitate the end-to-end integration of data pipelines and environments
- Data warehouse and ETL process
- Data lake and ETL processes
- Data mesh, micro-services, and APIs
- Data fabric, data virtualization, and low-code automation platforms

Domain 7.0 Communication findings Objective 7.1 - Report findings

Implement model in a basic web application for demonstration (POC implementation)
- Web frameworks (Flask, Django)
- Basic HTML
- CSS
Derive insights from findings
Identify features that drive outcomes (e.g., explainability, interpretability, variable importance plot)
Show model results
Generate lift or gain chart
Ensure transparency and explainability of model
- Use explainable methods (e.g., intrinsic and post hoc)
- Visualization
- Feature importance analysis
- Attention mechanisms
- Avoiding black-box techniques in model design
- Explainable AI (XAI) frameworks and tools
- SHAP
- LIME
- ELI5
- What-If Tool
- AIX360
- Skater
- Et al
- Document the model lifecycle
- ML design and workflow
- Code comments
- Data dictionary
- Model cards
- Impact assessments
- Engage with diverse perspectives
- Stakeholder analysis
- User testing
- Feedback loops
Participatory design

Objective 7.2 - Democratize data

Make data more accessible to a wider range of stakeholders
Make data more understandable and actionable for nontechnical individuals
- Implement self-service data/analytics platforms
Create a culture of data literacy
- Educate employees on how to use data effectively
- Offer support and guidance on data-related issues
- Promote transparency and collaboration around data

Certified Data Science Practitionerâ„¢ (CDSP) (Exam DSP-210)

CNX0020
- Duration: 5 Days
- Delivery Format: Classroom Training, Online Training
- Price: 3,500.00 USD
Data Science for Business Professionals DSBIZâ„¢ (Exam DSZ-210)

CNX0018
- Duration: 0.5 Day
- Delivery Format: Classroom Training, Online Training
- Price: 420.00 USD

Top 20 Training Industry Company - IT Training

Need Help?

Call us at 877-206-0106 or e-mail us at info@learnquest.com

Personalized Solutions

Need a personalized solution for your Training? Contact us, and one of our training advisors will help you find the best solution.

Contact Us

Need Help?

Do you have a question about the courses, instruction, or materials covered? Do you need help finding which course is best for you? We are here to help!

Talk to us

Self-Paced Training Info

Learn at your own pace with anytime, anywhere training

Same in-demand topics as instructor-led public and private classes.
Standalone learning or supplemental reinforcement.
e-Learning content varies by course and technology.
View the Self-Paced version of this outline and what is included in the SPVC course.
Learn more about e-Learning

Course Added To Shopping Cart

bla

Self-Paced Training Terms & Conditions

THIS IS A SELF-PACED VIRTUAL CLASS. AFTER YOU REGISTER, YOU HAVE 30 DAYS TO COMPLETE THE COURSE.

Before you enroll, review the system requirements to ensure that your system meets the minimum requirements for this course. AFTER YOU ARE ENROLLED IN THIS COURSE, YOU WILL NOT BE ABLE TO CANCEL YOUR ENROLLMENT. You are billed for the course when you submit the enrollment form. Self-Paced Virtual Classes are non-refundable. Once you purchase a Self-Paced Virtual Class, you will be charged the full price.

After you receive confirmation that you are enrolled, you will be sent further instructions to access your course material and remote labs. A confirmation email will contain your online link, your ID and password, and additional instructions for starting the course.

You can start the course at any time within 12 months of enrolling for the course. After you register/start the course, you have 30 days to complete your course. Within this 30 days, the self-paced format gives you the opportunity to complete the course at your convenience, at any location, and at your own pace. The course is available 24 hours a day.

Exam Terms & Conditions

??exam-warning??

??group-training-form-area??

??how-can-we-help-you-area??

??personalized-form-area??

??request-quote-area??

Purchase Information

??elearning-coursenumber?? ??coursename??

View Cart

title

Date: xxx

Location: xxx

Time: xxx

Price: xxx

Please take a moment to fill out this form. We will get back to you as soon as possible.

All fields marked with an asterisk (*) are mandatory.

First Name* Last Name* Company Email* Phone Country* How many people need group training?* Student email addresses (optional) Comments Yes, I'd like to receive special offers from LearnQuest. I have read and agree to LearnQuest's Terms and Conditions and Privacy Policy, and I consent to have my submitted information stored.

Thank you for your interest in LearnQuest.

Thank you for your interest in Private Training.

Thank you for your interest in LearnQuest!

title

Certified Data Science Practitioner CDSP (DSP-210) Exam Voucher

AWS Training Pass

Class Schedule

Sort results

Filter Classes

Guaranteed to Run

Modality

Location

Language

Date

The self-paced version of this course is also available through our IBM Learning Subscription

Course Description

Overview

Objectives

Audience

Prerequisites

Topics

Recognition

Certified Data Science Practitionerâ„¢ (CDSP) (Exam DSP-210)

Data Science for Business Professionals DSBIZâ„¢ (Exam DSZ-210)

Need Help?

Personalized Solutions

Need Help?

Self-Paced Training Info

Course Added To Shopping Cart

Self-Paced Training Terms & Conditions

Exam Terms & Conditions

STOP! Before You Leave

Save 0% on this course!

Purchase Information

title

Need more Information?

title

Certified Data Science Practitioner CDSP (DSP-210) Exam Voucher

AWS Training Pass

Class Schedule

Sort results

Filter Classes

Guaranteed to Run

Modality

Location

Language

Date

Course Description

Overview

Objectives

Audience

Prerequisites

Topics

Recognition

Related Courses

Certified Data Science Practitionerâ„¢ (CDSP) (Exam DSP-210)

Data Science for Business Professionals DSBIZâ„¢ (Exam DSZ-210)

Need Help?

Personalized Solutions

Need Help?

Self-Paced Training Info

Course Added To Shopping Cart

Self-Paced Training Terms & Conditions

Exam Terms & Conditions

STOP! Before You Leave

Save 0% on this course!

Purchase Information

title

Need more Information?