Data Science Bootcamp Online | Data Science Dojo

For a hands-on learning experience to develop Agentic AI applications, join our Agentic AI Bootcamp today. Early Bird Discount

Data science bootcamp

A comprehensive program covering data science fundamentals, visualization, predictive modeling, model deployment, and advanced topics like text analytics and recommender systems.

Trusted by Leading Companies

4.95

Switchup Rating

12,000+

Alumni

2,500+

Companies Trained

1M+

Community Members

Who is this bootcamp for?

This Data Science Bootcamp is for professionals ready to master end-to-end data workflows—from data wrangling and visualization to machine learning and model deployment.

Instructors and guest speakers

Learn from though leaders at the forefront of end-to-end data workflows—from data wrangling and visualization to machine learning and model deployment.

Albar Wahab

Senior Data Scientist | Data Science Dojo

Zaid Ahmed

Senior Data Scientist | Data Science Dojo

curriculum

Explore the bootcamp curriculum

Overview of the topics and practical exercises.

In this module, we’ll build the foundational programming knowledge and theory needed to succeed in the bootcamp. By covering essential Python concepts and tools, we’ll set ourselves up for a smoother learning experience during the hands-on sessions. Whether you’re starting from scratch or refreshing your skills, this module is the perfect starting point for mastering Python and its practical applications.

What You’ll Learn

Grasp the fundamentals of Python programming, including key concepts for beginners

Master efficient input/output operations to handle data seamlessly

Navigate and utilize Jupyter Notebook for writing and executing Python code

Learn how to interact with REST APIs effectively using Python

In this module, we’ll focus on data exploration, visualization, and feature engineering—essential steps in preparing data for analysis. We’ll learn how to use techniques like summary statistics and visual tools to understand the structure of your data, spot patterns, and identify issues such as missing values or outliers. We’ll also cover how to transform and create new features that make your data more useful for modeling. By the end, you’ll be able to uncover insights, clean your data, and shape it for better analysis and predictive performance.

What You’ll Learn

Explore the idea behind exploratory data analysis using Python

Understand the importance of feature engineering in building effective machine-learning models

Apply summary statistics to interpret data

Analyze the importance of segmentation in data analysis and recognize pitfalls of data analysis

Understand how domain knowledge helps us to create informative and relevant features for a particular problem

In this module, we’ll explore how to turn raw data into compelling visual stories. We’ll learn how to choose the right visualizations for different data types, and apply tools like pandas, matplotlib, and seaborn to analyze real datasets. Through hands-on exercises, we’ll uncover key insights and develop a deeper understanding of the data. We’ll also discuss how to design and present visuals that clearly communicate the message to our audience. By the end of the module, you’ll be equipped to present data in a clear, impactful, and audience-friendly way.

What You’ll Learn

Understand how to select appropriate visualization types for different types of data

Analyze data using Pandas, Matplotlib, and Seaborn through hands-on exercises with multiple datasets

Unlock insights and gain a deeper understanding of data

Discuss how to deliver effective visuals for data storytelling

In this module, we’ll explore how to use predictive modelling to create real business impacts. We’ll learn to identify the right opportunities for machine learning, translate business goals into actionable models, and recognize when good models might still lead to poor outcomes. We’ll also cover essential data requirements and key ethical considerations. By the end, you’ll be able to understand predictive systems that are both effective and aligned with business strategy.

What You’ll Learn

Learn how to identify the right business opportunities for real analytics impact

Investigate conditions where a good machine learning model leads to an adverse business impact

Understand various types of data necessary for data collection and analysis

Identify how to translate business goals into actionable machine learning solutions

Understand key ethical and practical challenges in building effective predictive systems

In this module, we will explore decision tree learning and focus on how decision trees are constructed for supervised classification tasks. We will learn to apply splitting criteria like Gini index and how to evaluate model performance. Practical exercises and quizzes at the end will help you to apply the learned skills to real datasets and benchmark your performance. By the end of this module, you will be equipped to confidently implement decision tree classifiers.

What You’ll Learn

Understand the fundamentals of decision tree learning

Learn how to choose optimal split points using criteria like Gini index

Explore key decision tree concepts such as depth, overfitting, and pruning

Apply decision tree algorithms to real-world datasets through hands-on exercises

Gain experience with model interpretation and visualizing tree-based decisions

In this module, we’ll focus on evaluating classification models and understanding their impact in real-world applications. We’ll begin by discussing why accuracy alone isn’t a reliable metric and how different types of errors affect business decisions. Through the confusion matrix and key performance metrics like precision, recall, and F1 score, we’ll learn to interpret model results with clarity. We’ll also explore advanced evaluation tools like ROC curves and AUC to help us assess model performance across thresholds. By the end of this module, you’ll be able to choose appropriate metrics, interpret model results effectively, and make evaluation decisions that align with real-world goals.

What You’ll Learn

Explain why accuracy alone may be misleading when evaluating classification models

Interpret the confusion matrix to assess classification performance

Differentiate between key metrics such as precision, recall, and F1 score

Evaluate metric trade-offs in different business contexts

Apply ROC curves and AUC to compare model performance beyond threshold-based metrics

Master the essentials of hyperparameter tuning to improve your model’s accuracy and reliability. This module covers key concepts like generalization, overfitting, and the bias-variance tradeoff, alongside practical validation techniques and strategies to handle real-world data challenges. We’ll learn how to fine-tune models for better performance and robust predictions.

What You’ll Learn

Describe the role of hyperparameter tuning in improving machine learning model performance

Identify and apply key techniques for effective hyperparameter tuning

Explain what overfitting is and how it affects model generalization

Recognize the characteristics of a well-generalized model across different data samples

Understand the bias-variance tradeoff and how it influences tuning decisions

Unlock the potential of ensemble methods and elevate your predictive power. This module aims to go deep into techniques such as bagging and random forests, explore key concepts like the bias-variance trade-off and out-of-bag evaluation, and build practical skills through interactive notebooks. Whether you’re working with messy real-world data or aiming for stronger model performance, ensemble learning is the next step forward!

What You’ll Learn

Explore key techniques like bagging and random forests in depth

Apply concepts such as out-of-bag evaluation and binomial probability

Understand how ensemble methods balance bias and variance

Gain hands-on experience through interactive, real-world exercises

Boosting is a powerful ensemble technique that builds better models by learning from previous mistakes, one step at a time. In this course, we’ll explore the core ideas behind boosting, explore popular methods like AdaBoost, and apply them through real-world examples and hands-on exercises. Whether you’re just starting out or looking to sharpen your skills, this course will help you create smarter and more accurate predictive models.

What You’ll Learn

Understand the concept of using weak classifiers to create a strong one

Analyze boosting and its ability to adjust the weight of each classifier

Apply the logic behind the boosting algorithm to real-world examples

Evaluate the differences between bagging and boosting

Discuss the potential drawbacks of the boosting algorithm

In today’s data-driven world, successful digital products are not built on guesswork. They’re built on evidence. This module will walk you through the essential principles and practical techniques of online experimentation. From understanding the basics of A/B testing to mastering advanced testing strategies and avoiding common pitfalls, you’ll gain the knowledge and confidence to run meaningful experiments that drive better outcomes. Whether you’re refining a feature, testing a new idea, or scaling insights across teams, this module sets the foundation for thoughtful, measurable, and impactful decisions.

What You’ll Learn

Recognize the role of experimentation in online business and marketing

Learn how to design and run effective experiments

Distinguish between A/B, A/A, and multivariate testing and their use cases

Apply best practices for metrics selection, hypothesis creation, and error avoidance

Interpret results using statistical significance and confidence intervals

Every tweet, email, review, and support ticket holds valuable insights if we know how to extract them. This module introduces the tools and techniques businesses use to turn raw text into strategic decisions. We will learn how to clean, structure, and analyze text using NLP, TF-IDF, and word embeddings like Word2Vec. Whether you want to understand customer sentiment or automate tasks, this module will help you get started.

What You’ll Learn

Analyze unstructured data types such as text

Evaluate the process of converting text into structured data

Understand the steps involved in pre-processing text

Apply the concepts of TF-IDF and Word2Vec

Use cosine similarity to determine the similarity between documents

Discover how unsupervised learning helps uncover hidden patterns in data without labels. This module introduces key concepts of clustering, with a focus on the k-means algorithm. We’ll learn how k-means works, when to use it, and how to choose the right number of clusters for meaningful insights from raw data.

What You’ll Learn

Discuss the basics of unsupervised learning and how it differs from supervised learning

Explain the concept and steps of the K-means clustering algorithm

Apply K-means to group unlabeled data and uncover patterns

Evaluate the strengths and limitations of K-means clustering

Determine the optimal number of clusters using the elbow method

Discover how a simple straight line can unlock powerful insights and help predict the future. In this course, we’ll explore how linear models work, how to evaluate their performance, and how optimization techniques like gradient descent help improve accuracy. Through intuitive explanations and practical examples, we’ll learn to build, interpret, and apply linear regression models with confidence.

What You’ll Learn

How linear regression models predict continuous outcomes using input features

The difference between parametric and non-parametric approaches

The role of cost functions in quantifying prediction error

How gradient descent optimizes model parameters step by step

How to measure model performance using different metrics

What if our linear models could stay accurate and simple, even in messy real-world data? In this module, we will build models that balance precision and simplicity. We will learn how to control model complexity, use regularization to prevent overfitting, and tune hyperparameters so our models make reliable, real-world predictions.

What You’ll Learn

Balancing model complexity to avoid underfitting and overfitting

Applying L1 (Lasso) and L2 (Ridge) regularization to control complexity

Tuning key hyperparameters for optimal performance

Evaluating model fit and use cross-validation to improve generalization

Interpreting the bias-variance trade-off when tuning linear models

Ever wondered how Netflix knows what we’ll love next? This module unpacks the magic behind recommendation systems, from how they measure similarity and generate suggestions to choosing the right approach for your business. Discover how to use data-driven recommendations to boost engagement, delight customers, and drive results.

What You’ll Learn

Different type of recommendation systems like collaborative and content-based

Similarity measures to compare users or items and find the best matches

Recommendation algorithms used to generate and rank suggestions

Evaluation techniques to assess performance and effectiveness

Learn the core principles of big data engineering and distributed systems to confidently tackle large-scale data challenges. This module introduces key topics such as cloud infrastructure models, distributed computing frameworks, and scalable system design. We’ll use tools like Hadoop and Spark to process data efficiently and explore modern architectures that support real-time analytics and machine learning workflows.

What You’ll Learn

Explain the role of data engineering in enabling scalable machine learning workflows

Describe how distributed systems process and manage large-scale data efficiently

Compare cloud service models and assess their suitability for different applications

Identify the core components and workflows of Hadoop, Spark, and MapReduce

Earn a verified certificate

Earn a verified certificate from The University of New Mexico Continuing Education:

7 Continuing Education Credit (CEU)
Acceptable by employers for reimbursements
Valid for professional licensing renewal
Verifiable by The University of New Mexico Registrar’s office
Add to LinkedIn and share with your network

We accept tuition benefits

All of our programs are backed by a certificate from The University of New Mexico, Continuing Education. This means that you may be eligible to attend the bootcamp for FREE.

Not sure? fill out the form so we can help. Contact us ->

Recognition

Reserve your spot

We have carefully designed our data science bootcamp to bring you the best practical exposure in the world of data science, programming, and machine learning.

Data science bootcamp

Self paced

Launching Soon

A word from our alumni

At the end of the bootcamp I think all of us are at the same place, so that’s the beauty of this program. You could come from any background because we are covering some diverse topics here, and making sure it’s a level playing field and again, going back to to the motto of, hey, this is for everyone.

Kapil PandeyAnalytics Manager at Samsung

It was a great experience for increasing the expertise on data science. The abstract concepts were explained well and always focused on real applications and business cases. The pace was adjusted as needed to let everyone follow the topics. Week was intense as there are many topics to cover but schedule was well managed to optimize people attention.

Harris ThambyManager at Microsoft

What I enjoyed most about the Data Science Dojo bootcamp was the enthusiasm for data science from the instructors.

Eldon PrinceSenior Principal Data Scientist at DELL

Highly valuable course condensed into a single week. Enough background is given to allow one to continue their learning and training on their own.Good energy from the instructors. It is clear that they have real industry experience working on problems.

Ben GawiserSoftware Engineer at Amazon

Frequently asked questions, answered.

Is the training conducted in R or Python?

We are a technology-neutral and vendor-agnostic training. Both R and Python code samples will be shared with attendees.

Are the live sessions recorded for viewing later?

While we do not provide exact recordings of the live sessions, all key topics and content covered during the class are available in our companion courses. We’ve created structured lesson clips that reflect the material discussed, allowing you to review everything at your convenience even if you miss a live session.

How many Continuing Education Credits (CEUs) will I receive?

Yes, you will receive 7 CEU’s after completing the Data Science Bootcamp. You will be able to request a transcript from the University of New Mexico by paying a fee of ten US Dollars.

What is the difference between in-person and online?

There is no difference between the online and in-person bootcamp in terms of curriculum and instructors. The only distinction is that breakfast, lunch, and beverages are provided at the in-person bootcamp.

Do you have any in-person part-time options?

No, we do not offer any in-person part-time options. The program requires a full-time commitment of 40 hours over 5 days.

What is the refund policy?

If, for any reason, you decide to cancel, we will gladly refund your registration fee in full if you notify us at least five business days before the start of the training. We can also transfer your registration to another cohort if preferred.

However, refunds cannot be processed if you have transferred to a different cohort after registration. Additionally, once you have been added to the learning platform and have accessed the course materials, we are unable to issue a refund, as digital content access is considered program participation.

Will I earn a certificate from this data science bootcamp?

Yes! Once you have completed our Data Science Bootcamp, you will be issued a certificate in association with The University of New Mexico Continuing Education that you can print or add to your LinkedIn profile for others to see.

Are there any discounts available?

Yes, discounts are available. The Dojo and Guru packages are currently offered at a 25% early-bird discount. Please note that these discounts are limited and available on a first-come, first-served basis.

How many hours per week are we expected in class?

You are expected to be in class for 8 hours each day.

What backgrounds do people have that take this data science bootcamp?

We have had attendees from a wide range of backgrounds – software engineers, product/program managers, physicists, financial analysts – even medical doctors and veterinarians, attend and successfully completed our data science bootcamp. This bootcamp is for anyone who is curious about data science and willing to explore, segment, analyze, and understand their data in order to make better data-driven decisions. We recommend that you talk to one of our advisors before joining this data science bootcamp. If you would like to set up a time with one of our instructors, please let us know.

What kind of jobs can a data science bootcamp get me?

It depends on your capabilities and skills you have grasped while attending a bootcamp that will lead to a career track that you can choose in data analytics, but here are some jobs you can look forward to, based on your skillset:
Data Scientist
Data Engineer
Machine Learning Engineer
Data Analyst
Business Analyst / Product Analyst

If I have questions during the live instructor-led sessions or while working on homework?

Yes, our live instructor-led sessions are highly interactive. Students are encouraged to ask questions, and our instructors make sure to provide thorough responses without rushing. Additionally, discussions relevant to the topic being taught are actively encouraged. We also understand that questions may arise during homework. To support you, we have a dedicated Discord community where you can receive help from our instructors and connect with fellow students.

What is the transfer policy?

Transfers are allowed once with no penalty. Transfers requested more than once will incur a $200 processing fee.

Are classes taught live or self-paced?

Currently, this bootcamp is discontinued. We plan to relaunch it this year in a self-paced format. Once the updated structure and details are finalized, they will be shared here on our website.

Bootcamps

Courses

Case Studies

Reviews

Consulting

Case studies

Community

Company

Data science bootcamp

Trusted by Leading Companies

4.95

12,000+

2,500+

1M+

Who is this bootcamp for?

Aspiring data scientists & analysts

Engineers & technical professionals

Career changers

Instructors and guest speakers

Albar Wahab

Zaid Ahmed

curriculum

Explore the bootcamp curriculum

Earn a verified certificate

We accept tuition benefits

Recognition

Reserve your spot

Data science bootcamp

Self paced

Launching Soon

A word from our alumni

Frequently asked questions, answered.

Looking to upskill your team?