When you enroll in this course, you'll also be enrolled in this Specialization.
Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate
There are 4 modules in this course
Newly updated for 2024! Mathematics for Machine Learning and Data Science is a foundational online program created by DeepLearning.AI and taught by Luis Serrano. In machine learning, you apply math concepts through programming. And so, in this specialization, you’ll apply the math concepts you learn using Python programming in hands-on lab exercises. As a learner in this program, you'll need basic to intermediate Python programming skills to be successful.
After completing this course, you will be able to:
• Describe and quantify the uncertainty inherent in predictions made by machine learning models, using the concepts of probability, random variables, and probability distributions.
• Visually and intuitively understand the properties of commonly used probability distributions in machine learning and data science like Bernoulli, Binomial, and Gaussian distributions
• Apply common statistical methods like maximum likelihood estimation (MLE) and maximum a priori estimation (MAP) to machine learning problems
• Assess the performance of machine learning models using interval estimates and margin of errors
• Apply concepts of statistical hypothesis testing to commonly used tests in data science like AB testing
• Perform Exploratory Data Analysis on a dataset to find, validate, and quantify patterns.
Many machine learning engineers and data scientists need help with mathematics, and even experienced practitioners can feel held back by a lack of math skills. This Specialization uses innovative pedagogy in mathematics to help you learn quickly and intuitively, with courses that use easy-to-follow visualizations to help you see how the math behind machine learning actually works.
We recommend you have a high school level of mathematics (functions, basic algebra) and familiarity with programming (data structures, loops, functions, conditional statements, debugging). Assignments and labs are written in Python but the course introduces all the machine learning libraries you’ll use.
In this week, you will learn about probability of events and various rules of probability to correctly do arithmetic with probabilities. You will learn the concept of conditional probability and the key idea behind Bayes theorem. In lesson 2, we generalize the concept of probability of events to probability distribution over random variables. You will learn about some common probability distributions like the Binomial distribution and the Normal distribution.
(Optional) Partial Grading for Assignments•10 minutes
Week 1 - Slides•10 minutes
2 assignments•Total 45 minutes
Week 1 - Practice Quiz•15 minutes
Week 1 - Summative quiz•30 minutes
1 programming assignment•Total 240 minutes
Naive Bayes•240 minutes
4 ungraded labs•Total 240 minutes
Four Birthday Problems•60 minutes
Monty Hall Problem•60 minutes
Exploratory Data Analysis - Intro to Pandas•60 minutes
Exploratory Data Analysis - Exploring Your Data•60 minutes
Week 2 - Describing probability distributions and probability distributions with multiple variables
Week 2•8 hours to complete
Module details
This week you will learn about different measures to describe probability distributions as well as any dataset. These include measures of central tendency (mean, median, and mode), variance, skewness, and kurtosis. The concept of the expected value of a random variable is introduced to help you understand each of these measures. You will also learn about some visual tools to describe data and distributions. In lesson 2, you will learn about the probability distribution of two or more random variables using concepts like joint distribution, marginal distribution, and conditional distribution. You will end the week by learning about covariance: a generalization of variance to two or more random variables.
Other measures of central tendency: median and mode•6 minutes
Expected value of a Function•4 minutes
Sum of expectations•7 minutes
Variance•11 minutes
Standard Deviation•4 minutes
Sum of Gaussians•3 minutes
Standardizing a Distribution•4 minutes
Skewness and Kurtosis: Moments of a Distribution•2 minutes
Skewness and Kurtosis - Skewness•8 minutes
Skewness and Kurtosis - Kurtosis•7 minutes
Quantiles and Box-Plots•3 minutes
Visualizing data: Box-Plots•3 minutes
Visualizing data: Kernel density estimation•2 minutes
Visualizing data: Violin Plots•1 minute
Visualizing data: QQ plots•2 minutes
Joint Distribution (Discrete) - Part 1•5 minutes
Joint Distribution (Discrete) - Part 2•5 minutes
Joint Distribution (Continuous)•5 minutes
Marginal and Conditional Distribution•7 minutes
Conditional Distribution•5 minutes
Covariance of a Dataset•10 minutes
Covariance of a Probability Distribution•11 minutes
Covariance Matrix•2 minutes
Correlation Coefficient•5 minutes
Multivariate Gaussian Distribution•6 minutes
Week 2 - Conclusion•0 minutes
2 readings•Total 25 minutes
Interactive Tool: Mean, median and standard deviation•15 minutes
Week 2 - Slides•10 minutes
2 assignments•Total 60 minutes
Week 2 - Practice Quiz•30 minutes
Week 2 - Summative Quiz•30 minutes
1 programming assignment•Total 100 minutes
Loaded Dice•100 minutes
3 ungraded labs•Total 180 minutes
Summary statistics and visualization of data sets•60 minutes
Exploratory Data Analysis - Data Visualization and Summary Statistics•60 minutes
Simulating Dice Rolls with Numpy (helper for the assignment, not necessary and not graded)•60 minutes
Week 3 - Sampling and Point estimation
Week 3•6 hours to complete
Module details
This week shifts its focus from probability to statistics. You will start by learning the concept of a sample and a population and two fundamental results from statistics that concern samples and population: the law of large numbers and the central limit theorem. In lesson 2, you will learn the first and the simplest method of estimation in statistics: point estimation. You will see how maximum likelihood estimation, the most common point estimation method, works and how regularization helps prevent overfitting. You'll then learn how Bayesian Statistics incorporates the concept of prior beliefs into the way data is evaluated and conclusions are reached.
What's included
20 videos3 readings2 assignments2 ungraded labs
Show info about module content
20 videos•Total 99 minutes
Population and Sample•6 minutes
Sample Mean•3 minutes
Sample Proportion•2 minutes
Sample Variance•11 minutes
Law of Large Numbers•4 minutes
Central Limit Theorem - Discrete Random Variable•3 minutes
Central Limit Theorem - Continuous Random Variable•8 minutes
Point Estimation•1 minute
Maximum Likelihood Estimation: Motivation •3 minutes
MLE: Bernoulli Example•5 minutes
MLE: Gaussian Example•6 minutes
MLE: Linear Regression•6 minutes
Regularization•3 minutes
Back to "Bayesics"•3 minutes
Bayesian Statistics - Frequentist vs. Bayesian•3 minutes
Bayesian Statistics - MAP•5 minutes
Bayesian Statistics - Updating Priors•9 minutes
Bayesian Statistics - Full Worked Example•11 minutes
Relationship between MAP, MLE and Regularization•6 minutes
Week 3 - Conclusion•0 minutes
3 readings•Total 35 minutes
MLE for Gaussian population•10 minutes
Interactive Tool: Likelihood Functions•15 minutes
Week 3 - Slides•10 minutes
2 assignments•Total 90 minutes
Week 3 - Practice Quiz•30 minutes
Week 3 - Summative Quiz•60 minutes
2 ungraded labs•Total 120 minutes
Sampling data from different distribution and studying the distribution of sample mean•60 minutes
Exploratory Data Analysis - Linear Regression•60 minutes
Week 4 - Confidence Intervals and Hypothesis testing
Week 4•6 hours to complete
Module details
This week you will learn another estimation method called interval estimation. The most common interval estimates are confidence intervals and you will see how they are calculated and how to correctly interpret them. In lesson 2, you will learn about hypothesis testing where estimates are formulated as a hypothesis and then tested in the presence of available evidence or a sample of data. You will learn the concept of p-value that helps in making a decision about a hypothesis test and also learn some common tests like the t-test, two-sample t-test, and the paired t-test. You will end the week with an interesting application of hypothesis testing in data science: A/B testing.
DeepLearning.AI is an education technology company that develops a global community of AI talent.
DeepLearning.AI's expert-led educational experiences provide AI practitioners and non-technical professionals with the necessary tools to go all the way from foundational basics to advanced application, empowering them to build an AI-powered future.
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Specialization?
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Is financial aid available?
Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.