Mathematics for Machine Learning: Multivariate Calculus

Mathematics for Machine Learning: Multivariate Calculus

This course is part of Mathematics for Machine Learning Specialization

Instructors: Samuel J. Cooper

158,638 already enrolled

Included with

Learn more

6 modules

Gain insight into a topic and learn the fundamentals.

5,763 reviews

Beginner level

No prior experience required

Flexible schedule

2 weeks at 10 hours a week

Learn at your own pace

91%

Most learners liked this course

6 modules

Gain insight into a topic and learn the fundamentals.

5,763 reviews

Beginner level

No prior experience required

Flexible schedule

2 weeks at 10 hours a week

Learn at your own pace

91%

Most learners liked this course

Skills you'll gain

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

25 assignments

Taught in English

See how employees at top companies are mastering in-demand skills

Learn more about Coursera for Business

logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Build your subject-matter expertise

This course is part of the Mathematics for Machine Learning Specialization

When you enroll in this course, you'll also be enrolled in this Specialization.

Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate

There are 6 modules in this course

This course offers a brief introduction to the multivariate calculus required to build many common machine learning techniques. We start at the very beginning with a refresher on the “rise over run” formulation of a slope, before converting this to the formal definition of the gradient of a function. We then start to build up a set of tools for making calculus easier and faster. Next, we learn how to calculate vectors that point up hill on multidimensional surfaces and even put this into action using an interactive game. We take a look at how we can use calculus to build approximations to functions, as well as helping us to quantify how accurate we should expect those approximations to be. We also spend some time talking about where calculus comes up in the training of neural networks, before finally showing you how it is applied in linear regression models. This course is intended to offer an intuitive understanding of calculus, as well as the language necessary to look concepts up yourselves when you get stuck. Hopefully, without going into too much detail, you’ll still come away with the confidence to dive into some more focused machine learning courses in future.

Understanding calculus is central to understanding machine learning! You can think of calculus as simply a set of tools for analysing the relationship between functions and their inputs. Often, in machine learning, we are trying to find the inputs which enable a function to best match the data. We start this module from the basics, by recalling what a function is and where we might encounter one. Following this, we talk about the how, when sketching a function on a graph, the slope describes the rate of change of the output with respect to an input. Using this visual intuition we next derive a robust mathematical definition of a derivative, which we then use to differentiate some interesting functions. Finally, by studying a few examples, we develop four handy time saving rules that enable us to speed up differentiation for many common scenarios.

What's included

10 videos4 readings6 assignments1 discussion prompt1 plugin

10 videos Total 46 minutes

Welcome to Multivariate Calculus 2 minutes
Welcome to Module 1! 1 minute
Functions 4 minutes
Rise Over Run 5 minutes
Definition of a derivative 11 minutes
Differentiation examples & special cases 8 minutes
Product rule 4 minutes
Chain rule 5 minutes
Taming a beast 6 minutes
See you next module! 1 minute

4 readings Total 20 minutes

About Imperial College & the team 5 minutes
How to be successful in this course 5 minutes
Grading Policy 5 minutes
Additional Readings & Helpful References 5 minutes

6 assignments Total 120 minutes

Matching functions visually 20 minutes
Matching the graph of a function to the graph of its derivative 20 minutes
Let's differentiate some functions 20 minutes
Practicing the product rule 20 minutes
Practicing the chain rule 20 minutes
Unleashing the toolbox 20 minutes

1 discussion prompt Total 15 minutes

Nice to meet you! 15 minutes

1 plugin Total 15 minutes

Pre-course Survey 15 minutes

Building on the foundations of the previous module, we now generalise our calculus tools to handle multivariable systems. This means we can take a function with multiple inputs and determine the influence of each of them separately. It would not be unusual for a machine learning method to require the analysis of a function with thousands of inputs, so we will also introduce the linear algebra structures necessary for storing the results of our multivariate calculus analysis in an orderly fashion.

What's included

9 videos5 assignments2 ungraded labs

9 videos Total 41 minutes

Welcome to Module 2! 1 minute
Variables, constants & context 8 minutes
Differentiate with respect to anything 5 minutes
The Jacobian 6 minutes
Jacobian applied 6 minutes
The Sandpit 5 minutes
The Hessian 6 minutes
Reality is hard 5 minutes
See you next module! 0 minutes

5 assignments Total 100 minutes

Practicing partial differentiation 20 minutes
Calculating the Jacobian 20 minutes
Bigger Jacobians! 20 minutes
Calculating Hessians 20 minutes
Assessment: Jacobians and Hessians 20 minutes

2 ungraded labs Total 60 minutes

The Sandpit 30 minutes
The Sandpit - Part 2 30 minutes

Having seen that multivariate calculus is really no more complicated than the univariate case, we now focus on applications of the chain rule. Neural networks are one of the most popular and successful conceptual structures in machine learning. They are build up from a connected web of neurons and inspired by the structure of biological brains. The behaviour of each neuron is influenced by a set of control parameters, each of which needs to be optimised to best fit the data. The multivariate chain rule can be used to calculate the influence of each parameter of the networks, allow them to be updated during training.

What's included

6 videos3 assignments1 programming assignment1 discussion prompt1 ungraded lab

6 videos Total 19 minutes

Welcome to Module 3! 1 minute
Multivariate chain rule 3 minutes
More multivariate chain rule 6 minutes
Simple neural networks 6 minutes
More simple neural networks 4 minutes
See you next module! 1 minute

3 assignments Total 65 minutes

Multivariate chain rule exercise 20 minutes
Simple Artificial Neural Networks 20 minutes
Training Neural Networks 25 minutes

1 programming assignment Total 30 minutes

Backpropagation 30 minutes

1 discussion prompt Total 10 minutes

I ❤️ backpropagation 10 minutes

1 ungraded lab Total 60 minutes

Backpropagation 60 minutes

The Taylor series is a method for re-expressing functions as polynomial series. This approach is the rational behind the use of simple linear approximations to complicated functions. In this module, we will derive the formal expression for the univariate Taylor series and discuss some important consequences of this result relevant to machine learning. Finally, we will discuss the multivariate case and see how the Jacobian and the Hessian come in to play.

What's included

9 videos5 assignments1 plugin

9 videos Total 41 minutes

Welcome to Module 4! 1 minute
Building approximate functions 3 minutes
Power series 4 minutes
Power series derivation 9 minutes
Power series details 6 minutes
Examples 5 minutes
Linearisation 5 minutes
Multivariate Taylor 6 minutes
See you next module! 0 minutes

5 assignments Total 100 minutes

Matching functions and approximations 20 minutes
Applying the Taylor series 15 minutes
Taylor series - Special cases 30 minutes
2D Taylor series 15 minutes
Taylor Series Assessment 20 minutes

1 plugin Total 20 minutes

Visualising Taylor Series 20 minutes

If we want to find the minimum and maximum points of a function then we can use multivariate calculus to do this, say to optimise the parameters (the space) of a function to fit some data. First we’ll do this in one dimension and use the gradient to give us estimates of where the zero points of that function are, and then iterate in the Newton-Raphson method. Then we’ll extend the idea to multiple dimensions by finding the gradient vector, Grad, which is the vector of the Jacobian. This will then let us find our way to the minima and maxima in what is called the gradient descent method. We’ll then take a moment to use Grad to find the minima and maxima along a constraint in the space, which is the Lagrange multipliers method.

What's included

4 videos4 assignments1 discussion prompt1 ungraded lab

4 videos Total 28 minutes

Welcome to Module 5! 8 minutes
Gradient Descent 9 minutes
Constrained optimisation 9 minutes
See you next module! 2 minutes

4 assignments Total 70 minutes

Newton-Raphson in one dimension 20 minutes
Lagrange multipliers 20 minutes
Checking Newton-Raphson 10 minutes
Optimisation scenarios 20 minutes

1 discussion prompt Total 10 minutes

Steepest strategies 10 minutes

1 ungraded lab Total 45 minutes

Gradient descent in a sandpit 45 minutes

In order to optimise the fitting parameters of a fitting function to the best fit for some data, we need a way to define how good our fit is. This goodness of fit is called chi-squared, which we’ll first apply to fitting a straight line - linear regression. Then we’ll look at how to optimise our fitting function using chi-squared in the general case using the gradient descent method. Finally, we’ll look at how to do this easily in Python in just a few lines of code, which will wrap up the course.

What's included

4 videos1 reading2 assignments1 programming assignment1 ungraded lab1 plugin

4 videos Total 25 minutes

Simple linear regression 10 minutes
General non linear least squares 7 minutes
Doing least squares regression analysis in practice 6 minutes
Wrap up of this course 1 minute

1 reading Total 10 minutes

Did you like the course? Let us know! 10 minutes

2 assignments Total 40 minutes

Linear regression 25 minutes
Fitting a non-linear function 15 minutes

1 programming assignment Total 30 minutes

Fitting the distribution of height data 30 minutes

1 ungraded lab Total 30 minutes

Fitting the distribution of heights data 30 minutes

1 plugin Total 15 minutes

Post-course Survey 15 minutes

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructors

Instructor ratings

(692 ratings)

Samuel J. Cooper

Imperial College London

2 Courses 489,950 learners

David Dye

Imperial College London

2 Courses 489,950 learners

A. Freddie Page

Imperial College London

2 Courses 489,950 learners

Offered by

Imperial College London

Explore more from Math and Logic

Packt
Matrix Calculus for Data Science & Machine Learning
Course
Status: Free Trial
DeepLearning.AI
Calculus for Machine Learning and Data Science
Course
Status: Free Trial
Imperial College London
Mathematics for Machine Learning
Specialization
Status: Free Trial
DeepLearning.AI
Linear Algebra for Machine Learning and Data Science
Course

Why people choose Coursera for their career

Felipe M.

Learner since 2018

"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

Jennifer J.

Learner since 2020

"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

Larry W.

Learner since 2021

"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Learner reviews

5 stars
77.12%
4 stars
18.78%
3 stars
3.07%
2 stars
0.64%
1 star
0.38%

Showing 3 of 5763

Reviewed on May 12, 2020

Great course. It is clear and accessible, giving a lot of the intuition of why things are done. Some important topics in calculus are missing, such as Integration, but overall very good course.

Reviewed on Jul 27, 2019

Superb quality. The way instructors teach is really innovative. The course is good in terms of the area it covers but lacks depth, but is a good starting point if you want to dwell more in detail.

Reviewed on Aug 22, 2020

It was very challenging, but not to the point where I felt lost. And that to me means I pushed the limits of my knowledge and skills further than before, which is what I expected from the course.

View more reviews

Open new doors with Coursera Plus

Unlimited access to 10,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription

Learn more

Advance your career with an online degree

Earn a degree from world-class universities - 100% online

Explore degrees

Join over 3,400 global companies that choose Coursera for Business

Upskill your employees to excel in the digital economy

Learn more

Frequently asked questions

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.