When you enroll in this course, you'll also be enrolled in this Specialization.
Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate
There are 6 modules in this course
This course offers a brief introduction to the multivariate calculus required to build many common machine learning techniques. We start at the very beginning with a refresher on the “rise over run” formulation of a slope, before converting this to the formal definition of the gradient of a function. We then start to build up a set of tools for making calculus easier and faster. Next, we learn how to calculate vectors that point up hill on multidimensional surfaces and even put this into action using an interactive game. We take a look at how we can use calculus to build approximations to functions, as well as helping us to quantify how accurate we should expect those approximations to be. We also spend some time talking about where calculus comes up in the training of neural networks, before finally showing you how it is applied in linear regression models. This course is intended to offer an intuitive understanding of calculus, as well as the language necessary to look concepts up yourselves when you get stuck. Hopefully, without going into too much detail, you’ll still come away with the confidence to dive into some more focused machine learning courses in future.
Understanding calculus is central to understanding machine learning! You can think of calculus as simply a set of tools for analysing the relationship between functions and their inputs. Often, in machine learning, we are trying to find the inputs which enable a function to best match the data. We start this module from the basics, by recalling what a function is and where we might encounter one. Following this, we talk about the how, when sketching a function on a graph, the slope describes the rate of change of the output with respect to an input. Using this visual intuition we next derive a robust mathematical definition of a derivative, which we then use to differentiate some interesting functions. Finally, by studying a few examples, we develop four handy time saving rules that enable us to speed up differentiation for many common scenarios.
Matching the graph of a function to the graph of its derivative•20 minutes
Let's differentiate some functions•20 minutes
Practicing the product rule•20 minutes
Practicing the chain rule•20 minutes
Unleashing the toolbox•20 minutes
1 discussion prompt•Total 15 minutes
Nice to meet you!•15 minutes
1 plugin•Total 15 minutes
Pre-course Survey•15 minutes
Multivariate calculus
Module 2•3 hours to complete
Module details
Building on the foundations of the previous module, we now generalise our calculus tools to handle multivariable systems. This means we can take a function with multiple inputs and determine the influence of each of them separately. It would not be unusual for a machine learning method to require the analysis of a function with thousands of inputs, so we will also introduce the linear algebra structures necessary for storing the results of our multivariate calculus analysis in an orderly fashion.
What's included
9 videos5 assignments2 ungraded labs
Show info about module content
9 videos•Total 41 minutes
Welcome to Module 2!•1 minute
Variables, constants & context•8 minutes
Differentiate with respect to anything•5 minutes
The Jacobian•6 minutes
Jacobian applied•6 minutes
The Sandpit•5 minutes
The Hessian•6 minutes
Reality is hard•5 minutes
See you next module!•0 minutes
5 assignments•Total 100 minutes
Practicing partial differentiation•20 minutes
Calculating the Jacobian•20 minutes
Bigger Jacobians!•20 minutes
Calculating Hessians•20 minutes
Assessment: Jacobians and Hessians•20 minutes
2 ungraded labs•Total 60 minutes
The Sandpit•30 minutes
The Sandpit - Part 2•30 minutes
Multivariate chain rule and its applications
Module 3•3 hours to complete
Module details
Having seen that multivariate calculus is really no more complicated than the univariate case, we now focus on applications of the chain rule. Neural networks are one of the most popular and successful conceptual structures in machine learning. They are build up from a connected web of neurons and inspired by the structure of biological brains. The behaviour of each neuron is influenced by a set of control parameters, each of which needs to be optimised to best fit the data. The multivariate chain rule can be used to calculate the influence of each parameter of the networks, allow them to be updated during training.
The Taylor series is a method for re-expressing functions as polynomial series. This approach is the rational behind the use of simple linear approximations to complicated functions. In this module, we will derive the formal expression for the univariate Taylor series and discuss some important consequences of this result relevant to machine learning. Finally, we will discuss the multivariate case and see how the Jacobian and the Hessian come in to play.
What's included
9 videos5 assignments1 plugin
Show info about module content
9 videos•Total 41 minutes
Welcome to Module 4!•1 minute
Building approximate functions•3 minutes
Power series•4 minutes
Power series derivation•9 minutes
Power series details•6 minutes
Examples•5 minutes
Linearisation•5 minutes
Multivariate Taylor•6 minutes
See you next module!•0 minutes
5 assignments•Total 100 minutes
Matching functions and approximations•20 minutes
Applying the Taylor series•15 minutes
Taylor series - Special cases•30 minutes
2D Taylor series•15 minutes
Taylor Series Assessment•20 minutes
1 plugin•Total 20 minutes
Visualising Taylor Series•20 minutes
Intro to optimisation
Module 5•3 hours to complete
Module details
If we want to find the minimum and maximum points of a function then we can use multivariate calculus to do this, say to optimise the parameters (the space) of a function to fit some data. First we’ll do this in one dimension and use the gradient to give us estimates of where the zero points of that function are, and then iterate in the Newton-Raphson method. Then we’ll extend the idea to multiple dimensions by finding the gradient vector, Grad, which is the vector of the Jacobian. This will then let us find our way to the minima and maxima in what is called the gradient descent method. We’ll then take a moment to use Grad to find the minima and maxima along a constraint in the space, which is the Lagrange multipliers method.
In order to optimise the fitting parameters of a fitting function to the best fit for some data, we need a way to define how good our fit is. This goodness of fit is called chi-squared, which we’ll first apply to fitting a straight line - linear regression. Then we’ll look at how to optimise our fitting function using chi-squared in the general case using the gradient descent method. Finally, we’ll look at how to do this easily in Python in just a few lines of code, which will wrap up the course.
Imperial College London is a world top ten university with an international reputation for excellence in science, engineering, medicine and business. located in the heart of London. Imperial is a multidisciplinary space for education, research, translation and commercialisation, harnessing science and innovation to tackle global challenges.
Imperial students benefit from a world-leading, inclusive educational experience, rooted in the College’s world-leading research. Our online courses are designed to promote interactivity, learning and the development of core skills, through the use of cutting-edge digital technology.
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
Jennifer J.
Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
Larry W.
Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
Chaitanya A.
"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."
Learner reviews
4.7
5,773 reviews
5 stars
77.10%
4 stars
18.77%
3 stars
3.08%
2 stars
0.64%
1 star
0.39%
Showing 3 of 5773
K
KB
4·
Reviewed on Jul 23, 2020
Very informative refresher on the basics of differentiation, though some of the later topics could have been fleshed out more (i.e. Taylor Series, Lagrange Multipliers, etc). Overall very good.
W
WW
5·
Reviewed on Jan 12, 2021
Quality of instruction and the visual presentation was superb. Completing the second course in this specialisation in Machine Learning has improved my confidence to tackle more advanced topics
A
AK
4·
Reviewed on Jul 27, 2019
Superb quality. The way instructors teach is really innovative. The course is good in terms of the area it covers but lacks depth, but is a good starting point if you want to dwell more in detail.
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Specialization?
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Is financial aid available?
Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.