Robotics: Perception

This course is part of the Robotics Specialization

Instructor: Kostas Daniilidis

40,463 already enrolled

4 modules: gain insight into a topic and learn the fundamentals

Rating: 4.3 (653 reviews)

Intermediate level: some related experience required

Flexible schedule: approx. 33 hours, learn at your own pace

81% of learners liked this course

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

21 quizzes, 1 assignment

Taught in English

When you enroll in this course, you'll also be enrolled in this Specialization.

There are 4 modules in this course

How can robots perceive the world and their own movements so that they can accomplish navigation and manipulation tasks? In this course, we will study how images and videos acquired by cameras mounted on robots are transformed into representations such as features and optical flow. These 2D representations then allow us to extract 3D information about where the camera is and in which direction the robot moves. You will come to understand how grasping objects is facilitated by computing the 3D pose of objects, and how navigation can be accomplished by visual odometry and landmark-based localization.

Geometry of Image Formation

Welcome to Robotics: Perception! We will begin this course with a tutorial on the standard camera models used in computer vision. These models allow us to understand, in a geometric fashion, how light from a scene enters a camera and projects onto a 2D image. By defining these models mathematically, we will be able to understand exactly how a point in 3D corresponds to a point in the image and how an image will change as we move a camera in a 3D environment. In the later modules, we will be able to use this information to perform complex perception tasks such as reconstructing 3D scenes from video.
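To make the 3D-to-image correspondence concrete, here is a minimal sketch of an ideal pinhole projection. It is not taken from the course materials: the function name project_point, the focal length of 800 pixels, and the principal point (320, 240) are illustrative assumptions, and lens distortion is ignored.

```python
def project_point(point_3d, focal_length, principal_point=(0.0, 0.0)):
    """Project a 3D point given in camera coordinates onto the image plane.

    Assumes an ideal pinhole camera looking along +z, with the focal
    length expressed in pixels and no lens distortion.
    """
    x, y, z = point_3d
    if z <= 0:
        raise ValueError("point must lie in front of the camera (z > 0)")
    cx, cy = principal_point
    # Perspective division: image coordinates scale with 1 / depth.
    u = focal_length * x / z + cx
    v = focal_length * y / z + cy
    return (u, v)


# The same lateral offset seen at two depths (hypothetical values).
near = project_point((1.0, 2.0, 4.0), focal_length=800.0,
                     principal_point=(320.0, 240.0))
far = project_point((1.0, 2.0, 8.0), focal_length=800.0,
                    principal_point=(320.0, 240.0))
print(near)  # (520.0, 640.0)
print(far)   # (420.0, 440.0)
```

Doubling the depth halves the offset from the principal point, which is the sense in which the image changes as the camera moves relative to the scene.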

What's included

15 videos, 2 readings, 8 quizzes, 1 assignment, 1 programming assignment

15 videos (total 179 minutes)

Introduction: 10 minutes
Camera Modeling: 10 minutes
Single View Geometry: 14 minutes
More on Perspective Projection: 8 minutes
Glimpse on Vanishing Points: 10 minutes
Perspective Projection I: 14 minutes
Perspective Projection II: 14 minutes
Point-Line Duality: 8 minutes
Rotations and Translations: 18 minutes
Pinhole Camera Model: 10 minutes
Focal Length and Dolly Zoom Effect: 8 minutes
Intrinsic Camera Parameter: 13 minutes
3D World to First Person Transformation: 13 minutes
How to Compute Intrinsics from Vanishing Points: 12 minutes
Camera Calibration: 11 minutes

2 readings (total 11 minutes)

Setting up MATLAB: 10 minutes
Opt-in to Penn Engineering Online Communications: 1 minute

8 quizzes (total 240 minutes)

Introduction: 30 minutes
Vanishing Points: 30 minutes
Perspective Projection: 30 minutes
Rotations and Translations: 30 minutes
Dolly Zoom: 30 minutes
Feeling of Camera Motion: 30 minutes
How to Compute Intrinsics from Vanishing Points: 30 minutes
Camera Calibration: 30 minutes

1 assignment (total 30 minutes)

Learning Style Preference Survey: 30 minutes

1 programming assignment (total 180 minutes)

Dolly Zoom: 180 minutes

Projective Transformations

Now that we have a good camera model, we will explore the geometry of perspective projection in depth. We will find that this projection is the cause of the main challenge in perception: it discards one dimension, which we can no longer observe directly. In this module, we will study several properties of projective transformations, such as vanishing points, which allow us to infer complex information beyond our basic camera model.
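As an illustration of how a vanishing point can be computed, the sketch below uses homogeneous coordinates: the line through two image points is their cross product, and the intersection of two lines is again a cross product. The helper names and the sample pixel coordinates are assumptions made for this example, not part of the course.

```python
def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def line_through(p, q):
    """Homogeneous line through two image points (x, y)."""
    return cross((p[0], p[1], 1.0), (q[0], q[1], 1.0))


def intersect(line_a, line_b, eps=1e-12):
    """Intersection of two homogeneous lines, or None if it lies at infinity."""
    x, y, w = cross(line_a, line_b)
    if abs(w) < eps:
        return None  # the lines are parallel in the image
    return (x / w, y / w)


# Images of two lines that are parallel in 3D (hypothetical coordinates).
rail_left = line_through((0.0, 0.0), (2.0, 1.0))
rail_right = line_through((0.0, 4.0), (2.0, 3.0))
vanishing_point = intersect(rail_left, rail_right)
print(vanishing_point)  # (4.0, 2.0)

# Lines that stay parallel in the image meet only at infinity.
parallel = intersect(line_through((0.0, 0.0), (1.0, 0.0)),
                     line_through((0.0, 1.0), (1.0, 1.0)))
print(parallel)  # None
```

Returning None for the second case is a design choice for this sketch; in homogeneous coordinates the intersection still exists as a point with a zero last coordinate.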

What's included

5 videos, 4 quizzes, 1 programming assignment

5 videos (total 68 minutes)

Vanishing Points; How to Compute Camera Orientation: 23 minutes
Compute Projective Transformations: 13 minutes
Projective Transformations and Vanishing Points: 6 minutes
Cross Ratios and Single View Metrology: 13 minutes
Two View Soccer Metrology: 11 minutes

4 quizzes (total 120 minutes)

Homogeneous Coordinates: 30 minutes
Projective Transformations: 30 minutes
Vanishing Points: 30 minutes
Cross Ratios and Single View Metrology: 30 minutes

1 programming assignment (total 180 minutes)

Image Projection using Homographies: 180 minutes

Pose Estimation

In this module, we will learn about feature extraction and pose estimation from two images. We will learn how to find the most salient parts of an image and track them across multiple frames (i.e., in a video sequence). We will then learn how to use features to find the position of the camera with respect to a reference frame on a plane using homographies. We will also learn how to make these techniques more robust, using least squares to handle noisy feature points and RANSAC to remove completely erroneous feature points.
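The RANSAC idea can be sketched on a simpler model than a homography. The example below fits a 2D line to points that include a few gross outliers; the function names, the inlier threshold, the iteration count, and the sample data are all assumptions chosen for illustration, and a real pipeline would replace the line model with a homography or pose estimator.

```python
import random


def fit_line(p, q):
    """Return (a, b, c) with a*x + b*y + c = 0 and a^2 + b^2 = 1 through p and q."""
    (x1, y1), (x2, y2) = p, q
    a, b = y2 - y1, x1 - x2
    norm = (a * a + b * b) ** 0.5
    if norm == 0:
        return None  # degenerate sample: the two points coincide
    return (a / norm, b / norm, -(a * x1 + b * y1) / norm)


def ransac_line(points, threshold=0.1, iterations=200, seed=0):
    """Keep the two-point line hypothesis supported by the most inliers."""
    rng = random.Random(seed)
    best_line, best_inliers = None, []
    for _ in range(iterations):
        p, q = rng.sample(points, 2)      # minimal sample for a line
        line = fit_line(p, q)
        if line is None:
            continue
        a, b, c = line
        # With a unit normal, |a*x + b*y + c| is the point-to-line distance.
        inliers = [pt for pt in points
                   if abs(a * pt[0] + b * pt[1] + c) <= threshold]
        if len(inliers) > len(best_inliers):
            best_line, best_inliers = line, inliers
    return best_line, best_inliers


# Ten points on y = 2x + 1 plus three completely erroneous points.
points = [(float(x), 2.0 * x + 1.0) for x in range(10)]
outliers = [(1.0, 9.0), (4.0, -3.0), (7.0, 2.0)]
line, inliers = ransac_line(points + outliers)
print(len(inliers))  # 10
```

After the outliers are rejected, a least-squares fit over the surviving inliers would typically be used to average out the remaining noise.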

What's included

8 videos, 5 quizzes, 1 programming assignment

8 videos (total 125 minutes)

Visual Features: 23 minutes
Singular Value Decomposition: 30 minutes
RANSAC: Random Sample Consensus I: 13 minutes
Where am I? Part 1: 16 minutes
Where am I? Part 2: 13 minutes
Pose from 3D Point Correspondences: The Procrustes Problem: 9 minutes
Pose from Projective Transformations: 8 minutes
Pose from Point Correspondences P3P: 10 minutes

5 quizzes (total 150 minutes)

Visual Features: 30 minutes
Singular Value Decomposition: 30 minutes
RANSAC: 30 minutes
3D-3D Pose: 30 minutes
Pose Estimation: 30 minutes

1 programming assignment (total 180 minutes)

Image Projection: 180 minutes

Multi-View Geometry

Now we will take what we learned from two-view geometry and extend it to sequences of images, such as a video. We will explain the fundamental geometric constraint between point features in two images, the epipolar constraint, and learn how to use it to extract the relative poses between multiple frames. We will finish by combining all of this for the application of structure from motion, where we compute the trajectory of a camera and a map over many frames and refine our estimates using bundle adjustment.
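The epipolar constraint can be checked numerically with a short sketch. It assumes calibrated (normalized) image coordinates and the convention X2 = R X1 + t for the relative pose, in which case the essential matrix is E = [t]x R and matching points satisfy x2^T E x1 = 0. The rotation angle, translation, and 3D points below are made-up values, and the helper names are hypothetical.

```python
import math


def skew(t):
    """Skew-symmetric matrix [t]x such that [t]x v equals t cross v."""
    tx, ty, tz = t
    return [[0.0, -tz, ty], [tz, 0.0, -tx], [-ty, tx, 0.0]]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def matvec(m, v):
    return [sum(m[i][k] * v[k] for k in range(3)) for i in range(3)]


def normalize(p):
    """Perspective division to normalized image coordinates (x, y, 1)."""
    return [p[0] / p[2], p[1] / p[2], 1.0]


def epipolar_residual(R, t, point_cam1):
    """Evaluate x2^T E x1 for one 3D point observed by both cameras."""
    E = matmul(skew(t), R)
    x1 = normalize(point_cam1)
    point_cam2 = [r + ti for r, ti in zip(matvec(R, point_cam1), t)]
    x2 = normalize(point_cam2)
    return sum(a * b for a, b in zip(x2, matvec(E, x1)))


# Hypothetical relative pose: 10 degree rotation about y plus a translation.
angle = math.radians(10.0)
R = [[math.cos(angle), 0.0, math.sin(angle)],
     [0.0, 1.0, 0.0],
     [-math.sin(angle), 0.0, math.cos(angle)]]
t = [0.5, 0.0, 0.1]

scene_points = [(0.3, -0.2, 4.0), (-1.0, 0.5, 6.0), (2.0, 1.0, 5.0)]
residuals = [epipolar_residual(R, t, p) for p in scene_points]
print(all(abs(r) < 1e-12 for r in residuals))  # True
```

In practice the direction is reversed: the constraint is used to estimate E from many matched points, and the relative rotation and translation (up to scale) are then recovered from E.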

What's included

14 videos, 1 reading, 4 quizzes, 1 programming assignment

14 videos (total 221 minutes)

Epipolar Geometry I: 23 minutes
Epipolar Geometry II: 14 minutes
Epipolar Geometry III: 24 minutes
RANSAC: Random Sample Consensus II: 6 minutes
Nonlinear Least Squares I: 3 minutes
Nonlinear Least Squares II: 6 minutes
Nonlinear Least Squares III: 13 minutes
Optical Flow: 2D Point Correspondences: 19 minutes
3D Velocities from Optical Flow: 16 minutes
3D Motion and Structure from Multiple Views: 18 minutes
Visual Odometry: 19 minutes
Bundle Adjustment I: 17 minutes
Bundle Adjustment II: 18 minutes
Bundle Adjustment III: 17 minutes

1 reading (total 1 minute)

Opt-in to Penn Engineering Online Communications: 1 minute

4 quizzes (total 120 minutes)

Epipolar Geometry: 30 minutes
Nonlinear Least Squares: 30 minutes
3D Velocities from Optical Flow: 30 minutes
Bundle Adjustment: 30 minutes

1 programming assignment (total 180 minutes)

Structure from Motion: 180 minutes

Instructor

Kostas Daniilidis, University of Pennsylvania

Instructor rating: 4.0 (48 ratings)

1 course, 40,463 learners

Offered by

University of Pennsylvania

Learner reviews

4.3 (653 reviews)

5 stars: 60.33%
4 stars: 24.34%
3 stars: 7.81%
2 stars: 4.13%
1 star: 3.36%

Frequently asked questions

When will I have access to the lectures and assignments?

Access to lectures and assignments depends on your type of enrollment. If you take a course in audit mode, you will be able to see most course materials for free. To access graded assignments and to earn a Certificate, you will need to purchase the Certificate experience, during or after your audit. If you don't see the audit option:

The course may not offer an audit option. You can try a Free Trial instead, or apply for Financial Aid.
The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

What will I get if I subscribe to this Specialization?

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile. If you only want to read and view the course content, you can audit the course for free.

What is the refund policy?

If you subscribed, you get a 7-day free trial during which you can cancel at no penalty. After that, we don’t give refunds, but you can cancel your subscription at any time. See our full refund policy.
