Back to Visual Perception
Columbia University

Visual Perception

The ultimate goal of a computer vision system is to generate a detailed symbolic description of each image shown. This course focuses on the all-important problem of perception. We first describe the problem of tracking objects in complex scenes. We look at two key challenges in this context. The first is the separation of an image into object and background using a technique called change detection. The second is the tracking of one or more objects in a video. Next, we examine the problem of segmenting an image into meaningful regions. In particular, we take a bottom-up approach where pixels with similar attributes are grouped together to obtain a region. Finally, we tackle the problem of object recognition. We describe two approaches to the problem. The first directly recognize an object and its pose using the appearance of the object. This method is based on the concept of dimension reduction, which is achieved using principal component analysis. The second approach is to use a neural network to solve the recognition problem as one of learning a mapping from the input (image) to the output (object class, object identity, activity, etc.). We describe how a neural network is constructed and how it is trained using the backpropagation algorithm.

Status: Algorithms
Status: Image Analysis
BeginnerCourse84 hours

Featured reviews

KJ

5.0Reviewed Apr 27, 2022

Amazing course , Well explained and interesting assignments!!!

AD

5.0Reviewed Aug 2, 2025

Excellent course to get your fundamentals right about computer vision

All reviews

Showing: 9 of 9

Ferenc Junger
5.0
Reviewed Nov 21, 2022
Achyut Duggal
5.0
Reviewed Aug 3, 2025
Krushi Jethe
5.0
Reviewed Apr 28, 2022
SOHYUN CHOI
5.0
Reviewed Jul 30, 2024
Marco Morais
5.0
Reviewed Jan 27, 2023
Ali Karakurum
5.0
Reviewed Sep 24, 2024
Shahid Rahman
5.0
Reviewed May 23, 2023
ali al matar
5.0
Reviewed Apr 8, 2026
Abdullah Sadık Satır
3.0
Reviewed May 27, 2025