Master the essential preprocessing techniques that transform raw visual data into model-ready inputs for computer vision systems. This course empowers you to systematically prepare image data through normalization and color-space conversions, then advance to extracting meaningful motion information from video sequences. You'll apply pixel value normalization, execute color transformations between RGB, grayscale, HSV, and BGR formats, then implement optical flow algorithms and frame differencing to capture temporal dynamics. By completing this course, you'll be able to:

Process Images, Create Captioning AI Models

Process Images, Create Captioning AI Models
This course is part of Vision & Audio AI Systems Specialization

Instructor: Hurix Digital
Access provided by Inter IKEA
Recommended experience
What you'll learn
Image preprocessing using normalization and color-space conversion ensures stable training and consistent model performance.
Optical flow and frame differencing complement motion analysis, helping systems capture scene dynamics over time.
Preprocessing is essential for vision tasks, directly affecting model convergence, stability, and real-world results
Motion feature extraction links static images with dynamic understanding for recognition, tracking, and navigation.
Skills you'll gain
Tools you'll learn
Details to know

Add to your LinkedIn profile
3 assignments
March 2026
See how employees at top companies are mastering in-demand skills

Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate

There are 2 modules in this course
Learners will master systematic image preprocessing techniques including normalization and color-space conversions to prepare raw visual data for computer vision applications.
What's included
3 videos1 reading1 assignment1 ungraded lab
Learners will master optical flow and frame differencing techniques to extract temporal motion features from video sequences for computer vision applications.
What's included
2 videos1 reading2 assignments
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructor

Offered by
Why people choose Coursera for their career

Felipe M.

Jennifer J.

Larry W.






