What prior knowledge do I need for this course?

A basic understanding of Python programming and machine learning concepts is recommended.

Do I need any specific software or tools for this course?

Yes, you'll need to set up Python, OpenCV, and other relevant libraries for image, video, and speech processing.

Are the course materials suitable for beginners?

While the course is beginner-friendly, prior Python knowledge and basic understanding of machine learning concepts will enhance your learning experience.

What programming languages and tools are used in this course?

You’ll primarily use Python with libraries and machine learning frameworks for computer vision and speech tasks.

Do I need prior experience in AI or machine learning?

No prior AI or ML experience is required. Basic programming skills in Python are recommended to follow along effectively.

What career opportunities can AI skills in vision and speech open up?

These skills prepare you for roles in AI engineering, computer vision development, speech technology, and data science.

Will I earn a certificate after completing the course?

Yes, you’ll receive a Coursera certificate that validates your AI skills and can be showcased on LinkedIn or to employers.

When will I have access to the lectures and assignments?

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

What will I get if I subscribe to this Specialization?

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Is financial aid available?

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

AI Applications: Computer Vision and Speech Recognition

4 days left! Save on skills that make you shine with 40% off 3 months of Coursera Plus. Save now

AI Applications: Computer Vision and Speech Recognition

This course is part of Mastering AI: Neural Nets, Vision System, Speech Recognition Specialization

Instructor: Edureka

Included with

Learn more

4 modules

Gain insight into a topic and learn the fundamentals.

Intermediate level

Recommended experience

1 week to complete

at 10 hours a week

Flexible schedule

Learn at your own pace

4 modules

Gain insight into a topic and learn the fundamentals.

Intermediate level

Recommended experience

1 week to complete

at 10 hours a week

Flexible schedule

Learn at your own pace

What you'll learn

Analyze speech waveforms and apply audio signal processing techniques.
Develop and implement computer vision algorithms using OpenCV.
Perform morphological operations on images and videos for data manipulation.
Apply speech recognition techniques for digitizing and analyzing audio signals.

Skills you'll gain

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

12 assignments

Taught in English

See how employees at top companies are mastering in-demand skills

Learn more about Coursera for Business

logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Build your subject-matter expertise

This course is part of the Mastering AI: Neural Nets, Vision System, Speech Recognition Specialization

When you enroll in this course, you'll also be enrolled in this Specialization.

Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate

There are 4 modules in this course

Welcome to AI Applications: Computer Vision and Speech Recognition, where you will gain hands-on expertise in using cutting-edge technologies to process visual data and interpret human speech. This course equips you with practical skills to address real-world challenges in computer vision and speech analysis.

By the end of this course, you will be able to: - Analyze speech waveforms using advanced audio signal processing techniques. - Develop a strong understanding of computer vision principles and applications. - Perform morphological operations on images and videos within a customized environment. - Implement advanced audio and video processing techniques. - Apply OpenCV functionalities to build robust solutions for image and video analysis. This course is ideal for AI enthusiasts, data scientists, and developers aiming to expand their skills in computer vision and speech recognition. Prior experience with Python programming and a basic understanding of machine learning concepts is recommended for optimal learning. Master the skills required to build intelligent systems in the evolving field of artificial intelligence with this focused course.

Module details

This module is designed to help learners understand the history of AI and how its ongoing development has led to the creation of image and video processing tools like OpenCV. Learn to perform various image processing such as morphological operations.

What's included

32 videos5 readings5 assignments2 discussion prompts

32 videosTotal 166 minutes

Course Introduction4 minutes
Industrial Breakthrough in Audio and Speech Recognition 2 minutes
Speech Recognition Technology5 minutes
Computer Vision Application5 minutes
Computer Vision Applications: Medical and Plant Disease6 minutes
AI Responsibility Pyramid7 minutes
Evolution of Computer Vision and Speech Analysis5 minutes
Evolution of Speech Analysis5 minutes
What is OpenCV?6 minutes
Installing OpenCV on Windows 4 minutes
Installing OpenCV on Windows: Handling Libraries in Jupyter5 minutes
Installing Integrated Libraries - NumPy, Matplotlib, SciPy, and Pillow6 minutes
Installing Integrated Libraries - Dlib, Scikit, and Pytorch5 minutes
Operations on OpenCV3 minutes
Demonstration: Loading the Image and Encoding Image to RGB6 minutes
Demonstration: Resizing, Rotating and Flipping the Image7 minutes
Demonstration: Gaussian Blur7 minutes
Demonstration: Edge Detection and Conversion6 minutes
Demonstration: Image Thresholding - Binary Image5 minutes
Demonstration: Different Methods of Thresholding6 minutes
Demonstration: Practical Use Cases3 minutes
What is Adaptive Thresholding5 minutes
Demonstration of Global Adaptive Threshold5 minutes
Demonstration: Implementing Adaptive Thresholding Methods7 minutes
Morphological Operations5 minutes
Morphological Operations in OpenCV4 minutes
Demonstration: Opening (Dilation and Erosion)6 minutes
Demonstration: Closing and Morphological Gradient 6 minutes
Blackhat and Whitehat Transformations5 minutes
Demonstration: Whitehat/Tophat4 minutes
Demonstration: Blackhat6 minutes
Summary of Computer Vision with OpenCV7 minutes

5 readingsTotal 40 minutes

Welcome to AI Applications: Computer Vision and Speech Recognition10 minutes
Exploring Technologies for Computer Vision10 minutes
Ethical Considerations in Computer Vision10 minutes
LBPH Algorithm: Local Binary Patterns Histogram5 minutes
Watershed Algorithm for Image Processing5 minutes

5 assignmentsTotal 54 minutes

Knowledge Check : Computer Vision with OpenCV30 minutes
Practice Quiz : Evolution of AI and Computer Vision6 minutes
Practice Quiz : Setting Up Environment6 minutes
Practice Quiz : Image Processing6 minutes
Practice Quiz : Morphological Operations6 minutes

2 discussion promptsTotal 20 minutes

Introduce Yourself10 minutes
Simplifying OpenCV Environment Setup10 minutes

In the second module of this course, you'll delve deeper into OpenCV functionalities for video preprocessing. You'll learn how to play videos using OpenCV, extract and combine frames, and demonstrate the use of Haar cascades and their integration with OpenCV.

What's included

28 videos2 readings3 assignments1 discussion prompt

28 videosTotal 137 minutes

Video Processing3 minutes
Demonstration: Implementing Frame by Frame Video Processing4 minutes
Demonstration: Exiting the Processing Operation3 minutes
Demonstration: Initializing the Video Frames6 minutes
Demonstration: Saving the Frames7 minutes
Demonstration: Loading the Data 5 minutes
Demonstration: Reading and Writing Operations 6 minutes
Demonstration: Histogram Matching7 minutes
Demonstration: Matching Source and Reference Images5 minutes
Demonstration: Cumulative Distribution Function5 minutes
Demonstration: Differences in Images4 minutes
Haar Cascade2 minutes
Haar Cascade: Algorithm Overview6 minutes
Haar Cascade Application and Limitation2 minutes
Demonstration: Implementation of Haar Cascade Algorithm7 minutes
Demonstration: Face Detection Code for Static Image4 minutes
Demonstration: Implementing Boundary Box for Face Detection6 minutes
Demonstration: Applying Face Detection on Images6 minutes
Introduction to Face Recognition6 minutes
Demonstration: Setting Up Pre-requisite Libraries and Loading the Image2 minutes
Demonstration: Face Recognition and Detection5 minutes
Demonstration: Facial Landmark Detection with File Loading and Library Setup5 minutes
Demonstration: Adjusting Video details through OpenCV6 minutes
Demonstration: Face Recognition7 minutes
Demonstration: Encoding Facial Landmarks 4 minutes
Demonstration: Implementing Facial Landmarks on Images4 minutes
Demonstration: Implementing Facial Landmarks on Videos5 minutes
Summary of Video Processing with OpenCV6 minutes

2 readingsTotal 20 minutes

Marker-Based Augmented Reality (AR)10 minutes
Pros and Cons of OpenCV’s Haar cascade Face Detector10 minutes

3 assignmentsTotal 42 minutes

Knowledge Check : Video Processing with OpenCV30 minutes
Practice Quiz : Video Processing Using OpenCV6 minutes
Practice Quiz : Exploring Various Techniques for Face Detection and Recognition6 minutes

1 discussion promptTotal 10 minutes

The Future of Face Recognition and Security: Opportunities and Challenges10 minutes

In the third module of this course, you will learn the fundamental structure of speech and how it is organized. Process speech by analyzing waveforms and applying various techniques to manipulate them effectively.

What's included

27 videos2 readings3 assignments1 discussion prompt

27 videosTotal 137 minutes

Introduction to Speech: Audio Data 4 minutes
Introduction to Speech: Human Computer Interaction and Applications5 minutes
Processing Speech 6 minutes
Speech Production7 minutes
Difficulties in Analyzing Speech7 minutes
Working of Sound Waves5 minutes
ADC and Sample Rate, Bit Rate3 minutes
Conversion of ADC (Analog to Digital Converter) to DAC (Digital to Analog Converter)5 minutes
Demonstration: Generating Sound7 minutes
Demonstration: Spectrogram6 minutes
Demonstration: Signal Frequencies Over Time4 minutes
Summary of Audio File Analysis2 minutes
Demonstration: Converting a Sound File into Waveform7 minutes
Human Speech7 minutes
Speech Waveform5 minutes
Digital Signal Processing7 minutes
MFCC (Mel Frequency Cepstral Coefficient)6 minutes
Windowing Formula and Cepstrum5 minutes
Demonstration: Computing the Spectrogram4 minutes
Demonstration: Digitizing the Audio Data5 minutes
Demonstration: Converting Fragmented Parts of Audio File for Speech Recognition4 minutes
Voice Onset, Voice Offset, Tremor, and Noise Detection6 minutes
Understanding the Concepts of Voice Onset and Offset3 minutes
Tremor Detection2 minutes
Demonstration: ZCR, Pitch Detection, Voice Activity Detection5 minutes
Demonstration: Tremor Detection5 minutes
Summary of Speech Recognition and Audio Analysis5 minutes

2 readingsTotal 20 minutes

Speech Analysis In Cyber Security10 minutes
Speech Processing - Interactive Creation and Evaluation (SPICE) Toolkit10 minutes

3 assignmentsTotal 42 minutes

Knowledge Check : Speech Recognition and Audio Analysis30 minutes
Practice Quiz : Speech and it's Variation6 minutes
Practice Quiz : Digitizing and Analyzing Speech6 minutes

1 discussion promptTotal 10 minutes

The Role of Speech Analysis in Modern AI Applications10 minutes

This module is designed to assess an individual on the various concepts and teachings covered in this course. Answer a comprehensive quiz which marks you as a learner who is confident in working with Computer Vision and OpenCV.