When you enroll in this course, you'll also be enrolled in this Specialization.
Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate
There are 4 modules in this course
Welcome to AI Applications: Computer Vision and Speech Recognition, where you will gain hands-on expertise in using cutting-edge technologies to process visual data and interpret human speech. This course equips you with practical skills to address real-world challenges in computer vision and speech analysis.
By the end of this course, you will be able to:
- Analyze speech waveforms using advanced audio signal processing techniques.
- Develop a strong understanding of computer vision principles and applications.
- Perform morphological operations on images and videos within a customized environment.
- Implement advanced audio and video processing techniques.
- Apply OpenCV functionalities to build robust solutions for image and video analysis.
This course is ideal for AI enthusiasts, data scientists, and developers aiming to expand their skills in computer vision and speech recognition.
Prior experience with Python programming and a basic understanding of machine learning concepts is recommended for optimal learning.
Master the skills required to build intelligent systems in the evolving field of artificial intelligence with this focused course.
This module is designed to help learners understand the history of AI and how its ongoing development has led to the creation of image and video processing tools like OpenCV. Learn to perform various image processing such as morphological operations.
Demonstration: Opening (Dilation and Erosion)•6 minutes
Demonstration: Closing and Morphological Gradient •6 minutes
Blackhat and Whitehat Transformations•5 minutes
Demonstration: Whitehat/Tophat•4 minutes
Demonstration: Blackhat•6 minutes
Summary of Computer Vision with OpenCV•7 minutes
5 readings•Total 40 minutes
Welcome to AI Applications: Computer Vision and Speech Recognition•10 minutes
Exploring Technologies for Computer Vision•10 minutes
Ethical Considerations in Computer Vision•10 minutes
LBPH Algorithm: Local Binary Patterns Histogram•5 minutes
Watershed Algorithm for Image Processing•5 minutes
5 assignments•Total 54 minutes
Knowledge Check : Computer Vision with OpenCV•30 minutes
Practice Quiz : Evolution of AI and Computer Vision•6 minutes
Practice Quiz : Setting Up Environment•6 minutes
Practice Quiz : Image Processing•6 minutes
Practice Quiz : Morphological Operations•6 minutes
2 discussion prompts•Total 20 minutes
Introduce Yourself•10 minutes
Simplifying OpenCV Environment Setup•10 minutes
Video Processing using OpenCV
Module 2•3 hours to complete
Module details
In the second module of this course, you'll delve deeper into OpenCV functionalities for video preprocessing. You'll learn how to play videos using OpenCV, extract and combine frames, and demonstrate the use of Haar cascades and their integration with OpenCV.
Demonstration: Implementing Facial Landmarks on Images•4 minutes
Demonstration: Implementing Facial Landmarks on Videos•5 minutes
Summary of Video Processing with OpenCV•6 minutes
2 readings•Total 20 minutes
Marker-Based Augmented Reality (AR)•10 minutes
Pros and Cons of OpenCV’s Haar cascade Face Detector•10 minutes
3 assignments•Total 42 minutes
Knowledge Check : Video Processing with OpenCV•30 minutes
Practice Quiz : Video Processing Using OpenCV•6 minutes
Practice Quiz : Exploring Various Techniques for Face Detection and Recognition•6 minutes
1 discussion prompt•Total 10 minutes
The Future of Face Recognition and Security: Opportunities and Challenges•10 minutes
Speech Recognition and Audio Analysis
Module 3•3 hours to complete
Module details
In the third module of this course, you will learn the fundamental structure of speech and how it is organized. Process speech by analyzing waveforms and applying various techniques to manipulate them effectively.
Summary of Speech Recognition and Audio Analysis•5 minutes
2 readings•Total 20 minutes
Speech Analysis In Cyber Security•10 minutes
Speech Processing - Interactive Creation and Evaluation (SPICE) Toolkit•10 minutes
3 assignments•Total 42 minutes
Knowledge Check : Speech Recognition and Audio Analysis•30 minutes
Practice Quiz : Speech and it's Variation•6 minutes
Practice Quiz : Digitizing and Analyzing Speech•6 minutes
1 discussion prompt•Total 10 minutes
The Role of Speech Analysis in Modern AI Applications•10 minutes
Course Wrap-Up and Assessment
Module 4•1 hour to complete
Module details
This module is designed to assess an individual on the various concepts and teachings covered in this course. Answer a comprehensive quiz which marks you as a learner who is confident in working with Computer Vision and OpenCV.
What's included
1 video1 reading1 assignment1 discussion prompt
Show info about module content
1 video•Total 4 minutes
Summary for AI Applications: Computer Vision and Speech Recognition•4 minutes
1 reading•Total 10 minutes
Practice Project - Vehicle Tracking and Detection•10 minutes
1 assignment•Total 30 minutes
Knowledge Check : AI Applications: Computer Vision and Speech Recognition•30 minutes
1 discussion prompt•Total 10 minutes
Describe Your Learning Journey•10 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Edureka is an online education platform focused on delivering high-quality learning to working professionals. We have the
highest course completion rate in the industry and we strive to create an online ecosystem for our global learners to equip
themselves with industry-relevant skills in today’s cutting edge technologies.
A basic understanding of Python programming and machine learning concepts is recommended.
Do I need any specific software or tools for this course?
Yes, you'll need to set up Python, OpenCV, and other relevant libraries for image, video, and speech processing.
Are the course materials suitable for beginners?
While the course is beginner-friendly, prior Python knowledge and basic understanding of machine learning concepts will enhance your learning experience.
What programming languages and tools are used in this course?
You’ll primarily use Python with libraries and machine learning frameworks for computer vision and speech tasks.
Do I need prior experience in AI or machine learning?
No prior AI or ML experience is required. Basic programming skills in Python are recommended to follow along effectively.
What career opportunities can AI skills in vision and speech open up?
These skills prepare you for roles in AI engineering, computer vision development, speech technology, and data science.
Will I earn a certificate after completing the course?
Yes, you’ll receive a Coursera certificate that validates your AI skills and can be showcased on LinkedIn or to employers.
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Specialization?
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Is financial aid available?
Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.