By completing this course, you'll master building powerful machine learning systems that excel with limited data. You'll gain expertise in multi-task learning, meta-learning, and advanced data augmentation—from physics-based simulations to generative approaches—enabling models to adapt quickly and perform beyond their dataset size.

Machine Learning with Small Data Part 2

Machine Learning with Small Data Part 2

Instructor: Sarah Ostadabbas
Access provided by Anima Educacao
Skills you'll gain
- Machine Learning Algorithms
- Applied Machine Learning
- Machine Learning Methods
- Model Optimization
- Small Data
- Deep Learning
- Generative Model Architectures
- Computer Vision
- Image Analysis
- Transfer Learning
- 3D Modeling
- Computer Graphics
- Machine Learning
- Simulation and Simulation Software
- Model Training
- 3D Assets
- Simulations
Tools you'll learn
Details to know

Add to your LinkedIn profile
7 assignments
See how employees at top companies are mastering in-demand skills

There are 7 modules in this course
In this module, we will introduce the fundamentals of Multi-Task Learning (MTL), a paradigm where multiple related tasks are learned simultaneously by sharing representations. This approach leverages the commonalities among tasks to improve generalization, reduce overfitting, and achieve better performance with fewer training examples. We will explore how MTL is applied across various domains, such as natural language processing, computer vision, and speech recognition, and examine practical examples such as using MTL to enhance image classification and object detection in autonomous systems. Students will gain insights into both the benefits and challenges of MTL, including issues such as task imbalance, negative transfer, and scalability. Additionally, we will delve into meta-learning techniques, such as Conditional Neural Adaptive Processes (CNAPs), that extend MTL by enabling models to quickly adapt to new tasks with minimal data.
What's included
1 video15 readings1 assignment
1 video•Total 6 minutes
- Multi-Task Learning•6 minutes
15 readings•Total 68 minutes
- Course Introduction•2 minutes
- Syllabus - Machine Learning for Small Data Part 2•10 minutes
- Academic Integrity•1 minute
- Introduction to Multi-Task Learning•2 minutes
- Examples of Multi-Task Learning•5 minutes
- Why Multi-Task Learning•5 minutes
- Key Challenges in MTL•2 minutes
- Meta-Learning and Few-Shot Learning for Multi-Task Learning•5 minutes
- An Overview of Conditional Neural Processes (CNPs)•10 minutes
- Conditional Neural Adaptive Processes (CNAPs)•2 minutes
- Adaptation Mechanisms of CNAPs•10 minutes
- CNAPs Balances Adaptation•2 minutes
- Key Extension in CNAPs•5 minutes
- CNAPs in Practice•2 minutes
- Adaptation Network for CNAPs•5 minutes
1 assignment•Total 20 minutes
- Module 8 Quiz•20 minutes
This module explores the concept of meta-learning, or "learning to learn," which enables models to generalize across various tasks by leveraging knowledge from similar tasks. We will delve into key meta-learning algorithms such as Model-Agnostic Meta-Learning (MAML) and Prototypical Networks and examine their applications in computer vision using datasets such as ImageNet, Omniglot, CUB-200-2011, and FGVC-Aircraft. The module also covers the Meta-Dataset framework, which provides a diverse range of tasks for training robust and adaptable meta-learning models.
What's included
1 video7 readings1 assignment
1 video•Total 4 minutes
- Meta Learning•4 minutes
7 readings•Total 36 minutes
- What is Meta-Learning?•3 minutes
- Model-Agnostic Meta-Learning (MAML)•5 minutes
- Prototypical Networks•5 minutes
- Beyond Simple Meta-Learning •3 minutes
- Mathematical Formulation of Meta-Learning•5 minutes
- Mathematical Formulation of Transductive Learning•5 minutes
- An Overview of Some Vision Meta-Datasets•10 minutes
1 assignment•Total 15 minutes
- Module 9 Quiz•15 minutes
This module focuses on generative models for data augmentation, covering key generative AI techniques that enhance machine learning applications by generating synthetic but realistic data. We begin by introducing generative adversarial networks (GANs), Variational Autoencoders (VAEs), Normalizing Flows, Diffusion Models, and Motion Graphs, highlighting their mathematical foundations, training mechanisms, and real-world applications. Additionally, we discuss the limitations of each model and the computational challenges they present. The lecture provides insights into how generative models contribute to modern AI systems, including image synthesis, domain adaptation, super-resolution, motion synthesis, and data augmentation in small-data learning scenarios.
What's included
1 video28 readings1 assignment
1 video•Total 5 minutes
- Learning with Data Augmentation: Data-Driven Simulation•5 minutes
28 readings•Total 152 minutes
- Introduction to Generative Models•10 minutes
- Limitations of Generative Models for Data Augmentation•5 minutes
- Generative Adversarial Networks (GANs)•10 minutes
- Applications of Generative Models•2 minutes
- Vanilla GAN•2 minutes
- Conditional GAN (cGAN)•5 minutes
- Deep Convolutional GAN (DCGAN)•5 minutes
- Wasserstein GAN (WGAN)•4 minutes
- CycleGAN•5 minutes
- Progressive Growing of GANs (PGGAN)•5 minutes
- InfoGAN•5 minutes
- BigGAN•5 minutes
- Super-Resolution GAN (SRGAN)•5 minutes
- Text-to-Image GAN•5 minutes
- Autoencoder Basics•5 minutes
- Variational Autoencoders•5 minutes
- Probabilistic Encoder, Reparameterization Trick•10 minutes
- VAE Loss Function•5 minutes
- Vanilla VAE •2 minutes
- Beta-VAE •5 minutes
- Conditional VAE•5 minutes
- VQ-VAE•5 minutes
- Flow-Based Models•10 minutes
- Advancements in Flow-Based Generative Models Part 1•5 minutes
- Advancements in Flow-Based Generative Models Part 2•5 minutes
- Advancements in Flow-Based Generative Models Part 3•10 minutes
- Diffusion Models•5 minutes
- Comparative Summary of Generative Models•2 minutes
1 assignment•Total 20 minutes
- Module 10 Quiz•20 minutes
This module focuses on physics-based simulation for data augmentation, exploring how physics-driven techniques generate realistic synthetic data to enhance machine learning models. We will discuss key advantages of physics-based simulations, such as scalability, cost-effectiveness, and their ability to model rare events. The module also covers notable approaches, including GeoNet (CVPR 2018) for depth and motion estimation, ScanAva (ECCVW 2018) for semi-supervised learning with 3D avatars, and SMPL (ACM Transactions on Graphics, Volume 15) for human body modeling. Additionally, we introduce equation-based simulation techniques such as Finite Element Method (FEM) and Navier-Stokes equations for modeling fluid dynamics. The module highlights challenges in bridging the simulation-to-reality gap and optimizing computational costs while ensuring high-fidelity synthetic data generation.
What's included
1 video10 readings1 assignment
1 video•Total 5 minutes
- Introduction to Physics-Based Simulation•5 minutes
10 readings•Total 64 minutes
- Physics-Based Simulation•3 minutes
- GeoNet: Using Physical Relationship in Image Formation •10 minutes
- Avatar-Based Simulation•3 minutes
- ScanAva•10 minutes
- Skinned Multi-Person Linear Model (SMPL)•10 minutes
- Skinned Multi-Person Linear Model (SMPL) Part 2•10 minutes
- Governing Equations in Physics-Based Simulation•3 minutes
- Partial Differential Equations (PDEs)•3 minutes
- Numerical Methods for Solving PDEs•10 minutes
- Comparison of Methods•2 minutes
1 assignment•Total 30 minutes
- Module 11 Quiz•30 minutes
This module introduces Neural Radiance Fields (NeRF), a deep learning-based approach for synthesizing novel views of complex 3D scenes. Unlike traditional 3D reconstruction techniques such as Structure-from-Motion (SfM) and Multi-View Stereo (MVS), which rely on explicit point cloud representations, NeRF learns a continuous volumetric representation of a scene using a fully connected neural network. By taking a set of 2D images captured from different viewpoints, NeRF estimates the density and color of light rays at each spatial location, enabling high-quality, photorealistic novel view synthesis. The lecture also explores how NeRF improves upon prior methods, such as depth estimation, photogrammetry, and classic geometric techniques. Understanding NeRF provides valuable insights into data-efficient 3D scene representation—a critical area for applications in computer vision, robotics, virtual reality (VR), and augmented reality (AR).
What's included
1 video6 readings1 assignment
1 video•Total 3 minutes
- NeRF•3 minutes
6 readings•Total 35 minutes
- Introducing Neural Radiance Fields (NeRF)•10 minutes
- Volume Rendering•10 minutes
- Discrete Approximation in Volume Rendering•3 minutes
- NeRF Network Structure•5 minutes
- NeRF Extension: NeRV•5 minutes
- NeRF vs. NeRV•2 minutes
1 assignment•Total 25 minutes
- Module 12 Quiz•25 minutes
This module explores diffusion models, a class of generative models that incrementally add noise to data and then learn to reverse the process to reconstruct high-quality samples. Diffusion models have gained prominence due to their state-of-the-art performance in image, video, and text generation, surpassing GANs in terms of sample quality and diversity. The module covers the foundational principles of Denoising Diffusion Probabilistic Models (DDPMs) and their training objectives, advancements such as Score-Based Generative Models, Latent Diffusion Models (LDMs), and Classifier-Free Guidance techniques. We also examine their real-world applications in text-to-image generation (Stable Diffusion, DALL·E), video synthesis (Sora, Veo 2), and high-resolution image synthesis. Finally, the module provides insights into the mathematical framework, the optimization strategies, and the growing role of diffusion models in AI-driven content creation.
What's included
1 video11 readings1 assignment
1 video•Total 5 minutes
- Introduction to Diffusion Models•5 minutes
11 readings•Total 98 minutes
- Forward and Reverse Diffusion in Denoising•5 minutes
- Components of Denoising Diffusion Models•10 minutes
- Loss Decomposition and Noise Levels•10 minutes
- Variance Schedule and Training Steps•5 minutes
- The Rapidly Evolving Field of DDM•5 minutes
- Foundational Understanding of Diffusion Models•10 minutes
- Key Model Variants and Improvements•10 minutes
- Guided and Conditional Generation•10 minutes
- Video Diffusion Models I•15 minutes
- Video Diffusion Models II•10 minutes
- Video Diffusion Models III•8 minutes
1 assignment•Total 20 minutes
- Module 13 Quiz•20 minutes
This lecture explores 3D Gaussian Splatting (3DGS), a novel approach in computer vision for high-fidelity, real-time 3D scene rendering. Unlike traditional methods like Neural Radiance Fields (NeRF), which rely on continuous neural fields, 3DGS represents scenes using a collection of discrete anisotropic Gaussian functions. These Gaussians efficiently approximate scene geometry, radiance, and depth, enabling real-time rendering with minimal computational overhead. We discuss the theoretical foundations, mathematical formulations, and rendering techniques that make 3D Gaussian Splatting a game-changer in virtual reality (VR), augmented reality (AR), and interactive media. Additionally, we highlight key differences between isotropic and anisotropic Gaussian splats, their impact on rendering quality, and how optimization techniques refine their accuracy. Finally, we compare 3DGS to NeRF, analyzing their trade-offs in rendering speed, computational efficiency, and application suitability.
What's included
1 video6 readings1 assignment
1 video•Total 6 minutes
- 3D Gaussian Spatting•6 minutes
6 readings•Total 52 minutes
- Introducing 3D Gaussian Spatting•5 minutes
- Isotropic & Anisotropic in 3DGS•15 minutes
- Key Concepts & Methodology in 3DGS•10 minutes
- Optimization & Training in 3DGS•5 minutes
- NeRF versus 3DGS•15 minutes
- Congratulations! •2 minutes
1 assignment•Total 25 minutes
- Module 14 Quiz•25 minutes
Instructor

Offered by
Why people choose Coursera for their career

Felipe M.

Jennifer J.

Larry W.

