Machine Learning with Small Data Part 2

Machine Learning with Small Data Part 2

Instructor: Sarah Ostadabbas

Access provided by Anima Educacao

7 modules

Gain insight into a topic and learn the fundamentals.

1 week to complete

at 10 hours a week

Flexible schedule

Learn at your own pace

7 modules

Gain insight into a topic and learn the fundamentals.

1 week to complete

at 10 hours a week

Flexible schedule

Learn at your own pace

Skills you'll gain

Tools you'll learn

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

7 assignments

Taught in English

See how employees at top companies are mastering in-demand skills

Learn more about Coursera for Business

logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

There are 7 modules in this course

By completing this course, you'll master building powerful machine learning systems that excel with limited data. You'll gain expertise in multi-task learning, meta-learning, and advanced data augmentation—from physics-based simulations to generative approaches—enabling models to adapt quickly and perform beyond their dataset size.

What makes this course unique is its focus on cutting-edge 3D and generative technologies: Neural Radiance Fields (NeRF), diffusion models, and 3D Gaussian Splatting. Unlike traditional ML courses that assume abundant data, this program tackles real-world constraints while unlocking advanced capabilities in science, engineering, and creative industries. This course is primarily aimed at graduate students in computer science, engineering, or data science, along with industry professionals and researchers working with limited datasets who need to develop high-performance machine learning systems despite data constraints.

In this module, we will introduce the fundamentals of Multi-Task Learning (MTL), a paradigm where multiple related tasks are learned simultaneously by sharing representations. This approach leverages the commonalities among tasks to improve generalization, reduce overfitting, and achieve better performance with fewer training examples. We will explore how MTL is applied across various domains, such as natural language processing, computer vision, and speech recognition, and examine practical examples such as using MTL to enhance image classification and object detection in autonomous systems. Students will gain insights into both the benefits and challenges of MTL, including issues such as task imbalance, negative transfer, and scalability. Additionally, we will delve into meta-learning techniques, such as Conditional Neural Adaptive Processes (CNAPs), that extend MTL by enabling models to quickly adapt to new tasks with minimal data.

What's included

1 video15 readings1 assignment

1 videoTotal 6 minutes

Multi-Task Learning6 minutes

15 readingsTotal 68 minutes

Course Introduction2 minutes
Syllabus - Machine Learning for Small Data Part 210 minutes
Academic Integrity1 minute
Introduction to Multi-Task Learning2 minutes
Examples of Multi-Task Learning5 minutes
Why Multi-Task Learning5 minutes
Key Challenges in MTL2 minutes
Meta-Learning and Few-Shot Learning for Multi-Task Learning5 minutes
An Overview of Conditional Neural Processes (CNPs)10 minutes
Conditional Neural Adaptive Processes (CNAPs)2 minutes
Adaptation Mechanisms of CNAPs10 minutes
CNAPs Balances Adaptation2 minutes
Key Extension in CNAPs5 minutes
CNAPs in Practice2 minutes
Adaptation Network for CNAPs5 minutes

1 assignmentTotal 20 minutes

Module 8 Quiz20 minutes

This module explores the concept of meta-learning, or "learning to learn," which enables models to generalize across various tasks by leveraging knowledge from similar tasks. We will delve into key meta-learning algorithms such as Model-Agnostic Meta-Learning (MAML) and Prototypical Networks and examine their applications in computer vision using datasets such as ImageNet, Omniglot, CUB-200-2011, and FGVC-Aircraft. The module also covers the Meta-Dataset framework, which provides a diverse range of tasks for training robust and adaptable meta-learning models.

What's included

1 video7 readings1 assignment

1 videoTotal 4 minutes

Meta Learning4 minutes

7 readingsTotal 36 minutes

What is Meta-Learning?3 minutes
Model-Agnostic Meta-Learning (MAML)5 minutes
Prototypical Networks5 minutes
Beyond Simple Meta-Learning 3 minutes
Mathematical Formulation of Meta-Learning5 minutes
Mathematical Formulation of Transductive Learning5 minutes
An Overview of Some Vision Meta-Datasets10 minutes

1 assignmentTotal 15 minutes

Module 9 Quiz15 minutes

This module focuses on generative models for data augmentation, covering key generative AI techniques that enhance machine learning applications by generating synthetic but realistic data. We begin by introducing generative adversarial networks (GANs), Variational Autoencoders (VAEs), Normalizing Flows, Diffusion Models, and Motion Graphs, highlighting their mathematical foundations, training mechanisms, and real-world applications. Additionally, we discuss the limitations of each model and the computational challenges they present. The lecture provides insights into how generative models contribute to modern AI systems, including image synthesis, domain adaptation, super-resolution, motion synthesis, and data augmentation in small-data learning scenarios.

What's included

1 video28 readings1 assignment

1 videoTotal 5 minutes

Learning with Data Augmentation: Data-Driven Simulation5 minutes

28 readingsTotal 152 minutes

Introduction to Generative Models10 minutes
Limitations of Generative Models for Data Augmentation5 minutes
Generative Adversarial Networks (GANs)10 minutes
Applications of Generative Models2 minutes
Vanilla GAN2 minutes
Conditional GAN (cGAN)5 minutes
Deep Convolutional GAN (DCGAN)5 minutes
Wasserstein GAN (WGAN)4 minutes
CycleGAN5 minutes
Progressive Growing of GANs (PGGAN)5 minutes
InfoGAN5 minutes
BigGAN5 minutes
Super-Resolution GAN (SRGAN)5 minutes
Text-to-Image GAN5 minutes
Autoencoder Basics5 minutes
Variational Autoencoders5 minutes
Probabilistic Encoder, Reparameterization Trick10 minutes
VAE Loss Function5 minutes
Vanilla VAE 2 minutes
Beta-VAE 5 minutes
Conditional VAE5 minutes
VQ-VAE5 minutes
Flow-Based Models10 minutes
Advancements in Flow-Based Generative Models Part 15 minutes
Advancements in Flow-Based Generative Models Part 25 minutes
Advancements in Flow-Based Generative Models Part 310 minutes
Diffusion Models5 minutes
Comparative Summary of Generative Models2 minutes

1 assignmentTotal 20 minutes

Module 10 Quiz20 minutes

This module focuses on physics-based simulation for data augmentation, exploring how physics-driven techniques generate realistic synthetic data to enhance machine learning models. We will discuss key advantages of physics-based simulations, such as scalability, cost-effectiveness, and their ability to model rare events. The module also covers notable approaches, including GeoNet (CVPR 2018) for depth and motion estimation, ScanAva (ECCVW 2018) for semi-supervised learning with 3D avatars, and SMPL (ACM Transactions on Graphics, Volume 15) for human body modeling. Additionally, we introduce equation-based simulation techniques such as Finite Element Method (FEM) and Navier-Stokes equations for modeling fluid dynamics. The module highlights challenges in bridging the simulation-to-reality gap and optimizing computational costs while ensuring high-fidelity synthetic data generation.

What's included

1 video10 readings1 assignment

1 videoTotal 5 minutes

Introduction to Physics-Based Simulation5 minutes

10 readingsTotal 64 minutes

Physics-Based Simulation3 minutes
GeoNet: Using Physical Relationship in Image Formation 10 minutes
Avatar-Based Simulation3 minutes
ScanAva10 minutes
Skinned Multi-Person Linear Model (SMPL)10 minutes
Skinned Multi-Person Linear Model (SMPL) Part 210 minutes
Governing Equations in Physics-Based Simulation3 minutes
Partial Differential Equations (PDEs)3 minutes
Numerical Methods for Solving PDEs10 minutes
Comparison of Methods2 minutes

1 assignmentTotal 30 minutes

Module 11 Quiz30 minutes

This module introduces Neural Radiance Fields (NeRF), a deep learning-based approach for synthesizing novel views of complex 3D scenes. Unlike traditional 3D reconstruction techniques such as Structure-from-Motion (SfM) and Multi-View Stereo (MVS), which rely on explicit point cloud representations, NeRF learns a continuous volumetric representation of a scene using a fully connected neural network. By taking a set of 2D images captured from different viewpoints, NeRF estimates the density and color of light rays at each spatial location, enabling high-quality, photorealistic novel view synthesis. The lecture also explores how NeRF improves upon prior methods, such as depth estimation, photogrammetry, and classic geometric techniques. Understanding NeRF provides valuable insights into data-efficient 3D scene representation—a critical area for applications in computer vision, robotics, virtual reality (VR), and augmented reality (AR).

What's included

1 video6 readings1 assignment

This module explores diffusion models, a class of generative models that incrementally add noise to data and then learn to reverse the process to reconstruct high-quality samples. Diffusion models have gained prominence due to their state-of-the-art performance in image, video, and text generation, surpassing GANs in terms of sample quality and diversity. The module covers the foundational principles of Denoising Diffusion Probabilistic Models (DDPMs) and their training objectives, advancements such as Score-Based Generative Models, Latent Diffusion Models (LDMs), and Classifier-Free Guidance techniques. We also examine their real-world applications in text-to-image generation (Stable Diffusion, DALL·E), video synthesis (Sora, Veo 2), and high-resolution image synthesis. Finally, the module provides insights into the mathematical framework, the optimization strategies, and the growing role of diffusion models in AI-driven content creation.

What's included

1 video11 readings1 assignment

1 videoTotal 5 minutes

Introduction to Diffusion Models5 minutes

11 readingsTotal 98 minutes

Forward and Reverse Diffusion in Denoising5 minutes
Components of Denoising Diffusion Models10 minutes
Loss Decomposition and Noise Levels10 minutes
Variance Schedule and Training Steps5 minutes
The Rapidly Evolving Field of DDM5 minutes
Foundational Understanding of Diffusion Models10 minutes
Key Model Variants and Improvements10 minutes
Guided and Conditional Generation10 minutes
Video Diffusion Models I15 minutes
Video Diffusion Models II10 minutes
Video Diffusion Models III8 minutes

1 assignmentTotal 20 minutes

Module 13 Quiz20 minutes

This lecture explores 3D Gaussian Splatting (3DGS), a novel approach in computer vision for high-fidelity, real-time 3D scene rendering. Unlike traditional methods like Neural Radiance Fields (NeRF), which rely on continuous neural fields, 3DGS represents scenes using a collection of discrete anisotropic Gaussian functions. These Gaussians efficiently approximate scene geometry, radiance, and depth, enabling real-time rendering with minimal computational overhead. We discuss the theoretical foundations, mathematical formulations, and rendering techniques that make 3D Gaussian Splatting a game-changer in virtual reality (VR), augmented reality (AR), and interactive media. Additionally, we highlight key differences between isotropic and anisotropic Gaussian splats, their impact on rendering quality, and how optimization techniques refine their accuracy. Finally, we compare 3DGS to NeRF, analyzing their trade-offs in rendering speed, computational efficiency, and application suitability.