When you enroll in this course, you'll also be enrolled in this Specialization.
Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate
There are 3 modules in this course
This intermediate course provides a practical, hands-on exploration of Databricks Governance, focusing on the essential tools and workflows for managing and securing your data lakehouse. You will learn to navigate and control access to your data assets using Unity Catalog, the foundation of Databricks governance. The course covers the core hierarchy of metastores, catalogs, schemas, and tables, and teaches you how to manage them programmatically using the Databricks Python SDK, CLI, and VS Code extension.
Beyond foundational access control, you will master the skills to implement modern CI/CD and MLOps practices directly within the Databricks environment. You'll learn to integrate Databricks Repos with GitHub, automate notebook testing and deployment with GitHub Actions, and understand the architectural considerations for managing machine learning models in production. Finally, you will explore how to ensure ongoing data reliability by setting up and understanding Lakehouse Monitoring for data quality and freshness.
This course is unique because it moves beyond theory, demonstrating how to apply these governance concepts with the actual tools and code used by data professionals. By the end, you'll be equipped to build, deploy, and monitor secure and reliable data pipelines and AI applications on the Databricks platform
This module establishes the foundation of Databricks governance
through Unity Catalog. You'll navigate the metastore-catalog-schema-
table hierarchy, set up role-based access control using service
principals and GRANT/REVOKE statements, and learn to manage your
governance setup programmatically with the Databricks Python SDK,
CLI, and VS Code extension.
What's included
16 videos9 readings1 assignment
Show info about module content
16 videos•Total 48 minutes
Course Introduction•1 minute
Introduction•0 minutes
Unity Catalog overview•5 minutes
Navigating the catalog hierarchy•5 minutes
Setting up your first Unity Catalog•6 minutes
Summary•0 minutes
Introduction•0 minutes
Introducing the Databricks Python SDK•7 minutes
Setting up the Databricks VS Code extension•3 minutes
Overview of the Databricks CLI•5 minutes
Summary•0 minutes
Introduction•0 minutes
Principals and configurations•3 minutes
Using the SDK to create a Service Principal•5 minutes
Writing REVOKE and GRANT statements•4 minutes
Summary•1 minute
9 readings•Total 17 minutes
About this course and your instructors•1 minute
Key terms•1 minute
Lab•5 minutes
Reflection•1 minute
Key terms•1 minute
Lab•5 minutes
Reflection•1 minute
Key terms•1 minute
Reflection•1 minute
1 assignment•Total 30 minutes
Quiz: Governance •30 minutes
CI/CD and MLOps
Module 2•2 hours to complete
Module details
This module covers the workflows that take Databricks code from a
developer's laptop to production. You'll integrate Databricks Repos
with GitHub using branching strategies and code review, automate
notebook testing and deployment with GitHub Actions, and build a
complete MLOps pipeline that serves a GenAI application through a
model serving endpoint.
What's included
16 videos9 readings1 assignment
Show info about module content
16 videos•Total 52 minutes
Introduction•0 minutes
Connecting Databricks to GitHub•5 minutes
Authenticating to GitHub•4 minutes
Branching strategies and code review•4 minutes
Summary•1 minute
Introduction•0 minutes
Running notebooks as jobs•6 minutes
Challenges with notebooks•6 minutes
Automating tests and runs with GitHub Actions•6 minutes
Summary•1 minute
Introduction•0 minutes
Overview of ML and AI capabilities•4 minutes
Creating a GenAI application•5 minutes
Creating a serving endpoint•6 minutes
MLOps architectural overview•4 minutes
Summary•0 minutes
9 readings•Total 17 minutes
Key terms•1 minute
Lab•5 minutes
Reflection•1 minute
Databricks Free Edition•1 minute
Key terms•1 minute
Lab•5 minutes
Reflection•1 minute
Key terms•1 minute
Reflection•1 minute
1 assignment•Total 30 minutes
Quiz: CI/CD and MLOps•30 minutes
Monitoring and quality
Module 3•1 hour to complete
Module details
This module closes the production loop with Lakehouse Monitoring.
You'll enable quality and freshness monitoring on Unity Catalog
tables, interpret monitoring results to detect data anomalies and
drift, and review the recommendations that turn a working pipeline
into a production-ready governance setup.
What's included
8 videos6 readings1 assignment
Show info about module content
8 videos•Total 20 minutes
Introduction•0 minutes
Data quality and freshness•3 minutes
Enabling monitoring for tables•3 minutes
Understanding monitoring results•5 minutes
Summary•0 minutes
Introduction•1 minute
Recommendations and next steps•5 minutes
Course Conclusion•1 minute
6 readings•Total 10 minutes
Key terms•1 minute
Lab•5 minutes
Reflection•1 minute
Capstone project•1 minute
Before You Go•1 minute
Next Steps•1 minute
1 assignment•Total 5 minutes
Final graded quiz•5 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
I'm already using the Databricks UI for governance. Why do I need the SDK or CLI?
While the UI is great for one-off tasks, managing governance at scale requires automation. This course teaches you how to use the SDK and CLI to programmatically manage users, permissions, and data assets, which is essential for integrating governance into your CI/CD pipelines and Infrastructure as Code practices.
I'm a data engineer, not a machine learning expert. Will the ML module be too advanced?
The ML module is designed to give data engineers the necessary context for working with ML teams. It focuses on the operational aspects—like setting up a serving endpoint and the overall MLOps architecture—that are relevant for integrating and supporting ML models within governed data pipelines.
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Specialization?
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Is financial aid available?
Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.