Packt
Snowflake - Build and Architect Data Pipelines Using AWS
Packt

Snowflake - Build and Architect Data Pipelines Using AWS

Access provided by Reveille Foundation

Gain insight into a topic and learn the fundamentals.
Intermediate level

Recommended experience

2 weeks to complete
at 10 hours a week
Flexible schedule
Learn at your own pace
Gain insight into a topic and learn the fundamentals.
Intermediate level

Recommended experience

2 weeks to complete
at 10 hours a week
Flexible schedule
Learn at your own pace

What you'll learn

  • Architect and optimize scalable data pipelines with Snowflake and AWS.

  • Implement ingestion, transformation, and extraction workflows with best practices.

  • Deploy machine learning pipelines using Snowpark and real-time streaming with Kafka.

  • Ensure data governance with advanced security and compliance features.

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

14 assignments

Taught in English

See how employees at top companies are mastering in-demand skills

 logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

There are 14 modules in this course

In this module, we will set the stage for the entire course by outlining the roadmap, discussing the prerequisites, and sharing success strategies. These foundational insights will ensure you're well-prepared to navigate and excel in the upcoming material.

What's included

2 videos1 reading

In this module, we will explore the foundational concepts of data warehousing and its significance within a data ecosystem. We’ll take a closer look at Snowflake’s architecture, object hierarchy, and virtual warehouses. Additionally, you’ll learn about Snowflake’s billing components, tracking consumption, and setting up resource monitors, ensuring you’re equipped to manage resources effectively.

What's included

9 videos1 assignment1 plugin

In this module, we will delve into the various table types available in Snowflake, providing a comprehensive introduction to their structures and purposes. You’ll gain hands-on experience through labs focused on creating tables, views, and secure views. We’ll also explore the nuances of views, including materialized and secure views, to enhance your understanding of Snowflake's data presentation capabilities.

What's included

6 videos1 assignment1 plugin

In this module, we will examine Snowflake’s advanced data organization features, focusing on micro-partitions and clustering keys. Through hands-on labs, you’ll learn to select and configure clustering keys, analyze query profiles, and leverage caching mechanisms to enhance performance. Additionally, we’ll explore the benefits of search optimization to further streamline data retrieval and processing efficiency.

What's included

9 videos1 assignment1 plugin

In this module, we will explore the end-to-end processes for loading and extracting data in Snowflake. You'll learn how to connect Snowflake with AWS S3, ingest structured and semi-structured data, and implement continuous ingestion using Snowpipe. Additionally, we'll cover critical aspects such as billing estimation and key considerations to ensure efficient data operations. Hands-on labs will solidify your understanding of these concepts.

What's included

9 videos1 assignment1 plugin

In this module, we will delve into Snowflake's task management and query scheduling features. You'll learn how to create and manage tasks, build complex task trees for dependent workflows, and monitor their execution. We'll also explore billing insights and query history to ensure efficient and cost-effective operations. Through hands-on labs, you’ll gain practical skills in implementing and optimizing tasks in Snowflake.

What's included

4 videos1 assignment1 plugin

In this module, we will uncover the power of streams in Snowflake for implementing Change Data Capture (CDC) workflows. You'll learn how to use standard and append-only streams, manage data retention, and handle stream staleness. Through a series of labs and a project, you’ll create and implement end-to-end pipelines that leverage streams to track and process data changes efficiently. This hands-on experience will solidify your understanding of CDC in modern data architectures.

What's included

11 videos1 assignment1 plugin

In this module, we will explore User-Defined Functions (UDFs) in Snowflake, a powerful feature for extending database functionality. You'll learn about different UDF types, including scalar, tabular, and JavaScript-based UDFs, and gain hands-on experience implementing them. Additionally, we'll discuss pushdown in UDFs and its impact, as well as best practices for writing secure UDFs to ensure data privacy and compliance.

What's included

7 videos1 assignment1 plugin

In this module, we will explore the capabilities of external functions in Snowflake for interacting with external systems. You’ll learn how to deploy AWS Lambda functions, create and secure API Gateway, and integrate these components with Snowflake to build external functions. Through hands-on labs, you will gain practical skills in configuring and deploying these powerful integrations for extending Snowflake’s functionality.

What's included

7 videos1 assignment1 plugin

In this module, we will explore how to integrate Snowflake with Python, Spark, and Airflow on AWS to build robust data engineering solutions. You’ll learn how to connect Snowflake with Python locally and on AWS Glue, parameterize scripts, and use Pandas for data manipulation. Additionally, we will dive into PySpark jobs, the pushdown optimization in Spark 3.1, and setting up Airflow for task orchestration. Hands-on labs will provide practical experience in deploying and automating workflows across these tools.

What's included

12 videos1 assignment1 plugin

In this module, we will focus on real-time streaming using Kafka and Snowflake. You'll learn to set up Kafka on your local system, configure the Kafka-Snowflake connector, and enable secure connectivity with encryption keys. Through hands-on labs, you'll implement streaming pipelines to ingest real-time data into Snowflake, solidifying your understanding of integrating modern streaming platforms with Snowflake.

What's included

6 videos1 assignment1 plugin

In this module, we will explore key features of Snowflake that ensure robust data protection and governance. You'll learn about Time Travel and Failsafe mechanisms for data recovery, and implement column-level dynamic data masking for safeguarding sensitive information. Additionally, we'll cover row-level security and guide you through hands-on labs to create and apply access policies, ensuring controlled and compliant data access.

What's included

6 videos1 assignment1 plugin

In this module, we will dive into Snowpark, Snowflake's powerful framework for building advanced data pipelines and supporting data science use cases. You'll gain hands-on experience with deploying Python UDFs, creating stored procedures for ETL tasks, and preparing data for machine learning. Furthermore, you will build and deploy model training and prediction pipelines using Scikit-Learn, all powered by Snowpark. Additional learning resources and a coupon code for extended exploration will also be provided.

What's included

9 videos1 assignment1 plugin

In this module, we will wrap up the course by reflecting on the key topics and skills covered. You’ll receive guidance on next steps, including updates on Snowflake's evolving features and additional learning opportunities. This final section will help you chart a path for continued growth and mastery of Snowflake and its ecosystem.

What's included

1 video2 assignments

Instructor

Packt - Course Instructors
Packt
1,031 Courses242,267 learners

Offered by

Packt

Why people choose Coursera for their career

Felipe M.
Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
Jennifer J.
Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
Larry W.
Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
Chaitanya A.
"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Explore more from Information Technology