This course is designed to equip data engineers with the skills to build scalable and efficient data pipelines using Scala and Spark. Data engineers will learn best practices for development, testing, and deployment in cloud environments, with a focus on optimizing performance and ensuring data quality. The course provides the necessary tools to transform raw data into actionable insights, making it highly relevant in today’s data-driven world.

Data Engineering with Scala and Spark

Data Engineering with Scala and Spark

Instructor: Packt - Course Instructors
Access provided by SR University
Gain insight into a topic and learn the fundamentals.
Intermediate level
Recommended experience
2 weeks to complete
at 10 hours a week
Flexible schedule
Learn at your own pace
What you'll learn
Set up a development environment for building data pipelines in Scala
Use Spark DataFrames, Datasets, and SQL with Scala for data processing
Profile and clean data using Deequ for improved data quality
Skills you'll gain
Tools you'll learn
Details to know

Shareable certificate
Add to your LinkedIn profile
Assessments
13 assignments
Taught in English
Recently updated!
March 2026
See how employees at top companies are mastering in-demand skills

There are 13 modules in this course
Instructor

Offered by
Why people choose Coursera for their career

Felipe M.
Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

Jennifer J.
Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

Larry W.
Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

Chaitanya A.
"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."





