In this first course of the Python, Bash and SQL Essentials for Data Engineering Specialization, you will learn how to set up a version-controlled Python working environment that can use third-party libraries. You will learn to use Python and the Pandas library for data analysis and manipulation. You will also be introduced to Vim and Visual Studio Code, two popular tools for writing software. This course is aimed at beginning and intermediate students who want to start transforming and manipulating data as data engineers.



Python and Pandas for Data Engineering
This course is part of the Python, Bash and SQL Essentials for Data Engineering Specialization



Instructors: Kennedy Behrman
28,317 already enrolled
(258 reviews)
What you'll learn
- Set up a provisioned Python project environment
- Use the Pandas library to read data into data structures and write it out to files
- Employ Vim and Visual Studio Code to write Python code 
25 assignments

There are 4 modules in this course
In this module, you will learn how to set up an isolated Python environment with third-party libraries, and you will apply this by setting up a virtual environment that includes Pandas and Jupyter.
What's included
15 videos, 13 readings, 6 assignments, 1 discussion prompt, 6 ungraded labs
In this module, you will learn how to create and use Python Sequences, Dictionaries, Sets, List Comprehensions, and Generators. Additionally, you will learn how to apply these by manipulating client data in a Jupyter notebook.
What's included
12 videos, 6 readings, 8 assignments, 5 ungraded labs
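As a rough illustration of the constructs this module names, the sketch below uses a list of dictionaries, a list comprehension, a set comprehension, a dictionary comprehension, and a generator. The client records and the `running_total` helper are hypothetical and are not taken from the course materials.

```python
# Hypothetical client records; the course's actual dataset is not shown here.
clients = [
    {"name": "Ana", "city": "Lisbon", "balance": 1200.0},
    {"name": "Bo", "city": "Oslo", "balance": 85.5},
    {"name": "Cy", "city": "Lisbon", "balance": 430.0},
]

# List comprehension: names of clients whose balance exceeds 100.
high_balance_names = [c["name"] for c in clients if c["balance"] > 100]

# Set comprehension: the distinct cities, with duplicates removed.
cities = {c["city"] for c in clients}

# Dictionary comprehension: map each client name to their balance.
balance_by_name = {c["name"]: c["balance"] for c in clients}


def running_total(records):
    """Generator: yield the cumulative balance one record at a time."""
    total = 0.0
    for record in records:
        total += record["balance"]
        yield total


# A generator is lazy; list() consumes it to materialize the values.
totals = list(running_total(clients))

print(high_balance_names)
print(sorted(cities))
print(totals)
```

The generator only computes each total when asked for the next value, which can matter when the input is too large to hold in memory at once.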
In this module, you will learn how to load data into a Pandas DataFrame and write statements to select columns and rows from a DataFrame. Additionally, you will apply comparison and boolean operators as a method of selecting data.
What's included
10 videos, 7 readings, 4 assignments, 8 ungraded labs
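The kind of selection this module describes might look like the following minimal sketch. It assumes Pandas is installed; the inline CSV text and its column names are made up for illustration and do not come from the course.

```python
import io

import pandas as pd

# Hypothetical data; an in-memory CSV stands in for a file on disk.
csv_text = """name,city,balance
Ana,Lisbon,1200.0
Bo,Oslo,85.5
Cy,Lisbon,430.0
"""

# Load the data into a DataFrame.
df = pd.read_csv(io.StringIO(csv_text))

# Select one column (a Series) or several columns (a DataFrame).
balances = df["balance"]
name_city = df[["name", "city"]]

# Select a row by position, and rows by a comparison on a column.
first_row = df.iloc[0]
lisbon_rows = df.loc[df["city"] == "Lisbon"]

# Combine comparisons with the boolean operator & (element-wise AND).
# Each comparison needs its own parentheses because & binds tightly.
mask = (df["city"] == "Lisbon") & (df["balance"] > 500)
selected = df[mask]

print(selected)
```

With a real file, `pd.read_csv` would take a path instead of the `io.StringIO` wrapper used here.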
In this module, you will learn the basics of some popular development environments and apply them by writing code in Vim and Visual Studio Code. Additionally, you will learn how to check your code into a Git repository.
What's included
12 videos, 8 readings, 7 assignments, 8 ungraded labs




Learner reviews
258 reviews
- 5 stars: 74.41%
- 4 stars: 17.82%
- 3 stars: 3.87%
- 2 stars: 1.55%
- 1 star: 2.32%
Reviewed on Mar 9, 2023
The lessons were laid out well and were easy to follow. I enjoyed how the labs reinforced the material that was covered.
Reviewed on Jul 4, 2022
It gives an overview of all the tools you need to work as a data engineer.
Reviewed on Dec 5, 2022
Don't let the intro videos with the odd presentation format dissuade you, the rest of the course is excellent and uses normal slides to teach! Very good refresher, thank you!



