In this first course of the Python, Bash and SQL Essentials for Data Engineering Specialization, you will learn how to set up a version-controlled Python working environment that can use third-party libraries. You will learn to use Python and the Pandas library for data analysis and manipulation, and you will be introduced to Vim and Visual Studio Code, two popular tools for writing software. This course helps beginning and intermediate students start transforming and manipulating data as data engineers.



Python and Pandas for Data Engineering
This course is part of the Python, Bash and SQL Essentials for Data Engineering Specialization



Instructors: Kennedy Behrman
28,319 already enrolled
(258 reviews)
What you'll learn
- Set up a provisioned Python project environment
- Use the Pandas library to read data into data structures and write it to files
- Employ Vim and Visual Studio Code to write Python code 
25 assignments
There are 4 modules in this course
In this module, you will learn how to set up an isolated Python environment with third-party libraries and apply this by creating a virtual environment that includes Pandas and Jupyter.
What's included
15 videos, 13 readings, 6 assignments, 1 discussion prompt, 6 ungraded labs
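To make the idea of an isolated environment concrete, here is a minimal sketch using Python's standard-library `venv` module. It is not taken from the course materials; the helper name `create_isolated_env` and the `.venv` directory name are illustrative assumptions, and the environment is created without pip so the sketch runs offline.

```python
import tempfile
import venv
from pathlib import Path


def create_isolated_env(parent_dir):
    """Create a virtual environment under parent_dir and return its path.

    The function name and the ".venv" directory name are illustrative choices.
    """
    env_dir = Path(parent_dir) / ".venv"
    # with_pip=False skips bootstrapping pip, which keeps the sketch quick;
    # a real project would normally want pip so it can install libraries.
    venv.create(env_dir, with_pip=False)
    return env_dir


# Build the environment in a throwaway directory and confirm it was created.
with tempfile.TemporaryDirectory() as tmp:
    env_dir = create_isolated_env(tmp)
    # Every virtual environment contains a pyvenv.cfg file at its root.
    has_config = (env_dir / "pyvenv.cfg").exists()
    print(has_config)
```

In day-to-day work, the same result usually comes from running `python -m venv .venv` in a terminal, activating the environment, and then installing Pandas and Jupyter into it with pip.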
In this module, you will learn how to create and use Python sequences, dictionaries, sets, list comprehensions, and generators. Additionally, you will learn how to apply these by manipulating client data in a Jupyter notebook.
What's included
12 videos, 6 readings, 8 assignments, 5 ungraded labs
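As a taste of the data structures named in this module, the following sketch applies a list comprehension, a set comprehension, a dictionary comprehension, and a generator to a small list of client records. The records, field names, and the `large_balances` helper are made up for illustration and are not from the course.

```python
# Hypothetical client records; the fields are illustrative assumptions.
clients = [
    {"name": "Ada", "region": "east", "balance": 120.0},
    {"name": "Grace", "region": "west", "balance": 80.5},
    {"name": "Linus", "region": "east", "balance": 42.0},
]

# List comprehension: names of clients in the east region, in input order.
east_names = [c["name"] for c in clients if c["region"] == "east"]

# Set comprehension: the distinct regions that appear in the data.
regions = {c["region"] for c in clients}

# Dictionary comprehension: look up a balance by client name.
balance_by_name = {c["name"]: c["balance"] for c in clients}


def large_balances(records, threshold):
    """Generator that lazily yields names whose balance meets the threshold."""
    for record in records:
        if record["balance"] >= threshold:
            yield record["name"]


# A generator produces values only when consumed, here by list().
large = list(large_balances(clients, 80))

print(east_names, sorted(regions), balance_by_name["Grace"], large)
```

The generator is the only lazy construct here: nothing inside `large_balances` runs until something iterates over it, which matters when the input is too large to hold in memory at once.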
In this module, you will learn how to load data into a Pandas DataFrame and write statements to select columns and rows from a DataFrame. Additionally, you will apply comparison and Boolean operators to select data.
What's included
10 videos, 7 readings, 4 assignments, 8 ungraded labs
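The selection techniques this module describes can be sketched as follows, assuming Pandas is installed. The DataFrame contents and column names are invented for illustration; the course's own datasets are not shown on this page.

```python
import pandas as pd

# A small in-memory DataFrame; the columns and values are illustrative.
df = pd.DataFrame(
    {
        "name": ["Ada", "Grace", "Linus"],
        "region": ["east", "west", "east"],
        "balance": [120.0, 80.5, 42.0],
    }
)

# Select one column (a Series) and several columns (a DataFrame).
names = df["name"]
subset = df[["name", "balance"]]

# Select rows by position: the first two rows.
first_two = df.iloc[0:2]

# Combine comparison operators with & (and) to build a Boolean mask.
# Each comparison needs its own parentheses because & binds tightly.
mask = (df["region"] == "east") & (df["balance"] > 100)

# Use the mask to pick rows and a list of labels to pick columns.
selected = df.loc[mask, ["name", "balance"]]

print(selected)
```

Note that Pandas uses `&`, `|`, and `~` for element-wise logic on masks; the Python keywords `and`, `or`, and `not` raise an error when applied to a whole Series.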
In this module, you will learn the basics of some popular development environments and apply them by writing code in Vim and Visual Studio Code. Additionally, you will learn how to check your code into a Git repository.
What's included
12 videos, 8 readings, 7 assignments, 8 ungraded labs
Learner reviews
258 reviews
- 5 stars: 74.41%
- 4 stars: 17.82%
- 3 stars: 3.87%
- 2 stars: 1.55%
- 1 star: 2.32%
Reviewed on Aug 15, 2022
Great introduction to how to set up your python environment.
Reviewed on Jul 4, 2022
It's all overview tool that need to use for work in data engineer
Reviewed on Oct 26, 2023
It's an excellent course that covers all major areas for Python development