In this first course of the Python, Bash and SQL Essentials for Data Engineering Specialization, you will learn how to set up a version-controlled Python working environment which can utilize third party libraries. You will learn to use Python and the powerful Pandas library for data analysis and manipulation. Additionally, you will also be introduced to Vim and Visual Studio Code, two popular tools for writing software. This course is valuable for beginning and intermediate students in order to begin transforming and manipulating data as a data engineer.



Python and Pandas for Data Engineering
This course is part of Python, Bash and SQL Essentials for Data Engineering Specialization



Instructors: Kennedy Behrman
Access provided by Grow with Google - Internal Portal
28,246 already enrolled
(257 reviews)
Recommended experience
What you'll learn
- Setup a provisioned Python project environment 
- Use Pandas libraries to read and write data into data structures and files 
- Employ Vim and Visual Studio Code to write Python code 
Skills you'll gain
Details to know

Add to your LinkedIn profile
25 assignments
See how employees at top companies are mastering in-demand skills

Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate

There are 4 modules in this course
In this module, you will learn how to set up an isolated Python environment with third party libraries and apply it by setting up a virtual environment including Pandas and Jupyter.
What's included
15 videos13 readings6 assignments1 discussion prompt6 ungraded labs
In this module, you will learn how to create and use Python Sequences, Dictionaries, Sets, List Comprehensions, and Generators. Additionally, you will learn how to apply these by manipulating client data in a Jupyter notebook.
What's included
12 videos6 readings8 assignments5 ungraded labs
In this module, you will learn how to load data into a Pandas DataFrame and write statements to select columns and rows from a DataFrame. Additionally, you will apply comparison and boolean operators as a method of selecting data.
What's included
10 videos7 readings4 assignments8 ungraded labs
This week, you will learn the basics of some popular development environments and apply it by writing code in Vim and Visual Studio Code. Additionally, you will learn how to check your code into a Git repository.
What's included
12 videos8 readings7 assignments8 ungraded labs
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Offered by
Why people choose Coursera for their career




Learner reviews
257 reviews
- 5 stars74.31% 
- 4 stars17.89% 
- 3 stars3.89% 
- 2 stars1.55% 
- 1 star2.33% 
Showing 3 of 257
Reviewed on Jun 29, 2022
Good for quick basics of working with bash, github, python, virtual environments and such
Reviewed on Dec 12, 2022
Good lectures good knowledge and very interesting communicationskills
Reviewed on Jan 3, 2024
Exceptional course content, would recommend to anyone who got a break from hands on data engineering core skills !
Explore more from Data Science
 - Fractal Analytics 
 - Duke University 



