DeepLearning.AI
Data I/O and Preprocessing with Python and SQL

Instructor: Sean Barnes

Top Instructor

4,029 already enrolled

Gain insight into a topic and learn the fundamentals.
5.0

(11 reviews)

Beginner level

2 weeks to complete
at 10 hours a week
Flexible schedule
Learn at your own pace

What you'll learn

  • You’ll work with real-world data as it exists in practice: messy, unstructured, and spread across sources.

  • You’ll learn to extract data from websites, APIs, and databases, and clean it using both Python and SQL, an essential step in any analysis pipeline.

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

16 assignments

Taught in English
Recently updated!

May 2025

Build your Data Analysis expertise

This course is part of the DeepLearning.AI Data Analytics Professional Certificate
When you enroll in this course, you'll also be enrolled in this Professional Certificate.
  • Learn new concepts from industry experts
  • Gain a foundational understanding of a subject or tool
  • Develop job-relevant skills with hands-on projects
  • Earn a shareable career certificate from DeepLearning.AI

There are 4 modules in this course

This module introduces techniques for acquiring data from a wide range of sources, with a focus on web scraping and text processing. You'll begin by exploring how data flows into analysis pipelines and gain hands-on experience using tools like Pandas and Beautiful Soup to extract, clean, and structure data. You'll apply text preprocessing methods to handle missing values and parse HTML. Plus, you’ll consider the ethical implications of scraping data from the web.
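As a rough illustration of the parse-then-clean workflow described above, here is a minimal sketch. It uses the standard library's `html.parser` in place of Beautiful Soup so that it runs without extra installs; the HTML snippet, the column names, and the `clean_count` helper are all hypothetical and are not taken from the course materials.

```python
from html.parser import HTMLParser

# Made-up HTML standing in for a scraped page.
HTML = """
<table>
  <tr><th>team</th><th>fans</th></tr>
  <tr><td>Alpha</td><td>1,200</td></tr>
  <tr><td>Beta</td><td></td></tr>
</table>
"""

class TableParser(HTMLParser):
    """Collect the text of each <td>/<th> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None
        self._cell = None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Text outside a cell (whitespace between tags) is ignored.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None

def clean_count(text):
    """Turn '1,200' into 1200; an empty cell becomes None (missing)."""
    text = text.replace(",", "").strip()
    return int(text) if text else None

parser = TableParser()
parser.feed(HTML)
header, *body = parser.rows
records = [{header[0]: name, header[1]: clean_count(count)} for name, count in body]
print(records)
```

With Beautiful Soup the parsing class would typically collapse to a few `find_all` calls, and the resulting list of dictionaries could be passed to `pandas.DataFrame` for further cleaning.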

What's included

22 videos, 3 readings, 4 assignments, 1 programming assignment, 3 ungraded labs

This module focuses on acquiring data using APIs, as well as applying numerical cleaning techniques. You’ll learn how to retrieve data from web-based APIs, handle authentication securely, and transform raw JSON responses into usable dataframes. The module also covers techniques for cleaning and preparing numerical data, including scaling, binning, normalization, and outlier handling.
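The JSON-to-table and numerical-cleaning steps mentioned above might look roughly like the sketch below. It is an assumption-laden illustration, not course code: the payload and its field names are made up, the network call and authentication are omitted, and the 1.5 × IQR rule and min-max scaling are just two common choices among the techniques listed.

```python
import json
import statistics

# Made-up payload standing in for the body of an API response.
RAW_JSON = """
{"results": [
  {"id": 1, "reading": {"value": 10}},
  {"id": 2, "reading": {"value": 12}},
  {"id": 3, "reading": {"value": 11}},
  {"id": 4, "reading": {"value": 13}},
  {"id": 5, "reading": {"value": 100}}
]}
"""

# Flatten the nested JSON into one flat record per result.
payload = json.loads(RAW_JSON)
rows = [{"id": r["id"], "value": r["reading"]["value"]} for r in payload["results"]]
values = [row["value"] for row in rows]

# Flag outliers with the 1.5 * IQR rule.
q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
iqr = q3 - q1
low, high = q1 - 1.5 * iqr, q3 + 1.5 * iqr
outliers = [v for v in values if v < low or v > high]

# Min-max scale the remaining values to the range [0, 1].
kept = [v for v in values if low <= v <= high]
vmin, vmax = min(kept), max(kept)
scaled = [(v - vmin) / (vmax - vmin) for v in kept]

print(outliers, scaled)
```

In a notebook, the `rows` list could be handed to `pandas.DataFrame(rows)` to get a dataframe. An API key, when one is needed, is usually read from an environment variable rather than written into the code.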

What's included

17 videos, 2 readings, 4 assignments, 1 programming assignment, 3 ungraded labs

This module introduces the fundamentals of data storage and retrieval using databases and SQL. You’ll learn how data is structured in relational systems; explore core concepts like entities, relationships, and schemas; and gain hands-on experience writing SQL queries. You’ll also explore how to query databases from a Python notebook, as well as how generative AI tools can support SQL-based tasks.
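To make the idea of querying a relational database from Python concrete, here is a small sketch using the standard library's `sqlite3` module with an in-memory database. The `authors` and `books` tables, their columns, and the sample rows are hypothetical; the course may use a different database engine or connector.

```python
import sqlite3

# An in-memory database keeps the example self-contained.
conn = sqlite3.connect(":memory:")
conn.row_factory = sqlite3.Row  # lets each row be read by column name

# Two entities linked by a one-to-many relationship (hypothetical schema).
conn.executescript("""
CREATE TABLE authors (
    author_id INTEGER PRIMARY KEY,
    name      TEXT NOT NULL
);
CREATE TABLE books (
    book_id   INTEGER PRIMARY KEY,
    title     TEXT NOT NULL,
    year      INTEGER,
    author_id INTEGER REFERENCES authors(author_id)
);
INSERT INTO authors VALUES (1, 'Author A'), (2, 'Author B');
INSERT INTO books VALUES
    (1, 'First Book', 2001, 1),
    (2, 'Second Book', 2005, 1),
    (3, 'Other Book', 2010, 2);
""")

# A parameterized query: the ? placeholder avoids building SQL by string pasting.
rows = conn.execute(
    "SELECT title, year FROM books WHERE author_id = ? ORDER BY year",
    (1,),
).fetchall()
result = [dict(row) for row in rows]
print(result)
conn.close()
```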

What's included

15 videos, 3 readings, 4 assignments, 1 programming assignment, 2 ungraded labs

In this module, you’ll expand your SQL skills into data preprocessing, validation, and joins (combining tables). You’ll learn how to use SQL for filtering, conditional logic, and handling missing values, and apply validation techniques using aggregation and grouping. The module also explores different types of joins and demonstrates how to use them to combine and analyze data across multiple tables, with real-world scenarios such as analyzing sports performance data.
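The SQL techniques named above (missing-value handling, conditional logic, grouping, and joins) can be combined in a single query. The sketch below is one possible illustration using `sqlite3`; the `teams` and `players` tables and all of their values are invented for the example and do not come from the course's sports dataset.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE teams (team_id INTEGER PRIMARY KEY, name TEXT NOT NULL);
CREATE TABLE players (
    player_id INTEGER PRIMARY KEY,
    name      TEXT NOT NULL,
    team_id   INTEGER REFERENCES teams(team_id),
    points    INTEGER           -- NULL means the score was not recorded
);
INSERT INTO teams VALUES (1, 'Red'), (2, 'Blue'), (3, 'Green');
INSERT INTO players VALUES
    (1, 'Ana', 1, 12),
    (2, 'Ben', 1, NULL),
    (3, 'Cy',  2, 30);
""")

query = """
SELECT t.name,
       COUNT(p.player_id)         AS n_players,     -- counts matched players only
       SUM(COALESCE(p.points, 0)) AS total_points,  -- treat missing points as 0
       CASE WHEN COUNT(p.player_id) = 0
            THEN 'no roster' ELSE 'ok' END AS status
FROM teams AS t
LEFT JOIN players AS p ON p.team_id = t.team_id     -- keep teams with no players
GROUP BY t.team_id, t.name
ORDER BY t.team_id
"""
rows = conn.execute(query).fetchall()
print(rows)
conn.close()
```

Switching `LEFT JOIN` to `INNER JOIN` would drop the team with no players, which is a quick way to see how the join type changes the result. Replacing missing points with 0 is itself an assumption that may not suit every analysis.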

What's included

17 videos, 11 readings, 4 assignments, 2 programming assignments, 4 ungraded labs

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructor

Instructor ratings
5.0 (5 ratings)
Sean Barnes

Top Instructor

DeepLearning.AI
5 Courses
37,026 learners

Offered by

DeepLearning.AI

Learner reviews

5.0 (11 reviews)

  • 5 stars: 100%
  • 4 stars: 0%
  • 3 stars: 0%
  • 2 stars: 0%
  • 1 star: 0%

Showing 3 of 11

  • CC: 5 stars, reviewed on Oct 22, 2025
  • NR: 5 stars, reviewed on Jun 20, 2025
  • MN: 5 stars, reviewed on Jun 27, 2025
