IBM

Data Analysis with Python

Access provided by MUST University

637,059 already enrolled

Gain insight into a topic and learn the fundamentals.

19,666 reviews

Intermediate level

Recommended experience

Flexible schedule
2 weeks at 10 hours a week
Learn at your own pace
94%
Most learners liked this course
Gain insight into a topic and learn the fundamentals.

19,666 reviews

Intermediate level

Recommended experience

Flexible schedule
2 weeks at 10 hours a week
Learn at your own pace
94%
Most learners liked this course

What you'll learn

  • Construct Python programs to clean and prepare data for analysis by addressing missing values, formatting inconsistencies, normalization, and binning

  • Analyze real-world datasets through exploratory data analysis (EDA) using libraries such as Pandas, NumPy, and SciPy to uncover patterns and insights

  • Apply data operation techniques using dataframes to organize, summarize, and interpret data distributions, correlation analysis, and data pipelines

  • Develop and evaluate regression models using Scikit-learn, and use these models to generate predictions and support data-driven decision-making

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

12 assignments¹

AI Graded see disclaimer
Taught in English

See how employees at top companies are mastering in-demand skills

 logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Build your subject-matter expertise

This course is available as part of
When you enroll in this course, you'll also be asked to select a specific program.
  • Learn new concepts from industry experts
  • Gain a foundational understanding of a subject or tool
  • Develop job-relevant skills with hands-on projects
  • Earn a shareable career certificate

There are 6 modules in this course

This module introduces the foundational skills required to begin data analysis using Python. You will learn how to understand dataset structures, identify key variables, and import data from different sources using Python libraries such as Pandas and NumPy. The module also explores how to retrieve data from databases using SQLite and perform basic dataset exploration. Through hands-on labs, you will practice importing and examining real-world datasets such as laptop pricing and used car pricing.

What's included

6 videos3 readings2 assignments2 app items

This module focuses on preparing data for analysis through essential data wrangling techniques. You will learn how to clean, transform, and format datasets by handling missing values, converting data types, normalizing numerical values, and creating bins for analysis. The module also introduces methods for transforming categorical variables into numerical representations suitable for modeling. Through hands-on exercises, you will apply these techniques to real-world datasets.

What's included

6 videos2 readings2 assignments2 app items

This module develops your ability to analyze and understand datasets through exploratory data analysis techniques. You will learn how to calculate descriptive statistics, perform correlation analysis, and apply grouping techniques to uncover relationships between variables. The module also introduces data visualization methods and statistical tests such as the chi-square test for categorical variables. Through practical labs, you will analyze datasets to identify trends, patterns, and potential insights.

What's included

5 videos3 readings2 assignments2 app items1 plugin

This module introduces the fundamentals of building predictive models using regression techniques. You will learn how to construct simple linear, multiple linear, and polynomial regression models to analyze relationships between variables. The module also covers methods for evaluating model performance using metrics such as R-squared and Mean Squared Error. Visualization techniques such as residual plots and KDE plots are used to assess how well models fit the data.

What's included

6 videos3 readings2 assignments2 app items

This module focuses on improving model performance through evaluation and optimization techniques. You will learn how to detect overfitting and underfitting and apply strategies to improve model generalization. The module introduces ridge regression and hyperparameter tuning using grid search to refine predictive models. Through hands-on labs, you will evaluate and improve regression models using real-world datasets.

What's included

4 videos3 readings2 assignments2 app items

In this module, you will apply the full data analysis workflow learned throughout the course. You will import, clean, analyze, and model real-world datasets to generate insights and predictions. The module includes a practice project and a final project that simulate real data analysis scenarios. You will also complete a final exam to demonstrate your understanding of key concepts in Python-based data analysis.

What's included

7 readings2 assignments3 app items

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructor

Instructor ratings
(3,302 ratings)
Joseph Santarcangelo
IBM
37 Courses2,430,245 learners

Offered by

IBM

Why people choose Coursera for their career

Felipe M.

Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

Jennifer J.

Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

Larry W.

Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Learner reviews

  • 5 stars

    76.39%

  • 4 stars

    18.18%

  • 3 stars

    3.65%

  • 2 stars

    0.92%

  • 1 star

    0.83%

Showing 3 of 19666

LM

Reviewed on Mar 9, 2020

BM

Reviewed on Jul 16, 2020

AA

Reviewed on Dec 8, 2021

¹ Some assignments in this course are AI-graded. For these assignments, your data will be used in accordance with Coursera's Privacy Notice.