Back to Statistics for Data Science with Python
IBM

Statistics for Data Science with Python

This Statistics for Data Science course is designed to introduce you to the basic principles of statistical methods and procedures used for data analysis. After completing this course you will have practical knowledge of crucial topics in statistics including - data gathering, summarizing data using descriptive statistics, displaying and visualizing data, examining relationships between variables, probability distributions, expected values, hypothesis testing, introduction to ANOVA (analysis of variance), regression and correlation analysis. You will take a hands-on approach to statistical analysis using Python and Jupyter Notebooks – the tools of choice for Data Scientists and Data Analysts. At the end of the course, you will complete a project to apply various concepts in the course to a Data Science problem involving a real-life inspired scenario and demonstrate an understanding of the foundational statistical thinking and reasoning. The focus is on developing a clear understanding of the different approaches for different data types, developing an intuitive understanding, making appropriate assessments of the proposed methods, using Python to analyze our data, and interpreting the output accurately. This course is suitable for a variety of professionals and students intending to start their journey in data and statistics-driven roles such as Data Scientists, Data Analysts, Business Analysts, Statisticians, and Researchers. It does not require any computer science or statistics background. We strongly recommend taking the Python for Data Science course before starting this course to get familiar with the Python programming language, Jupyter notebooks, and libraries. An optional refresher on Python is also provided. After completing this course, a learner will be able to: ✔Calculate and apply measures of central tendency and measures of dispersion to grouped and ungrouped data. ✔Summarize, present, and visualize data in a way that is clear, concise, and provides a practical insight for non-statisticians needing the results. ✔Identify appropriate hypothesis tests to use for common data sets. ✔Conduct hypothesis tests, correlation tests, and regression analysis. ✔Demonstrate proficiency in statistical analysis using Python and Jupyter Notebooks.

Status: Statistical Analysis
Status: Descriptive Statistics
Course13 hours

Featured reviews

AS

5.0Reviewed May 2, 2022

It is few of the Data Science courses in my learning series. This is one of the Best in Series. Thanks to the team.

OA

4.0Reviewed Apr 4, 2021

I highly recommend this course for anyone that is having problems with basic statisitcs.

RS

4.0Reviewed Apr 6, 2021

The videos, readings, and labs were not sufficient for me to feel prepared for the assessments. I ended up using outside resources just to understand what was being presented here.

KA

5.0Reviewed Jun 10, 2022

A very good course to clear the basics pf stat of statistics for data science

ED

5.0Reviewed Nov 19, 2020

Excellent course to help clear doubts for the level of statistics needed for data science. It a great experience. well done IBM!

HD

5.0Reviewed Jan 13, 2021

A well structured course, simple and direct to the point, with a little of exercising you'll come out with a huge understanding of the statistical concepts.

FH

5.0Reviewed Jan 2, 2025

This is the best course for the students who want to be a data scientist.

JL

5.0Reviewed Jan 19, 2021

The final assignment is very well designed, I was able to review the entire course material and consolidate the learning. I have now a good understanding of hypothesis testing.

AY

5.0Reviewed Jan 12, 2021

One of the best course I have taken online. Way of teaching was outstanding.

MH

5.0Reviewed Sep 1, 2021

A worth-to-try course if you are curious about implementing some statistical tests in Python.

TS

4.0Reviewed Mar 1, 2021

very interesting course, however, IBM Watson Studio was difficult to use

SM

5.0Reviewed Dec 1, 2022

It is an amazing and useful course about the basics of statistics in data science. I learn many things.