University of Colorado Boulder
Statistical Inference and Hypothesis Testing in Data Science Applications

Instructor: Jem Corcoran

7,658 already enrolled

Gain insight into a topic and learn the fundamentals.
Rating: 4.6 (50 reviews)

Intermediate level

4 weeks to complete at 10 hours a week
Flexible schedule: learn at your own pace

What you'll learn

  • Define a composite hypothesis and the level of significance for a test with a composite null hypothesis.

  • Define a test statistic, level of significance, and the rejection region for a hypothesis test. Give the form of a rejection region.

  • Perform tests concerning a true population variance.

  • Compute the sampling distributions for the sample mean and sample minimum of the exponential distribution.
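
As an illustration of the last objective, the sketch below uses simulation to check a standard result: the minimum of n independent exponential random variables with rate λ is itself exponential with rate nλ, so its mean is 1/(nλ). The function names, the sample size, and the rate are illustrative assumptions and are not taken from the course materials.

```python
import random


def theoretical_min_mean(n, rate):
    """Mean of the minimum of n iid Exponential(rate) draws: 1 / (n * rate)."""
    return 1.0 / (n * rate)


def simulated_min_mean(n, rate, reps=20000, seed=0):
    """Monte Carlo estimate of the mean of the sample minimum."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(reps):
        # One replication: draw n exponentials and keep the smallest.
        total += min(rng.expovariate(rate) for _ in range(n))
    return total / reps


if __name__ == "__main__":
    n, rate = 5, 2.0  # hypothetical values chosen only for illustration
    print("theory:    ", theoretical_min_mean(n, rate))
    print("simulation:", simulated_min_mean(n, rate))
```

The two printed values should agree closely, and the agreement tightens as `reps` grows.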

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

1 quiz, 5 assignments

Taught in English

Build your subject-matter expertise

This course is part of the Data Science Foundations: Statistical Inference Specialization
When you enroll in this course, you'll also be enrolled in this Specialization.
  • Learn new concepts from industry experts
  • Gain a foundational understanding of a subject or tool
  • Develop job-relevant skills with hands-on projects
  • Earn a shareable career certificate

There are 6 modules in this course

Welcome to the course! This module contains logistical information to get you started!

What's included

3 readings, 1 discussion prompt, 1 ungraded lab

In this module, we will define a hypothesis test and develop the intuition behind designing a test. We will learn the language of hypothesis testing, which includes definitions of a null hypothesis, an alternative hypothesis, and the level of significance of a test. We will walk through a very simple test.
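
To make the vocabulary concrete, here is a minimal sketch of one simple test: an upper-tailed z-test of H0: μ = μ0 against H1: μ > μ0 for a normal sample with known standard deviation σ. The choice of this particular test, the function name, and the numbers in the usage note are assumptions for illustration; the course's own first example may differ.

```python
from math import sqrt
from statistics import NormalDist


def upper_tail_z_test(xbar, mu0, sigma, n, alpha=0.05):
    """Test H0: mu = mu0 against H1: mu > mu0 with known sigma.

    Returns (reject, cutoff). The rejection region has the form
    xbar > cutoff, where cutoff = mu0 + z_alpha * sigma / sqrt(n).
    """
    z_alpha = NormalDist().inv_cdf(1.0 - alpha)  # upper-alpha critical value
    cutoff = mu0 + z_alpha * sigma / sqrt(n)
    return xbar > cutoff, cutoff
```

For example, `upper_tail_z_test(10.7, 10.0, 2.0, 25)` rejects H0 at level 0.05, while a sample mean of 10.5 with the same settings does not.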

What's included

6 videos, 11 readings, 1 quiz, 1 programming assignment, 2 ungraded labs

In this module, we will expand the lessons of Module 1 to composite hypotheses for both one- and two-tailed tests. We will define the “power function” for a test and discuss its interpretation and how it can lead to the idea of a “uniformly most powerful” test. We will discuss and interpret “p-values” as an alternative approach to hypothesis testing.
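
The power function and the p-value can both be written down explicitly for an upper-tailed z-test of H0: μ ≤ μ0 against H1: μ > μ0 with known σ. The sketch below assumes that setting; the function names and the example numbers are hypothetical and do not come from the course.

```python
from math import sqrt
from statistics import NormalDist

STD_NORMAL = NormalDist()


def power(mu, mu0, sigma, n, alpha=0.05):
    """Probability of rejecting H0 when the true mean is mu.

    The test rejects when xbar > mu0 + z_alpha * sigma / sqrt(n), so the
    power is P(Z > z_alpha - (mu - mu0) * sqrt(n) / sigma).
    """
    z_alpha = STD_NORMAL.inv_cdf(1.0 - alpha)
    shift = (mu - mu0) * sqrt(n) / sigma
    return 1.0 - STD_NORMAL.cdf(z_alpha - shift)


def p_value(xbar, mu0, sigma, n):
    """Upper-tail p-value: P(Z >= observed z) computed at mu = mu0."""
    z = (xbar - mu0) * sqrt(n) / sigma
    return 1.0 - STD_NORMAL.cdf(z)
```

At μ = μ0 the power equals α, which is the largest rejection probability over the composite null μ ≤ μ0, and it increases as μ moves further above μ0. Rejecting whenever `p_value(...)` is below α gives the same decision as the rejection-region form of the test.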

What's included

7 videos, 7 readings, 1 assignment, 1 programming assignment, 1 ungraded lab

In this module, we will learn about the chi-squared and t distributions and their relationships to sampling distributions. We will learn to identify when hypothesis tests based on these distributions are appropriate. We will review the concept of sample variance and derive the “t-test”. Additionally, we will derive our first two-sample test and apply it to make some decisions about real data.
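
As a sketch of the one-sample t-test mentioned above, the code below computes T = (x̄ − μ0) / (s / √n), which follows a t distribution with n − 1 degrees of freedom under H0 when the data are normal. Python's standard library has no t quantile function, so the critical value is passed in by the caller (from a table or a statistics package). The function names are illustrative assumptions.

```python
from math import sqrt
from statistics import mean, stdev


def t_statistic(sample, mu0):
    """One-sample t statistic using the sample standard deviation (n - 1 divisor)."""
    n = len(sample)
    return (mean(sample) - mu0) / (stdev(sample) / sqrt(n))


def two_sided_t_test(sample, mu0, critical_value):
    """Reject H0: mu = mu0 when |T| exceeds the supplied critical value.

    critical_value should be the upper alpha/2 point of the t distribution
    with len(sample) - 1 degrees of freedom, e.g. 2.776 for alpha = 0.05, n = 5.
    """
    t = t_statistic(sample, mu0)
    return abs(t) > critical_value, t
```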

What's included

7 videos, 7 readings, 1 assignment, 1 programming assignment, 1 ungraded lab

In this module, we will consider some problems where the assumption of an underlying normal distribution is not appropriate and will expand our ability to construct hypothesis tests for this case. We will define the concept of a “uniformly most powerful” (UMP) test, discuss whether such a test exists for specific problems, and revisit some of our earlier tests from Modules 1 and 2 through the UMP lens. We will also introduce the F-distribution and its role in testing whether two population variances are equal.
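
For the variance comparison, a minimal sketch: with two independent normal samples, the ratio of sample variances S1²/S2² follows an F distribution with (n1 − 1, n2 − 1) degrees of freedom when the population variances are equal. The standard library has no F quantiles, so the two critical values are supplied by the caller; the function names are assumptions for illustration.

```python
from statistics import variance


def variance_ratio(sample1, sample2):
    """F statistic: ratio of the two sample variances (n - 1 divisors)."""
    return variance(sample1) / variance(sample2)


def two_sided_f_test(sample1, sample2, lower_crit, upper_crit):
    """Reject H0: equal variances when F falls outside [lower_crit, upper_crit].

    The critical values are the lower and upper alpha/2 points of the
    F(n1 - 1, n2 - 1) distribution and must be looked up separately.
    """
    f = variance_ratio(sample1, sample2)
    return (f < lower_crit or f > upper_crit), f
```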

What's included

6 videos, 6 readings, 2 assignments

In this module, we will develop a formal approach to hypothesis testing based on a “likelihood ratio” that can be applied more generally than any of the tests we have discussed so far. We will pay special attention to the large-sample properties of the likelihood ratio, especially Wilks’ Theorem, which will allow us to construct approximate (but easy) tests when we have a large sample size. We will close the course with two chi-squared tests that can be used to check whether the distributional assumptions we have been making throughout this course are valid.
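
As one worked instance of a likelihood ratio test, consider testing H0: λ = λ0 for the rate of an exponential sample. The maximum likelihood estimate is 1/x̄, and the statistic simplifies to −2 ln Λ = 2n(λ0·x̄ − 1 − ln(λ0·x̄)). By Wilks' Theorem this is approximately chi-squared with 1 degree of freedom for large n. The sketch below assumes that setting; the function name is hypothetical, and the p-value uses the fact that a chi-squared(1) variable is the square of a standard normal.

```python
from math import log, sqrt
from statistics import NormalDist, mean


def lrt_exponential_rate(sample, rate0):
    """Return (-2 ln Lambda, approximate p-value) for H0: rate = rate0."""
    n = len(sample)
    u = rate0 * mean(sample)  # rate0 divided by the MLE 1 / xbar
    stat = max(2.0 * n * (u - 1.0 - log(u)), 0.0)  # clamp rounding noise at 0
    # P(chi2_1 > stat) = P(|Z| > sqrt(stat)) = 2 * (1 - Phi(sqrt(stat)))
    p = 2.0 * (1.0 - NormalDist().cdf(sqrt(stat)))
    return stat, p
```

The p-value is only an approximation justified for large samples; with small n an exact test based on the distribution of the sample sum would be preferable.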

What's included

5 videos, 5 readings, 1 assignment, 1 programming assignment, 1 ungraded lab

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Build toward a degree

This course is part of the following degree program(s) offered by University of Colorado Boulder. If you are admitted and enroll, your completed coursework may count toward your degree learning and your progress can transfer with you.

Instructor

Instructor ratings
4.9 (14 ratings)
Jem Corcoran
University of Colorado Boulder
7 courses, 39,190 learners

Learner reviews

4.6 (50 reviews)

  • 5 stars: 78%
  • 4 stars: 14%
  • 3 stars: 4%
  • 2 stars: 0%
  • 1 star: 4%

Showing 3 of 50

  • DP: 5 stars, reviewed on Feb 8, 2024
  • MM: 4 stars, reviewed on Jul 6, 2023
  • GV: 5 stars, reviewed on Jul 27, 2022