This course introduces the linear regression model, a powerful tool that researchers use to measure the relationships among variables. We’ll begin by exploring the components of a bivariate regression model, which estimates the relationship between an independent variable and a dependent variable. Building on this foundation, we’ll then discuss how to create and interpret a multivariate model, a binary dependent variable model, and an interactive model. We’ll also consider how different types of variables, such as categorical and dummy variables, can be appropriately incorporated into a model. Throughout, we’ll discuss some of the many ways a regression model can be used for both descriptive and causal inference, as well as the limitations of this analytical tool. By the end of the course, you should be able to interpret and critically evaluate a multivariate regression analysis.

This course is part of the Data Literacy Specialization

## About this Course

### Offered by

#### Johns Hopkins University

The mission of The Johns Hopkins University is to educate its students and cultivate their capacity for life-long learning, to foster independent and original research, and to bring the benefits of discovery to the world.

## Syllabus - What you will learn from this course

## Regression Models: What They Are and Why We Need Them

While graphs are useful for visualizing relationships, they don't provide precise measures of the relationships between variables. Suppose you want to determine how an outcome of interest is expected to change when a related variable changes. You need more than a scatter plot to answer this question. What should you do, for example, if you want to calculate whether air quality changes when vehicle emissions decline? Or if you want to calculate how consumer purchasing behavior changes when a new tax policy is implemented? To calculate these predicted effects, you can use a regression model. This module first introduces correlation as an initial means of measuring the relationship between two variables. It then discusses prediction error as a framework for evaluating the accuracy of estimates. Finally, it introduces the linear regression model, a powerful tool for developing precise measures of how variables are related to each other.
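As a rough illustration of the three ideas in this module, the sketch below computes a correlation, fits a bivariate regression line, and compares prediction error with and without the model. The data, the variable names (`emissions`, `pollution`) and the helper functions are all hypothetical, and root mean squared error is assumed here as one possible way to summarize prediction error.

```python
from math import sqrt

# Hypothetical toy data (not from the course): a vehicle emissions index (x)
# and an air pollution reading (y) for five cities.
emissions = [10.0, 20.0, 30.0, 40.0, 50.0]
pollution = [12.0, 19.0, 33.0, 38.0, 53.0]


def mean(values):
    return sum(values) / len(values)


def correlation(x, y):
    """Pearson correlation: strength of the linear association, from -1 to 1."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def fit_line(x, y):
    """Least-squares intercept and slope for the line y = intercept + slope * x."""
    mx, my = mean(x), mean(y)
    slope = sum((a - mx) * (b - my) for a, b in zip(x, y)) / sum(
        (a - mx) ** 2 for a in x
    )
    return my - slope * mx, slope


def rmse(actual, predicted):
    """Root mean squared prediction error."""
    return sqrt(mean([(a - p) ** 2 for a, p in zip(actual, predicted)]))


r = correlation(emissions, pollution)
intercept, slope = fit_line(emissions, pollution)

# Compare two prediction rules: always guess the mean of y, or use the fitted line.
baseline_error = rmse(pollution, [mean(pollution)] * len(pollution))
model_error = rmse(pollution, [intercept + slope * x for x in emissions])

print(f"correlation: {r:.3f}")
print(f"fitted line: pollution = {intercept:.2f} + {slope:.2f} * emissions")
print(f"RMSE using the mean: {baseline_error:.2f}, using the line: {model_error:.2f}")
```

The slope is read as the expected change in the outcome for a one-unit change in the independent variable, which is the kind of precise measure a scatter plot alone cannot give.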

**3 hours to complete**

**5 videos**

**4 readings**

**4 practice exercises**

## Fitting and Evaluating a Bivariate Regression Model

Now that you've got a handle on the basics of regression analysis, the next step is to consider how to evaluate and modify a basic regression model. This module will introduce you to a common measure of model fit and the three core assumptions of regression analysis. In addition, we'll explore the special case of a regression analysis with a binary (also known as dummy) treatment variable. Dummy variables, which take on only two values, are frequently used in statistics. Understanding how to use and interpret dummy variables provides a foundation for developing a multivariate regression model, which we'll get to in the next module.
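To make the dummy-variable case concrete, here is a minimal sketch with made-up data. It assumes the dummy is coded 0 (control) and 1 (treatment), and it assumes R-squared as the measure of fit, since the module description does not name one. With a single dummy regressor, the fitted intercept equals the control-group mean and the slope equals the difference between the two group means.

```python
# Hypothetical toy data (not from the course): a 0/1 treatment dummy and an outcome.
treated = [0, 0, 0, 1, 1, 1]
outcome = [4.0, 5.0, 6.0, 7.0, 9.0, 11.0]


def mean(values):
    return sum(values) / len(values)


def fit_line(x, y):
    """Least-squares intercept and slope for the line y = intercept + slope * x."""
    mx, my = mean(x), mean(y)
    slope = sum((a - mx) * (b - my) for a, b in zip(x, y)) / sum(
        (a - mx) ** 2 for a in x
    )
    return my - slope * mx, slope


def r_squared(y, predicted):
    """Share of the variation in y that the fitted values account for."""
    my = mean(y)
    ss_res = sum((a - p) ** 2 for a, p in zip(y, predicted))
    ss_tot = sum((a - my) ** 2 for a in y)
    return 1 - ss_res / ss_tot


intercept, slope = fit_line(treated, outcome)
control_mean = mean([y for t, y in zip(treated, outcome) if t == 0])
treated_mean = mean([y for t, y in zip(treated, outcome) if t == 1])
fit = r_squared(outcome, [intercept + slope * t for t in treated])

print(f"intercept: {intercept:.2f} (control mean: {control_mean:.2f})")
print(f"slope: {slope:.2f} (difference in means: {treated_mean - control_mean:.2f})")
print(f"R-squared: {fit:.3f}")
```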

**3 hours to complete**

**4 readings**

**4 practice exercises**

## Multivariate Regression Models

The bivariate regression model is an essential building block of statistics, but in practice it is usually insufficient for descriptive, causal or predictive inference. This is because an outcome is usually affected by more than one variable. Whether you are modeling political behavior, environmental processes or drug treatment outcomes, it is almost always necessary to account for multiple influences on an outcome of interest. This module will introduce the multivariate model of regression analysis and explain the appropriate ways to interpret and evaluate the results from a multivariate analysis.
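The sketch below illustrates, under stated assumptions, why a bivariate model can fall short. The data are constructed (not real) so that `y = 1 + 2*x1 + 3*x2` exactly, with `x1` and `x2` correlated. The helper names `solve` and `ols` are hypothetical, and the fit uses the normal equations solved by Gauss-Jordan elimination, which is one simple way to compute least-squares coefficients, not necessarily the method used in the course.

```python
# Constructed toy data: y = 1 + 2*x1 + 3*x2 exactly, with x1 and x2 correlated.
x1 = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
x2 = [1.0, 1.0, 2.0, 2.0, 3.0, 4.0]
y = [1 + 2 * a + 3 * b for a, b in zip(x1, x2)]


def solve(a, b):
    """Solve the linear system a @ x = b by Gauss-Jordan elimination."""
    n = len(b)
    aug = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        # Partial pivoting: move the largest remaining entry into the pivot row.
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(n):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                aug[r] = [v - factor * p for v, p in zip(aug[r], aug[col])]
    return [aug[i][n] / aug[i][i] for i in range(n)]


def ols(columns, y):
    """Least-squares coefficients [intercept, b1, b2, ...] via (X'X) b = X'y."""
    rows = [[1.0] + [col[i] for col in columns] for i in range(len(y))]
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(k)]
    return solve(xtx, xty)


bivariate = ols([x1], y)         # leaves x2 out of the model
multivariate = ols([x1, x2], y)  # accounts for both influences

print("bivariate coefficient on x1:   ", round(bivariate[1], 3))
print("multivariate coefficient on x1:", round(multivariate[1], 3))
```

In the bivariate fit, the coefficient on `x1` absorbs part of the influence of the omitted `x2`, so it overstates the association between `x1` and `y` holding `x2` fixed. The multivariate coefficient is read as the expected change in the outcome for a one-unit change in `x1` with the other included variables held constant.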

**3 hours to complete**

**4 videos**

**4 readings**

**4 practice exercises**

## Extensions of the Multivariate Model

Once you've mastered the ordinary least squares (OLS) multivariate model, you're ready to learn about a wide array of regression modeling techniques. Remember, researchers should always employ the modeling tools that best enable them to answer the question at hand. This module will focus on two tools in particular: interaction terms and models for binary dependent variables. Keep in mind, however, that there are numerous regression modeling tools that you can learn and implement based on the research question you're trying to answer. After you've developed a solid understanding of regression basics, you should feel capable of expanding this knowledge base as you move forward as a producer and consumer of analytics.
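A small sketch may help show how the two tools change interpretation. All coefficients below are hypothetical values chosen for illustration, not estimates from any data. The first part assumes a model with a continuous variable `x`, a dummy `d` and their product; the second assumes a logistic (logit) link, which is one common choice for a binary dependent variable and may not be the one the course uses.

```python
from math import exp

# Part 1: interaction term.
# Hypothetical coefficients for y = b0 + b1*x + b2*d + b3*(x*d).
b0, b1, b2, b3 = 1.0, 0.5, 2.0, 0.3


def predict(x, d):
    """Predicted outcome for a value of x and a dummy d (0 or 1)."""
    return b0 + b1 * x + b2 * d + b3 * x * d


def slope_of_x(d):
    """With an interaction, the effect of x is not one number: it depends on d."""
    return b1 + b3 * d


print("effect of x when d = 0:", slope_of_x(0))
print("effect of x when d = 1:", slope_of_x(1))

# Part 2: binary dependent variable, assuming a logit model.
# Hypothetical coefficients for P(y = 1) = logistic(a0 + a1*x).
a0, a1 = -2.0, 0.8


def logistic(z):
    """Map any real number to a probability between 0 and 1."""
    return 1.0 / (1.0 + exp(-z))


def predicted_probability(x):
    return logistic(a0 + a1 * x)


for x in (0.0, 2.5, 5.0):
    print(f"x = {x}: predicted probability = {predicted_probability(x):.3f}")
```

In the logit sketch the coefficient `a1` is not a constant change in probability: the same one-unit change in `x` moves the predicted probability by different amounts depending on the starting point.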

**3 hours to complete**

**5 videos**

**2 readings**

**2 practice exercises**

## About the Data Literacy Specialization

This specialization is intended for professionals seeking to develop a skill set for interpreting statistical results. Through four courses and a capstone project, you will cover descriptive statistics, data visualization, measurement, regression modeling, probability and uncertainty, which will prepare you to interpret and critically evaluate a quantitative analysis.
