Packt
Regression Analysis for Statistics & Machine Learning in R

Gain insight into a topic and learn the fundamentals.
Intermediate level

1 week to complete
at 10 hours a week
Flexible schedule
Learn at your own pace

What you'll learn

  • Understand the principles of Ordinary Least Squares (OLS) regression and its application in R.

  • Analyze and evaluate statistical and ML-based regression models to address issues like multicollinearity.

  • Apply techniques for variable selection and evaluate model accuracy using cross-validation methods.

  • Create and interpret Generalized Linear Models (GLMs), utilizing logistic regression as a binary classifier.

Details to know

Shareable certificate

Assessments

9 assignments

Taught in English

There are 7 modules in this course

In this module, we will introduce you to the essential concepts and tools for regression analysis in R. You'll learn the differences between statistical analysis and machine learning, get familiar with R and RStudio, and start working with data. We'll guide you through data cleaning and some initial exploratory data analysis to set a solid foundation for your future learning.

What's included

8 videos, 1 reading, 1 assignment

In this module, we will delve into Ordinary Least Squares (OLS) regression, covering both theory and practical implementation in R. You will learn how to interpret OLS results, calculate and apply confidence intervals, and explore various OLS regression techniques, including models without intercepts, ANOVA, and multiple linear regression with interaction and dummy variables. Additionally, we will discuss the essential conditions that OLS models must satisfy to ensure accurate and reliable results.

What's included

12 videos, 1 assignment, 1 plugin

In this module, we will address the challenge of multicollinearity in OLS regression models. You will learn how to detect multicollinearity and manage regression analyses with correlated predictors. The module covers advanced regression techniques such as Principal Component Regression, Partial Least Squares Regression, Ridge Regression, and LASSO Regression, providing you with a comprehensive toolkit to handle multicollinearity effectively in R.

What's included

7 videos, 1 assignment, 1 plugin

In this module, we will explore the critical aspects of variable and model selection in regression analysis. You will understand why selection is essential, learn how to choose the most appropriate OLS regression model, and identify model subsets. We'll cover evaluating regression model accuracy from a machine learning perspective and assessing performance using diverse metrics. Additionally, you will implement LASSO Regression for variable selection and analyze the contribution of predictors in explaining the variation in the outcome variable.

What's included

8 videos, 1 assignment, 1 plugin

In this module, we will tackle common violations of OLS regression model assumptions. You will learn how to apply data transformations to correct issues, use robust regression methods to manage outliers, and address heteroscedasticity to ensure the reliability and validity of your regression models. This module equips you with essential techniques to refine your analysis and improve model performance.

What's included

4 videos, 1 assignment, 1 plugin

In this module, we will introduce you to Generalized Linear Models (GLMs) and their various applications. You will learn the fundamentals of GLMs, including logistic regression for binary response variables, multinomial logistic regression, and regression techniques for count data. Additionally, we will cover methods to evaluate the goodness of fit for these models. This module will enhance your understanding of how GLMs extend traditional linear regression models to handle a wider range of data types and distributions.

What's included

7 videos, 1 assignment, 1 plugin

In this module, we will explore advanced non-parametric and non-linear regression methods. You will learn to implement polynomial and non-linear regression techniques, use Generalized Additive Models (GAMs) and their boosted versions, and develop Multivariate Adaptive Regression Splines (MARS) models. We will also cover CART regression trees, Conditional Inference Trees, Random Forests, and Gradient Boosting Regression. Additionally, you will gain insights into selecting suitable machine learning models for complex data scenarios, enhancing your ability to handle diverse data structures in R.

What's included

10 videos, 3 assignments

Instructor

Packt - Course Instructors
Packt
1,031 courses, 242,267 learners

Offered by

Packt
