Predict Diabetes with a Random Forest using R

4.5
stars
95 ratings
Offered By
Coursera Project Network
2,708 already enrolled
In this Guided Project, you will:

Complete a random Training and Test Set from one Data Source using an R function.

Practice data distribution using R and ggplot2.

Apply a Random Forest model.

Clock2 Hours
IntermediateIntermediate
CloudNo download needed
VideoSplit-screen video
Comment DotsEnglish
LaptopDesktop only

In this 1-hour long project-based course, you will learn how to (complete a training and test set using an R function, practice looking at data distribution using R and ggplot2, Apply a Random Forest model to the data, and examine the results using RMSE and a Confusion Matrix). Note: This course works best for learners who are based in the North America region. We’re currently working on providing the same experience in other regions.

Skills you will develop

Random ForestComputer ProgrammingR ProgrammingModelling

Learn step-by-step

In a video that plays in a split-screen with your work area, your instructor will walk you through these steps:

  1. Task 1: In this task the Learner will be introduced to the Course Objectives, which is to how to execute a Random Forest Model using R and the Pima Indians data set. There will be a short discussion about the Interface and an Instructor Bio.

  2. Task 2: The Learners will get experience looking at the data using ggplot2. This is important in order for the practitioner to see the balance of the data, especially as it relates to the Response Variable.

  3. Task 3: The Learner will get experience creating Testing and Training Data Sets. There are multiple ways to do this and the Instructor will go over two of them in this Task.

  4. Task 4: The Learner will get experience with the syntax of the Caret, an R package. There will be a discussion on how you can apply hundreds of algorithms to a single problem using the same syntax using Caret as well.

  5. Task 5: The Learner will get experience evaluation models in this Task. RMSE will be discussed as well as the Confusion Matrix. The conclusion of the course will use the two evaluation metrics see how well the model performed on the test data set.

How Guided Projects work

Your workspace is a cloud desktop right in your browser, no download required

In a split-screen video, your instructor guides you step-by-step

Instructor

Reviews

TOP REVIEWS FROM PREDICT DIABETES WITH A RANDOM FOREST USING R

View all reviews

Frequently asked questions

Frequently Asked Questions

More questions? Visit the Learner Help Center.