Can I download the work from my Guided Project after I complete it?

You can download and keep any of your created files from the Guided Project. To do so, you can use the “File Browser” feature while you are accessing your cloud desktop.

Is financial aid available?

Financial aid is not available for Guided Projects.

Can I audit a Guided Project and watch the video portion for free?

Auditing is not available for Guided Projects.

How much experience do I need to do this Guided Project?

At the top of the page, you can press on the experience level for this Guided Project to view any knowledge prerequisites. For every level of Guided Project, your instructor will walk you through step-by-step.

Can I complete this Guided Project right through my web browser, instead of installing special software?

Yes, everything you need to complete your Guided Project will be available in a cloud desktop that is available in your browser.

What is the learning experience like with Guided Projects?

You'll learn by doing through completing tasks in a split-screen environment directly in your browser. On the left side of the screen, you'll complete the task in your workspace. On the right side of the screen, you'll watch an instructor walk you through the project, step-by-step.

Diabetes Prediction With Pyspark MLLIB

Ends in 3 days! Save 40% on your access to 10,000+ programs and make a real impact in your career. Save now.

Diabetes Prediction With Pyspark MLLIB

Instructor: Priya Jha

1,553 already enrolled

Included with Learn more

Ask Coursera

Guided Project

Learn, practice, and apply job-ready skills with expert guidance

Intermediate level

Some related experience required

1.5 hours

Learn at your own pace

Hands-on learning

Learn more

Guided Project

Learn, practice, and apply job-ready skills with expert guidance

Intermediate level

Some related experience required

1.5 hours

Learn at your own pace

Hands-on learning

Learn more

What you'll learn

Learn to Build and Train Logistic Regression Classifier using Pyspark MLLIB
Learn to set up Pyspark on the Google Colab Environment
Learn to work with Pyspark Dataframe

Skills you'll practice

Tools you'll use

Details to know

Shareable certificate

Add to your LinkedIn profile

Taught in English

No downloads or installation required

Only available on desktop

See how employees at top companies are mastering in-demand skills

Learn more about Coursera for Business

logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Learn, practice, and apply job-ready skills in less than 2 hours

Receive training from industry experts
Gain hands-on experience solving real-world job tasks
Build confidence using the latest tools and technologies

About this Guided Project

In this 1 hour long project-based course, you will learn to build a logistic regression model using Pyspark MLLIB to classify patients as either diabetic or non-diabetic. We will use the popular Pima Indian Diabetes data set. Our goal is to use a simple logistic regression classifier from the pyspark Machine learning library for diabetes classification. We will be carrying out the entire project on the Google Colab environment with the installation of Pyspark.You will need a free Gmail account to complete this project. Please be aware of the fact that the dataset and the model in this project, can not be used in the real-life. We are only using this data for the educational purpose.

By the end of this project, you will be able to build the logistic regression classifier using Pyspark MLlib to classify between the diabetic and nondiabetic patients.You will also be able to setup and work with Pyspark on Google colab environment. Additionally, you will also be able to clean and prepare data for analysis. You should be familiar with the Python Programming language and you should have a theoretical understanding of the Logistic Regression algorithm. You will need a free Gmail account to complete this project. Note: This course works best for learners who are based in the North America region. We’re currently working on providing the same experience in other regions.

Learn step-by-step

In a video that plays in a split-screen with your work area, your instructor will walk you through these steps:

Introduction & Install Dependencies
Clone and Explore Dataset
Data Cleaning and Preparation
Correlation analysis and Feature Selection
Split Dataset and Build the Logistic Regression Model
Evaluate and Save the model
Model Prediction on a new set of unlabelled data

4 project images

Instructor

Priya Jha

16 Courses86,984 learners

Offered by

Coursera

How you'll learn

Skill-based, hands-on learning
Practice new skills by completing job-related tasks.
Expert guidance
Follow along with pre-recorded videos from experts using a unique side-by-side interface.
No downloads or installation required
Access the tools and resources you need in a pre-configured cloud workspace.
Available only on desktop
This Guided Project is designed for laptops or desktop computers with a reliable Internet connection, not mobile devices.

Why people choose Coursera for their career

Felipe M.

Learner since 2018

"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

Jennifer J.

Learner since 2020

"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

Larry W.

Learner since 2021

"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Status: Free Trial
EDUCBA
Machine Learning with Python: Diabetes Prediction
Course
Status: Free Trial
EDUCBA
PySpark: Apply & Evaluate Predictive ML Models
Course
Coursera
Diabetes Disease Detection with XG-Boost and Neural Networks
Guided Project
Status: Free Trial
Edureka
Machine Learning with PySpark
Course

Frequently asked questions

By purchasing a Guided Project, you'll get everything you need to complete the Guided Project including access to a cloud desktop workspace through your web browser that contains the files and software you need to get started, plus step-by-step video instruction from a subject matter expert.

Because your workspace contains a cloud desktop that is sized for a laptop or desktop computer, Guided Projects are not available on your mobile device.

Guided Project instructors are subject matter experts who have experience in the skill, tool or domain of their project and are passionate about sharing their knowledge to impact millions of learners around the world.

Diabetes Prediction With Pyspark MLLIB

Diabetes Prediction With Pyspark MLLIB

What you'll learn

Skills you'll practice

Tools you'll use

Details to know

See how employees at top companies are mastering in-demand skills

Learn, practice, and apply job-ready skills in less than 2 hours