IBM

Data Science with R - Capstone Project

IBM

Data Science with R - Capstone Project

This course is part of multiple programs.

Jeff Grossman
Yan Luo

Instructors: Jeff Grossman

Access provided by Volvo Cars

18,038 already enrolled

Gain insight into a topic and learn the fundamentals.

111 reviews

Intermediate level
Some related experience required
3 weeks to complete
at 10 hours a week
Flexible schedule
Learn at your own pace
Gain insight into a topic and learn the fundamentals.

111 reviews

Intermediate level
Some related experience required
3 weeks to complete
at 10 hours a week
Flexible schedule
Learn at your own pace

What you'll learn

  • Write a web scraping program to extract data from an HTML file using HTTP requests and convert the data to a data frame.

  • Prepare data for modelling by handling missing values, formatting and normalizing data, binning, and turning categorical values into numeric values.

  • Interpret datawithexploratory data analysis techniques by calculating descriptive statistics, graphing data, and generating correlation statistics.

  • Build a Shiny app containing a Leaflet map and an interactive dashboard then create a presentation on the project to share with your peers.

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

5 assignments¹

AI Graded see disclaimer
Taught in English

See how employees at top companies are mastering in-demand skills

 logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Build your subject-matter expertise

This course is available as part of
When you enroll in this course, you'll also be asked to select a specific program.
  • Learn new concepts from industry experts
  • Gain a foundational understanding of a subject or tool
  • Develop job-relevant skills with hands-on projects
  • Earn a shareable career certificate

There are 6 modules in this course

In this module, you will be introduced to the Data Science with R Capstone Project and the problem scenario you will be working on throughout the project. You will explore the datasets used in the project and understand how data can be collected from different sources. You will learn how to gather data using web scraping techniques to extract information from HTML pages and how to use HTTP requests with the OpenWeather API to retrieve weather data. The collected data will then be organized into structured formats such as data frames for further analysis.

What's included

2 videos1 assignment3 app items5 plugins

In this module, you will learn how to clean and prepare datasets for analysis through various data wrangling techniques. You will work with web-scraped data and apply methods such as renaming columns, cleaning text using regular expressions, and removing unnecessary links or characters. You will also learn how to handle missing data, convert categorical values into numeric formats, and perform data normalization to prepare the dataset for further analysis. Through hands-on labs, you will practice using functions and data manipulation techniques to transform raw data into a clean and structured format.

What's included

1 video1 assignment2 app items3 plugins

At this stage of the Capstone Project, you have gained some valuable working knowledge of data collection and data wrangling. You have also learned a lot about SQL querying and visualization. Congratulations! Now it's time to apply some of your new knowledge and learn about Exploratory Data Analysis (EDA) techniques, again through practice. You can use the datasets you wrangled in the previous Module. However, if you had any issues completing the wrangling, no worries - we have prepared some clean datasets for you to use. You will be asked to complete three labs:

What's included

1 video1 assignment3 app items3 plugins

In this module, you will learn how to build and evaluate regression models to predict hourly bike-sharing demand using weather and datetime data. You will begin by constructing a baseline linear regression model and then improve the model by incorporating polynomial, interaction, and regularization terms. Through hands-on labs, you will compare different models and evaluate their performance using metrics such as R-squared and RMSE. You will also analyze the influence of predictor variables by examining and visualizing their coefficients.

What's included

1 video1 assignment2 app items2 plugins

In this module, you will learn how to build an interactive dashboard application using R Shiny to visualize bike-sharing demand predictions. You will create a dashboard that integrates a Leaflet map to display predicted demand across different cities and allows users to explore the results through interactive controls such as dropdown menus. You will also enhance the dashboard by incorporating data visualizations with ggplot2 to display detailed bike-sharing demand trends for selected cities. Through hands-on labs, you will gain practical experience in designing and improving interactive data applications.

What's included

1 video1 assignment1 ungraded lab3 plugins

In this final module, you will focus on presenting the results of your capstone project. You will create a comprehensive PowerPoint presentation that highlights the key steps of your analysis, the insights you discovered, and the outcomes of your predictive modeling work. You will learn best practices for structuring and communicating data-driven findings effectively. After preparing your presentation, you will submit your final project either through an AI-graded submission or a peer-reviewed submission.

What's included

2 videos3 readings1 peer review1 app item5 plugins

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructors

Instructor ratings
(33 ratings)
Jeff Grossman
IBM
3 Courses 729,773 learners
Yan Luo
IBM
7 Courses 399,051 learners

Offered by

IBM

Why people choose Coursera for their career

Felipe M.

Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

Jennifer J.

Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

Larry W.

Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Learner reviews

  • 5 stars

    81.98%

  • 4 stars

    9%

  • 3 stars

    2.70%

  • 2 stars

    4.50%

  • 1 star

    1.80%

Showing 3 of 111

BT

Reviewed on Oct 21, 2023

SP

Reviewed on Jun 12, 2024

Explore more from Data Science

¹ Some assignments in this course are AI-graded. For these assignments, your data will be used in accordance with Coursera's Privacy Notice.