Johns Hopkins University

Importing Data in the Tidyverse

Shannon Ellis, PhD
Stephanie Hicks, PhD
Roger D. Peng, PhD

Instructors: Shannon Ellis, PhD

Access provided by Merck

2,055 already enrolled

Gain insight into a topic and learn the fundamentals.
4.7

(50 reviews)

Beginner level

Recommended experience

2 weeks to complete
at 10 hours a week
Flexible schedule
Learn at your own pace
Gain insight into a topic and learn the fundamentals.
4.7

(50 reviews)

Beginner level

Recommended experience

2 weeks to complete
at 10 hours a week
Flexible schedule
Learn at your own pace

What you'll learn

  • Describe different data formats

  • Apply Tidyverse functions to import data into R from external formats

  • Obtain data from a web API

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

5 assignments

Taught in English

See how employees at top companies are mastering in-demand skills

 logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Build your subject-matter expertise

This course is part of the Tidyverse Skills for Data Science in R Specialization
When you enroll in this course, you'll also be enrolled in this Specialization.
  • Learn new concepts from industry experts
  • Gain a foundational understanding of a subject or tool
  • Develop job-relevant skills with hands-on projects
  • Earn a shareable career certificate

There are 6 modules in this course

A basic data type in the tidyverse is the tibble. Tibbles store tabular data and are a modern take on the standard R data frame. They have many user-friendly features that are an improvement over standard data frames when doing interactive data analysis. The remainder of this module covers tabular data in spreadsheet formats like Excel, CSV, TSV, and other delimited files.

What's included

15 readings1 assignment

Data can come in non-tabular formats, especially unstructured data or data that otherwise would not fit into a table. JSON and XML are common formats for storing arbitrarily structured data and this module covers the packages used to read in those data formats. In addition, relational databases are common for storing very large collections of tables where you do not need to read in the entire dataset at once. There are many relational database formats and we will cover the SQLite format, which is a compact and simple to use format.

What's included

10 readings1 assignment

Reading in data from various Internet sources can be a useful way to build analyses that need to be regularly updated. The rvest and httr packages are useful for connecting to web sites, web APIs and other online sources of data.

What's included

11 readings1 assignment

Working with others in a data science project often involves reading output or data produced using other statistical analysis packages or other software. This module covers packages for reading in these foreign formats, as well as images and data from Google Drive.

What's included

3 readings1 assignment

Now we will demonstrate how to import data using our case study examples. When working through the steps of the case studies, you can use either RStudio on your own computer or Coursera lab spaces provided for each case study.

What's included

11 readings2 ungraded labs

This project will give you the opportunity to read in data from multiple sources and conduct some simple operations on those data.

What's included

2 readings1 assignment

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructors

Instructor ratings
4.3 (14 ratings)
Shannon Ellis, PhD
Johns Hopkins University
5 Courses6,759 learners
Stephanie Hicks, PhD
Johns Hopkins University
5 Courses6,759 learners
Roger D. Peng, PhD
Johns Hopkins University
37 Courses1,662,030 learners

Offered by

Why people choose Coursera for their career

Felipe M.
Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
Jennifer J.
Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
Larry W.
Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
Chaitanya A.
"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Learner reviews

4.7

50 reviews

  • 5 stars

    77.55%

  • 4 stars

    18.36%

  • 3 stars

    4.08%

  • 2 stars

    0%

  • 1 star

    0%

Showing 3 of 50

EL
5

Reviewed on Nov 22, 2022

FC
5

Reviewed on Jan 28, 2021

VM
5

Reviewed on Mar 27, 2021

Explore more from Data Science