This course introduces you to sampling and exploring data, as well as basic probability theory and Bayes' rule. You will examine various types of sampling methods, and discuss how such methods can impact the scope of inference. A variety of exploratory data analysis techniques will be covered, including numeric summary statistics and basic data visualization. You will be guided through installing and using R and RStudio (free statistical software), and will use this software for lab exercises and a final project. The concepts and techniques in this course will serve as building blocks for the inference and modeling courses in the Specialization.



Introduction to Probability and Data with R
This course is part of the Data Analysis with R Specialization
Instructor: Mine Çetinkaya-Rundel
303,096 already enrolled
(5,813 reviews)
Details to know

11 assignments

There are 8 modules in this course
This course introduces you to sampling and exploring data, as well as basic probability theory. You will examine various types of sampling methods and discuss how such methods can impact the utility of a data analysis. The concepts in this module will serve as building blocks for our later courses.

Each lesson comes with a set of learning objectives that will be covered in a series of short videos. Supplementary readings and practice problems will also be suggested from OpenIntro Statistics, 3rd Edition (https://leanpub.com/openintro-statistics/), a free online introductory statistics textbook that I co-authored. There will be weekly quizzes designed to assess your learning and mastery of the material covered that week in the videos. In addition, each week will feature a lab assignment in which you will use R to apply what you are learning to real data. There will also be a data analysis project designed to enable you to answer research questions of your own choosing.

Since this is a Coursera course, you are welcome to participate as much or as little as you’d like, though I hope that you will begin by participating fully. One of the most rewarding aspects of a Coursera course is participation in forum discussions about the course materials. Please take advantage of other students' feedback and insight, and contribute your own perspective where you see fit. You can also check out the resource page (https://www.coursera.org/learn/probability-intro/resources/crMc4), which lists useful resources for this course. Thank you for joining the Introduction to Probability and Data community! Say hello in the Discussion Forums. We are looking forward to your participation in the course.
What's included
1 video, 2 readings
Welcome to Introduction to Probability and Data! I hope you are just as excited about this course as I am! In the next five weeks, we will learn about designing studies, explore data via numerical summaries and visualizations, and learn about rules of probability and commonly used probability distributions. If you have any questions, feel free to post them on this module's forum (https://www.coursera.org/learn/probability-intro/module/rQ9Al/discussions?sort=lastActivityAtDesc&page=1) and discuss with your peers! To get started, view the learning objectives (https://www.coursera.org/learn/probability-intro/supplement/rooeY/lesson-learning-objectives) of Lesson 1 in this module.
What's included
6 videos, 2 readings, 2 assignments
To complete this assignment you will use R and RStudio installed on your local computer or through RStudio Cloud.
What's included
2 readings, 1 assignment
Welcome to Week 2 of Introduction to Probability and Data! I hope you enjoyed the materials from Week 1. This week we will delve into numerical and categorical data in more depth, and introduce inference.
What's included
7 videos, 3 readings, 2 assignments
To complete this assignment you will use R and RStudio installed on your local computer or through RStudio Cloud.
What's included
2 readings, 1 assignment
Welcome to Week 3 of Introduction to Probability and Data! Last week we explored numerical and categorical data. This week we will discuss probability, conditional probability, and Bayes’ theorem, and give a light introduction to Bayesian inference. Thank you for your enthusiasm and participation, and have a great week! I’m looking forward to working with you on the rest of this course.
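As a small preview of Bayes' theorem, here is a minimal worked example. The scenario and numbers (a condition with 1% prevalence, a test with 95% sensitivity and a 5% false-positive rate) are hypothetical, chosen only for illustration, and do not come from the course materials.

```latex
% Bayes' theorem for an event A and evidence B, with P(B) > 0
P(A \mid B) = \frac{P(B \mid A)\,P(A)}{P(B \mid A)\,P(A) + P(B \mid A^{c})\,P(A^{c})}

% Hypothetical numbers: P(D) = 0.01, P(+ \mid D) = 0.95, P(+ \mid D^{c}) = 0.05
P(D \mid +) = \frac{0.95 \times 0.01}{0.95 \times 0.01 + 0.05 \times 0.99}
            = \frac{0.0095}{0.0590} \approx 0.161
```

Under these assumed numbers, a positive result raises the probability of having the condition from 1% to only about 16%, because the condition is rare to begin with.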
What's included
9 videos, 3 readings, 2 assignments
To complete this assignment you will use R and RStudio installed on your local computer or through RStudio Cloud.
What's included
2 readings, 1 assignment
Great work so far! Welcome to Week 4, the last content week of Introduction to Probability and Data! This week we will introduce two probability distributions: the normal and the binomial. As usual, you can evaluate your knowledge in this week's quiz. There will be no labs this week. Please don't hesitate to post any questions, discussions, and related topics on this week's forum (https://www.coursera.org/learn/probability-intro/module/VdVNg/discussions?sort=lastActivityAtDesc&page=1). Also this week, you will be asked to complete an initial data analysis project with a real-world data set. The project is designed to help you discover and explore research questions of your own, using real data and the statistical methods we learn in this class. Please read the project instructions to complete this self-assessment.
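For reference, the two distributions named above have the following standard forms. This is a brief sketch and not taken from the course materials; the example values (n = 10, p = 0.5, k = 3) are hypothetical.

```latex
% Binomial: probability of exactly k successes in n independent trials,
% each with success probability p
P(X = k) = \binom{n}{k}\, p^{k} (1 - p)^{n - k}, \qquad k = 0, 1, \dots, n

% Normal: density with mean \mu and standard deviation \sigma
f(x) = \frac{1}{\sigma \sqrt{2\pi}} \exp\!\left( -\frac{(x - \mu)^{2}}{2\sigma^{2}} \right)

% Hypothetical example: n = 10, p = 0.5, k = 3
P(X = 3) = \binom{10}{3} (0.5)^{3} (0.5)^{7} = \frac{120}{1024} \approx 0.117
```

In R, the base functions `dbinom()` and `dnorm()` evaluate these two expressions; for instance, `dbinom(3, size = 10, prob = 0.5)` computes the binomial example above.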
What's included
6 videos, 5 readings, 2 assignments
Learner reviews
5,813 reviews
- 5 stars: 78.11%
- 4 stars: 17.17%
- 3 stars: 2.56%
- 2 stars: 0.70%
- 1 star: 1.44%
Showing 3 of 5,813
Reviewed on Mar 26, 2020
The instructions for the final project need to be much clearer. I had a hard time figuring it out, and all of the projects I peer-edited were done poorly. Otherwise, I enjoyed the course very much!
Reviewed on Jun 18, 2019
The contents of the course about statistics are friendly to the beginners and easy to understand, however, the R learning is a little bit hard to those who have no computer or coding background.
Reviewed on Oct 18, 2021
Great course, which is very well explained. I loved how every module has a lab assignment, which makes theory easier to understand. Final project was very interesting too! Highly recommend.

