In this course you will learn how to use survey weights to estimate descriptive statistics, such as means and totals, and more complicated quantities, such as model parameters for linear and logistic regressions. Software capabilities will be covered, with R® receiving particular emphasis. The course will also cover the basics of record linkage and statistical matching, both of which are becoming more important as ways of combining data from different sources. Combining datasets raises ethical issues, which the course reviews. Informed consent may have to be obtained from persons to allow their data to be linked. You will learn about differences in the legal requirements in different countries.



Combining and Analyzing Complex Data
This course is part of the Survey Data Collection and Analytics Specialization

Instructor: Richard Valliant, Ph.D.
9,258 already enrolled
(65 reviews)
Details to know

4 assignments
There are 4 modules in this course
After completing Modules 1 and 2 of this course, you will understand how to estimate descriptive statistics, overall and for subgroups, from survey data. We will review software for estimation (R, Stata, SAS) with examples of how to estimate quantities such as means, proportions, and totals. You will also learn how to estimate parameters in linear, logistic, and other models and learn about software options, with emphasis on R. Modules 3 and 4 discuss how you can add additional data to your analysis. This requires knowing about record linkage techniques and what it takes to get permission to link data.
What's included
7 videos, 6 readings, 1 assignment
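As a rough illustration of the weighted estimation described for this module, the following is a minimal Python sketch; the course itself emphasizes R, and the function names and toy data here are hypothetical, not course material. It computes a weighted total and a weighted mean, overall and by subgroup.

```python
from collections import defaultdict


def weighted_total(values, weights):
    """Estimate a population total as the sum of weight * value."""
    return sum(w * y for y, w in zip(values, weights))


def weighted_mean(values, weights):
    """Estimate a population mean as the weighted total over the sum of weights."""
    return weighted_total(values, weights) / sum(weights)


def weighted_mean_by_group(values, weights, groups):
    """Weighted mean within each subgroup."""
    numerator = defaultdict(float)
    denominator = defaultdict(float)
    for y, w, g in zip(values, weights, groups):
        numerator[g] += w * y
        denominator[g] += w
    return {g: numerator[g] / denominator[g] for g in numerator}


# Hypothetical toy sample: incomes, survey weights, and a region label.
incomes = [30.0, 50.0, 40.0, 80.0]
weights = [100.0, 200.0, 100.0, 100.0]
regions = ["north", "north", "south", "south"]

total = weighted_total(incomes, weights)
mean = weighted_mean(incomes, weights)
by_region = weighted_mean_by_group(incomes, weights, regions)
```

These are point estimates only. Standard errors for survey data also depend on design features such as strata and clusters, which this sketch does not handle.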
Module 2 covers how to estimate linear and logistic model parameters using survey data. After completing this module, you will understand how the methods used differ from the ones for non-survey data. We also cover the features of survey data sets that need to be accounted for when estimating standard errors of estimated model parameters.
What's included
8 videos, 8 readings, 1 assignment
This module starts with the current debate on using more (linked) administrative records in the U.S. Federal Statistical System, and a general motivation for linking records. Several examples will be given of why it is useful to link data. Challenges of record linkage will be discussed. A brief overview of key linkage techniques is included as well.
What's included
4 videos, 12 readings, 1 assignment, 1 discussion prompt
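To give a flavor of what linking records can involve, here is a minimal Python sketch of one simple rule-based approach: compare only records that share a birth year (blocking), then accept the most similar name above a threshold. This is an assumption-laden illustration, not necessarily one of the techniques the module covers; the record fields, the threshold of 0.85, and the toy data are all hypothetical.

```python
from difflib import SequenceMatcher


def name_similarity(a, b):
    """Similarity in [0, 1] between two names, ignoring case and outer spaces."""
    return SequenceMatcher(None, a.strip().lower(), b.strip().lower()).ratio()


def link_records(survey, admin, threshold=0.85):
    """Link each survey record to its best admin match within the same birth year."""
    links = {}
    for s in survey:
        best_id, best_score = None, threshold
        for a in admin:
            if a["birth_year"] != s["birth_year"]:
                continue  # blocking: only compare records sharing a birth year
            score = name_similarity(s["name"], a["name"])
            if score >= best_score:
                best_id, best_score = a["id"], score
        if best_id is not None:
            links[s["id"]] = best_id
    return links


# Hypothetical toy data: a survey file and an administrative file.
survey = [
    {"id": "s1", "name": "Jon Smith", "birth_year": 1980},
    {"id": "s2", "name": "Ana Lopez", "birth_year": 1975},
    {"id": "s3", "name": "Wei Chen", "birth_year": 1990},
]
admin = [
    {"id": "a1", "name": "John Smith", "birth_year": 1980},
    {"id": "a2", "name": "Ana Lopez", "birth_year": 1975},
    {"id": "a3", "name": "Peter Novak", "birth_year": 1990},
]

links = link_records(survey, admin)
```

In this toy run, the misspelled "Jon Smith" still links to "John Smith", while "Wei Chen" stays unlinked because no administrative name in the same birth-year block is similar enough. The sketch does not enforce one-to-one links, and an error in the blocking variable would prevent a true match from ever being compared.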
This module will discuss key issues in obtaining consent to record linkage. Failure to consent can lead to biased estimates. Current research examples will be given, as well as practical suggestions on how to obtain linkage consent.
What's included
5 videos, 3 readings, 1 assignment
Learner reviews
65 reviews
- 5 stars: 50.76%
- 4 stars: 26.15%
- 3 stars: 12.30%
- 2 stars: 4.61%
- 1 star: 6.15%
Reviewed on Feb 11, 2021
Great course! Thanks, Professor Valliant and Professor Frauke Kreuter.