When you enroll in this course, you'll also be enrolled in this Specialization.
Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate
There are 3 modules in this course
Important note: The second assignment in this course covers the topic of Graph Analysis in the Cloud, in which you will use Elastic MapReduce and the Pig language to perform graph analysis over a moderately large dataset, about 600GB. In order to complete this assignment, you will need to make use of Amazon Web Services (AWS). Amazon has generously offered to provide up to $50 in free AWS credit to each learner in this course to allow you to complete the assignment. Further details regarding the process of receiving this credit are available in the welcome message for the course, as well as in the assignment itself. Please note that Amazon, University of Washington, and Coursera cannot reimburse you for any charges if you exhaust your credit.
While we believe that this assignment contributes an excellent learning experience in this course, we understand that some learners may be unable or unwilling to use AWS. We are unable to issue Course Certificates for learners who do not complete the assignment that requires use of AWS. As such, you should not pay for a Course Certificate in Communicating Data Results if you are unable or unwilling to use AWS, as you will not be able to successfully complete the course without doing so.
Making predictions is not enough! Effective data scientists know how to explain and interpret their results, and communicate findings accurately to stakeholders to inform business decisions. Visualization is the field of research in computer science that studies effective communication of quantitative results by linking perception, cognition, and algorithms to exploit the enormous bandwidth of the human visual cortex. In this course you will learn to recognize, design, and use effective visualizations.
Just because you can make a prediction and convince others to act on it doesn’t mean you should. In this course you will explore the ethical considerations around big data and how these considerations are beginning to influence policy and practice. You will learn the foundational limitations of using technology to protect privacy and the codes of conduct emerging to guide the behavior of data scientists. You will also learn the importance of reproducibility in data science and how the commercial cloud can help support reproducible research even for experiments involving massive datasets, complex computational infrastructures, or both.
Learning Goals: After completing this course, you will be able to:
1. Design and critique visualizations
2. Explain the state-of-the-art in privacy, ethics, governance around big data and data science
3. Use cloud computing to analyze large datasets in a reproducible way.
Statistical inferences from large, heterogeneous, and noisy datasets are useless if you can't communicate them to your colleagues, your customers, your management and other stakeholders. Learn the fundamental concepts behind information visualization, an increasingly critical field of research and increasingly important skillset for data scientists. This module is taught by Cecilia Aragon, faculty in the Human Centered Design and Engineering Department.
What's included
14 videos1 peer review
Show info about module content
14 videos•Total 49 minutes
01 Introduction: What and Why•4 minutes
02 Introduction: Motivating Examples•4 minutes
03 Data Types: Definitions•4 minutes
04 Mapping Data Types to Visual Attributes•4 minutes
05 Data Types Exercise•3 minutes
06 Data Types and Visual Mappings Exercises•4 minutes
07 Data Dimensions•3 minutes
08 Effective Visual Encoding•4 minutes
09 Effective Visual Encoding Exercise•2 minutes
10 Design Criteria for Visual Encoding•3 minutes
11 The Eye is not a Camera•4 minutes
12 Preattentive Processing•4 minutes
13 Estimating Magnitude•4 minutes
14 Evaluating Visualizations•3 minutes
1 peer review•Total 60 minutes
Crime Analytics: Visualization of Incident Reports•60 minutes
Privacy and Ethics
Module 2•1 hour to complete
Module details
Big Data has become closely linked to issues of privacy and ethics: As the limits on what we *can* do with data continue to evaporate, the question of what we *should* do with data becomes paramount. Motivated in the context of case studies, you will learn the core principles of codes of conduct for data science and statistical analysis. You will learn the limits of current theory on protecting privacy while still permitting useful statistical analysis.
What's included
14 videos
Show info about module content
14 videos•Total 85 minutes
Motivation: Barrow Alcohol Study•6 minutes
Barrow Study Problems•5 minutes
Reifying Ethics: Codes of Conduct•7 minutes
ASA Code of Conduct: Responsibilities to Stakeholders•5 minutes
Other Codes of Conduct•7 minutes
Examples of Codified Rules: HIPAA•4 minutes
Privacy Guarantees: First Attempts•6 minutes
Examples of Privacy Leaks•6 minutes
Formalizing the Privacy Problem•7 minutes
Differential Privacy Defined•10 minutes
Global Sensitivity•5 minutes
Laplacian Noise•5 minutes
Adding Laplacian Noise and Proving Differential Privacy•5 minutes
Weaknesses of Differential Privacy•8 minutes
Reproducibility and Cloud Computing
Module 3•5 hours to complete
Module details
Science is facing a credibility crisis due to unreliable reproducibility, and as research becomes increasingly computational, the problem seems to be paradoxically getting worse. But reproducibility is not just for academics: Data scientists who cannot share, explain, and defend their methods for others to build on are dangerous. In this module, you will explore the importance of reproducible research and how cloud computing is offering new mechanisms for sharing code, data, environments, and even costs that are critical for practical reproducibility.
What's included
17 videos1 assignment1 programming assignment
Show info about module content
17 videos•Total 71 minutes
Reproducibility and Data Science•5 minutes
Reproducibility Gold Standard•5 minutes
Anecdote: The Ocean Appliance•4 minutes
Code + Data + Environment•4 minutes
Cloud Computing Introduction•2 minutes
Cloud Computing History•5 minutes
Code + Data + Environment + Platform•4 minutes
Cloud Computing for Reproducible Research•4 minutes
Advantages of Virtualization for Reproducibility•5 minutes
Complex Virtualization Scenarios•4 minutes
Shared Laboratories•4 minutes
Economies of Scale•5 minutes
Provisioning for Peak Load•2 minutes
Elasticity and Price Reductions•5 minutes
Server Costs vs. Power Costs•3 minutes
Reproducibility for Big Data•5 minutes
Counter-Arguments and Summary•4 minutes
1 assignment•Total 30 minutes
AWS Credit Opt-in Consent Form•30 minutes
1 programming assignment•Total 180 minutes
Graph Analysis in the Cloud•180 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Since our founding in 1861, the University of Washington has been a hub for learning, innovation, problem solving and community building. Driven by a mission to serve the greater good, our students, faculty and staff tackle today’s most pressing challenges with courage and creativity, making a difference across Washington state — and around the world.
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
Jennifer J.
Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
Larry W.
Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
Chaitanya A.
"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."
Learner reviews
3.4
142 reviews
5 stars
33.80%
4 stars
22.53%
3 stars
16.90%
2 stars
7.74%
1 star
19.01%
Showing 3 of 142
F
FQ
4·
Reviewed on Nov 11, 2016
Great and useful first week about visualization, although I wish it would cover more material . The ethics and cloud computing felt somewhat incomplete, but useful as well.
B
BL
5·
Reviewed on Aug 6, 2019
Too little people participated and long peer review time.But the course content is good.
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Specialization?
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Is financial aid available?
Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.