Duke University
Data Science with NumPy, Sets, and Dictionaries
Duke University

Data Science with NumPy, Sets, and Dictionaries


Gain insight into a topic and learn the fundamentals

Genevieve M. Lipp
Nick Eubank
Kyle Bradbury

Instructors: Genevieve M. Lipp

Beginner level

Recommended experience

30 hours to complete
3 weeks at 10 hours a week
Flexible schedule
Learn at your own pace

Details to know

Earn a career certificate

Add to your LinkedIn profile


4 quizzes

Taught in English

See how employees at top companies are mastering in-demand skills


Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV

Share it on social media and in your performance review


There are 4 modules in this course

This module, you will learn the basics of object oriented programming as well as how to use sets and dictionaries to store and work with data in Python. You will apply these concepts with Python to perform some mathematical operations and analytical tasks, including solving geometric problems with circles and counting words in a document.

What's included

10 videos4 readings4 programming assignments

This module, you will learn how to utilize NumPy--one of the most useful Python packages we use in data science--as well as learn additional data structures, arrays, beginning with the simplest type of an array, a vector. With NumPy and your new understanding of vectors, you will develop histograms as well as analyze household income distribution data in the United States, drawing your own data-driven conclusions.

What's included

1 video9 readings2 quizzes3 ungraded labs

This module, you will first learn how NumPy handles data in your program using views and copies of your data. You will then learn how to work with more complex arrays called matrices, as well as how you can subset, filter, and modify data in matrices. Finally, you will write your own programs to manipulate data matrices and report your results for a given dataset.

What's included

1 video14 readings1 quiz3 ungraded labs

In this module, you will learn how to use NumPy to summarize data from matrices (e.g., calculating averages, minimums, maximums, etc.) as well as how to begin to analyze and manipulate image data. You will also explore two new data science techniques: how to make your analysis of data matrices more computationally efficient (vectorization) and how to randomize data (randomization).

What's included

1 video11 readings1 quiz2 ungraded labs


Genevieve M. Lipp
Duke University
9 Courses255,598 learners

Offered by

Duke University

Recommended if you're interested in Data Analysis

Why people choose Coursera for their career

Felipe M.
Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
Jennifer J.
Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
Larry W.
Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
Chaitanya A.
"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

New to Data Analysis? Start here.


Open new doors with Coursera Plus

Unlimited access to 7,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription

Advance your career with an online degree

Earn a degree from world-class universities - 100% online

Join over 3,400 global companies that choose Coursera for Business

Upskill your employees to excel in the digital economy

Frequently asked questions