University of Washington
Data Manipulation at Scale: Systems and Algorithms

Instructor: Bill Howe

62,439 already enrolled

Gain insight into a topic and learn the fundamentals.
4.3 (768 reviews)
2 weeks to complete at 10 hours a week
Flexible schedule: learn at your own pace
90% of learners liked this course

Build your subject-matter expertise

This course is part of the Data Science at Scale Specialization
When you enroll in this course, you'll also be enrolled in this Specialization.
  • Learn new concepts from industry experts
  • Gain a foundational understanding of a subject or tool
  • Develop job-relevant skills with hands-on projects
  • Earn a shareable career certificate

There are 5 modules in this course

Understand the terminology and recurring principles associated with data science, and understand the structure of data science projects and emerging methodologies to approach them. Why does this emerging field exist? How does it relate to other fields? How does this course distinguish itself? What do data science projects look like, and how should they be approached? What are some examples of data science projects?

What's included

22 videos, 4 readings, 1 programming assignment

Relational databases are the workhorse of large-scale data management. Although originally motivated by problems in enterprise operations, they have proven remarkably capable for analytics as well. Most importantly, the principles underlying relational databases are universal in managing, manipulating, and analyzing data at scale. Even as the landscape of large-scale data systems has expanded dramatically in the last decade, relational models and languages have remained a unifying concept. For working with large-scale data, there is no more important programming model to learn.

What's included

24 videos, 1 programming assignment

The MapReduce programming model (as distinct from its implementations) was proposed as a simplifying abstraction for parallel manipulation of massive datasets, and remains an important concept to know when using and evaluating modern big data platforms.

What's included

26 videos, 1 programming assignment

NoSQL systems are purely about scale rather than analytics, and are arguably less relevant for the practicing data scientist. However, they occupy an important place in many practical big data platform architectures, and data scientists need to understand their limitations and strengths to use them effectively.

What's included

36 videos

Graph-structured data are increasingly common in data science contexts due to their ubiquity in modeling the communication between entities: people (social networks), computers (Internet communication), cities and countries (transportation networks), or corporations (financial transactions). Learn the common algorithms for extracting information from graph data and how to scale them up.

What's included

21 videos

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructor

Instructor ratings
4.1 (15 ratings)
Bill Howe
University of Washington
4 courses, 91,537 learners

Learner reviews

4.3 (768 reviews)

  • 5 stars: 57.14%
  • 4 stars: 25.06%
  • 3 stars: 8.96%
  • 2 stars: 4.80%
  • 1 star: 4.02%

Showing 3 of 768

  • MS: 4 stars, reviewed on Jan 4, 2016
  • DK: 5 stars, reviewed on Jan 23, 2016
  • AA: 4 stars, reviewed on Dec 2, 2015
