Data analysis has replaced data acquisition as the bottleneck to evidence-based decision making --- we are drowning in it. Extracting knowledge from large, heterogeneous, and noisy datasets requires not only powerful computing resources, but the programming abstractions to use them effectively. The abstractions that emerged in the last decade blend ideas from parallel databases, distributed systems, and programming languages to create a new class of scalable data analytics platforms that form the foundation for data science at realistic scales.
This course is part of the Data Science at Scale Specialization
Offered By


About this Course
Could your company benefit from training employees on in-demand skills?
Try Coursera for BusinessSkills you will gain
- Relational Algebra
- Python Programming
- Mapreduce
- SQL
Could your company benefit from training employees on in-demand skills?
Try Coursera for BusinessOffered by
Syllabus - What you will learn from this course
Data Science Context and Concepts
Relational Databases and the Relational Algebra
MapReduce and Parallel Dataflow Programming
NoSQL: Systems and Concepts
Graph Analytics
Reviews
- 5 stars57.23%
- 4 stars25.39%
- 3 stars9.07%
- 2 stars4.73%
- 1 star3.55%
TOP REVIEWS FROM DATA MANIPULATION AT SCALE: SYSTEMS AND ALGORITHMS
A great way to start, and become familiar with the nature, requirements & analytics of today's data.
Its pretty decent. I liked the assignments. However there were some typos in the lecture slides and also the grader output is not very friendly.
Engaging problemset makes sure that you will get your hands dirty with data. And that is great! Definitely worth your time.
covers a lot of ground quickly, but you still get a good understanding of the underlying theory or technologies
About the Data Science at Scale Specialization

Frequently Asked Questions
When will I have access to the lectures and assignments?
What will I get if I subscribe to this Specialization?
Is financial aid available?
More questions? Visit the Learner Help Center.