In this course, you'll learn how to manage big datasets, how to load them into clusters and cloud storage, and how to apply structure to the data so that you can run queries on it using distributed SQL engines like Apache Hive and Apache Impala. You’ll learn how to choose the right data types, storage systems, and file formats based on which tools you’ll use and what performance you need.
This course is part of the Modern Big Data Analysis with SQL Specialization
About this Course
What you will learn
Use different tools to browse existing databases and tables in big data systems
Use different tools to explore files in distributed big data filesystems and cloud storage
Create and manage big data databases and tables using Apache Hive and Apache Impala
Describe and choose among different data types and file formats for big data systems
Skills you will gain
- Data Management
- Distributed File Systems
- Cloud Storage
- Big Data
Syllabus - What you will learn from this course
Orientation to Data in Clusters and Cloud Storage
Defining Databases, Tables, and Columns
Data Types and File Types
Managing Datasets in Clusters and Cloud Storage
- 5 stars79.57%
- 4 stars15.49%
- 3 stars3.52%
- 2 stars1.05%
- 1 star0.35%
TOP REVIEWS FROM MANAGING BIG DATA IN CLUSTERS AND CLOUD STORAGE
The course spanned across multiple areas. It was great learning.
This is one of the best courses I've attended on Coursera! Kudos to the course organizers and instructors.
Super useful course with a lot of hands on practices. Though the VM is running slow on my computer.
Absolutely amazing! Have learnt a lot from this course!
About the Modern Big Data Analysis with SQL Specialization
Frequently Asked Questions
When will I have access to the lectures and assignments?
What will I get if I subscribe to this Specialization?
Is financial aid available?
What are the hardware and software requirements for the exercise environment?
More questions? Visit the Learner Help Center.