In this course, you'll learn how to manage big datasets, how to load them into clusters and cloud storage, and how to apply structure to the data so that you can run queries on it using distributed SQL engines like Apache Hive and Apache Impala. You’ll learn how to choose the right data types, storage systems, and file formats based on which tools you’ll use and what performance you need.
Offered By
About this Course
What you will learn
Use different tools to browse existing databases and tables in big data systems
Use different tools to explore files in distributed big data filesystems and cloud storage
Create and manage big data databases and tables using Apache Hive and Apache Impala
Describe and choose among different data types and file formats for big data systems
Skills you will gain
Offered by

Cloudera
At Cloudera, we believe that data can make what is impossible today, possible tomorrow. We empower people to transform complex data into clear and actionable insights. Cloudera delivers an enterprise data cloud for any data, anywhere, from the Edge to AI. Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world’s largest enterprises.
Syllabus - What you will learn from this course
Orientation to Data in Clusters and Cloud Storage
Defining Databases, Tables, and Columns
Data Types and File Types
Managing Datasets in Clusters and Cloud Storage
Reviews
TOP REVIEWS FROM MANAGING BIG DATA IN CLUSTERS AND CLOUD STORAGE
This was definitely the most challenging course I have done so far, but I am loving it! Glynn and Ian do a fantastic job of guiding you through the content in an engaging and practical manner.
This is Very good course for a beginners, it gives you lots of exercises to practice in vm and course material is Really really good but only thing is you have to read a lot ,
It would have been nice if videos would have been present instead of reading. Also a more deep diving would have been done in concepts like bucketing and indexing.
This is one of the systematic specializations which makes the harder and otherwise overwhelming subject so easy to navigate, follow and learn.
About the Modern Big Data Analysis with SQL Specialization
This Specialization teaches the essential skills for working with large-scale data using SQL.

Frequently Asked Questions
When will I have access to the lectures and assignments?
What will I get if I subscribe to this Specialization?
Is financial aid available?
What are the hardware and software requirements for the exercise environment?
More questions? Visit the Learner Help Center.