In this course, you'll learn how to manage big datasets, how to load them into clusters and cloud storage, and how to apply structure to the data so that you can run queries on it using distributed SQL engines like Apache Hive and Apache Impala. You’ll learn how to choose the right data types, storage systems, and file formats based on which tools you’ll use and what performance you need.
This course is part of the Modern Big Data Analysis with SQL Specialization
About this Course
What you will learn
Use different tools to browse existing databases and tables in big data systems
Use different tools to explore files in distributed big data filesystems and cloud storage
Create and manage big data databases and tables using Apache Hive and Apache Impala
Describe and choose among different data types and file formats for big data systems
Skills you will gain
- Data Management
- Distributed File Systems
- Cloud Storage
- Big Data
Syllabus - What you will learn from this course
Orientation to Data in Clusters and Cloud Storage
Defining Databases, Tables, and Columns
Data Types and File Types
Managing Datasets in Clusters and Cloud Storage
- 5 stars79.35%
- 4 stars15.65%
- 3 stars3.55%
- 2 stars1.06%
- 1 star0.35%
TOP REVIEWS FROM MANAGING BIG DATA IN CLUSTERS AND CLOUD STORAGE
The course spanned across multiple areas. It was great learning.
The courses provided in this specialization are very good and gave more than expectations.
Although more reading than video, I found readings to be more convenient when you need to get back to a particular lesson and recap something.
i really enjoyed completing this course . i gained to know many new things
About the Modern Big Data Analysis with SQL Specialization
Frequently Asked Questions
When will I have access to the lectures and assignments?
What will I get if I subscribe to this Specialization?
Is financial aid available?
What are the hardware and software requirements for the exercise environment?
More questions? Visit the Learner Help Center.