I want to purchase this Specialization for my employees! How can I do that?

Please go to https://www.coursera.org/enterprise for more information, to contact Coursera, and to pick a plan. For each plan, you decide the number of courses each person can take and hand-pick the collection of courses they can choose from.

Is this course really 100% online? Do I need to attend any classes in person?

This course is completely online, so there’s no need to show up to a classroom in person. You can access your lectures, readings and assignments anytime and anywhere via the web or your mobile device.

Can I just enroll in a single course?

Yes! To get started, click the course card that interests you and enroll. You can enroll and complete the course to earn a shareable certificate. When you subscribe to a course that is part of a Specialization, you’re automatically subscribed to the full Specialization. Visit your learner dashboard to track your progress.

Is financial aid available?

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If financial aid or a scholarship is available for your selected learning program, you’ll find a link to apply on the description page.

Can I take the course for free?

No, you cannot take this course for free. When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. If you cannot afford the fee, you can apply for financial aid.

Will I earn university credit for completing the Specialization?

This Specialization doesn't carry university credit, but some universities may choose to accept Specialization Certificates for credit. Check with your institution to learn more.

Modern Big Data Analysis with SQL Specialization

Learn Data Analysis for Big Data.

Master using SQL for data analysis on distributed big data systems

Instructors: Glynn Durham

29,985 already enrolled

3 course series

Get in-depth knowledge of a subject

from 1,431 reviews of courses in this program

Beginner level

No prior experience required

4 weeks to complete

at 10 hours a week

Flexible schedule

Learn at your own pace

What you'll learn

This Specialization teaches the essential skills for working with large-scale data using SQL.

Maybe you are new to SQL and you want to learn the basics. Or maybe you already have some experience using SQL to query smaller-scale data with relational databases. Either way, if you are interested in gaining the skills necessary to query big data with modern distributed SQL engines, this Specialization is for you.

Most courses that teach SQL focus on traditional relational databases, but today, more and more of the data that’s being generated is too big to be stored there, and it’s growing too quickly to be efficiently stored in commercial data warehouses. Instead, it’s increasingly stored in distributed clusters and cloud storage. These data stores are cost-efficient and highly scalable.

To query these huge datasets in clusters and cloud storage, you need a newer breed of SQL engine: distributed query engines such as Hive, Impala, Presto, and Drill. These are open-source SQL engines capable of querying enormous datasets. This Specialization focuses on Hive and Impala, the most widely deployed of these query engines.

This Specialization is designed to provide excellent preparation for the Cloudera Certified Associate (CCA) Data Analyst certification exam. You can earn this certification credential by taking a hands-on practical exam using the same SQL engines that this Specialization teaches—Hive and Impala.

Applied Learning Project

Each course in this Specialization includes a hands-on, peer-graded assignment. To earn the Specialization Certificate, you must successfully complete the hands-on, peer-graded assignment in each course. For this Specialization, there is not a separate Capstone Project like there is in some other Coursera Specializations.

Details to know

Shareable certificate

Add to your LinkedIn profile

Taught in English

Advance your subject-matter expertise

Learn in-demand skills from university and industry experts
Master a subject or tool with hands-on projects
Develop a deep understanding of key concepts
Earn a career certificate from Cloudera

Specialization - 3 course series

Foundations for Big Data Analysis with SQL

Course 1, 12 hours

What you'll learn

Distinguish operational from analytic databases, and understand how these are applied in big data
Understand how database and table design provides structures for working with data
Appreciate how differences in volume and variety of data affect your choice of an appropriate database system
Recognize the features and benefits of SQL dialects designed to work with big data systems for storage and analysis

Skills you'll gain

Relational Databases
SQL
Big Data
Database Design
Unstructured Data
Operational Databases
Database Systems
NoSQL
Databases
Data Management
Data Warehousing
Database Management Systems
Data Analysis
Data Storage
Virtual Machines

Analyzing Big Data with SQL

Course 2, 18 hours

What you'll learn

Understand the basics of SELECT statements
Understand how and why to filter results
Explore grouping and aggregation to answer analytic questions
Work with sorting and limiting results
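
As a rough illustration of the four topics above (SELECT, filtering, grouping with aggregation, and sorting with limits), here is a minimal sketch. It uses Python's built-in sqlite3 module as a stand-in, not Hive or Impala, and the `orders` table, its columns, and its rows are invented for the example. The same clauses exist in Hive and Impala, though dialect details can differ.

```python
import sqlite3

# Hypothetical sample data: the table and column names are invented for illustration.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (customer TEXT, country TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [
        ("ana", "BR", 120.0),
        ("bo", "US", 80.0),
        ("cy", "US", 200.0),
        ("di", "FR", 15.0),
        ("ed", "BR", 60.0),
    ],
)

query = """
    SELECT country, COUNT(*) AS num_orders, SUM(amount) AS total
    FROM orders
    WHERE amount >= 50      -- filter individual rows before grouping
    GROUP BY country        -- one result row per country
    ORDER BY total DESC     -- sort by the aggregated total
    LIMIT 2                 -- keep only the top two countries
"""
rows = conn.execute(query).fetchall()
for row in rows:
    print(row)
conn.close()
```

The WHERE clause drops the one small order before grouping, so only two countries remain and both fit within the limit.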

Skills you'll gain

SQL
Apache Hive
Analytics
Data Management
Data Analysis
Data Manipulation
PostgreSQL
Big Data
MySQL
Databases
Data Integration
Query Languages
Virtual Machines

Managing Big Data in Clusters and Cloud Storage

Course 3, 21 hours

What you'll learn

Use different tools to browse existing databases and tables in big data systems
Use different tools to explore files in distributed big data filesystems and cloud storage
Create and manage big data databases and tables using Apache Hive and Apache Impala
Describe and choose among different data types and file formats for big data systems

Skills you'll gain

Database Management
Data Storage
Apache Hive
Data Storage Technologies
Amazon S3
Data Management
Data Import/Export
Databases
SQL
Performance Tuning
Big Data
Database Management Systems
Query Languages
Cloud Storage
Amazon Web Services
File Systems
Data Access
Data Store
Extract, Transform, Load
Metadata Management

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructors

Glynn Durham

2 Courses • 54,934 learners

Offered by

Cloudera

Frequently asked questions

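The courses in this Specialization are intended to be taken in order:
Course 1: Foundations for Big Data Analysis with SQL
Course 2: Analyzing Big Data with SQL
Course 3: Managing Big Data in Clusters and Cloud Storage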

A fourth course entitled Advanced SQL for Big Data Analysis is currently under development. When it is completed, it will be added to this Specialization.

To use the hands-on environment for the courses in this Specialization, you need to download and install a virtual machine and the software on which to run it. Before continuing, be sure that you have access to a computer that meets the following hardware and software requirements:
• Windows, macOS, or Linux operating system (iPads and Android tablets will not work)
• 64-bit operating system (32-bit operating systems will not work)
• 8 GB RAM or more
• 25 GB free disk space or more
• Intel VT-x or AMD-V virtualization support enabled (on Mac computers with Intel processors, this is always enabled; on Windows and Linux computers, you might need to enable it in the BIOS)
• For Windows XP computers only: You must have an unzip utility such as 7-Zip or WinZip installed (Windows XP’s built-in unzip utility will not work)

Successfully completing this Specialization confers a Coursera Specialization Certificate. This is different from the Cloudera Certified Associate (CCA) Data Analyst credential. You can earn the CCA Data Analyst credential by passing a 120-minute performance-based exam. For pricing and other details, see CCA Data Analyst. If you complete this Specialization, including the honors lessons, then you should be well prepared to take the certification exam, but we cannot guarantee that you will pass it and earn the certification credential.

Each course in this Specialization includes a hands-on, peer-graded assignment. To earn the Specialization Certificate, you must earn the Course Certificate for each course in this Specialization. This requires that you successfully complete the hands-on, peer-graded assignment in each course. For this Specialization, there is not a separate Capstone Project like there is in some other Coursera Specializations.