Apache Hive: Design, Query & Optimize Big Data




This course is part of the "Hadoop & Big Data Foundations Mastery Course" Specialization

Instructor: EDUCBA

5 modules

Get an overview of a topic and learn the fundamentals.

1 week to complete

at 10 hours a week

Flexible schedule

Learn at your own pace

What you'll learn

Design and manage Hive databases, tables, and partitions.
Implement joins, UDFs, and SerDe for data transformation.
Optimize queries and tune performance for big data workflows.

Skills you'll gain

Data Storage
Data Transformation
Data Processing
SQL
Data Management
Performance Tuning
Database Management
Database Development
Data Warehousing

Tools you'll learn

Apache Hive
Apache Hadoop
Query Languages
Extensible Markup Language (XML)

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

25 assignments

Taught in English

91% of learners achieved a positive career outcome


Build your subject-matter expertise

This course is part of the "Hadoop & Big Data Foundations Mastery Course" Specialization

When you enroll in this course, you are also enrolled in this Specialization.

Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate

There are 5 modules in this course

Learners will be able to design Hive databases and tables, implement partitions and bucketing, apply joins, configure SerDe, create custom UDFs, and optimize queries for efficient big data processing. By the end of the course, participants will not only understand Hive fundamentals but also apply advanced operations such as indexing, views, Slowly Changing Dimensions (SCDs), XML data handling, variable substitution, and performance tuning.

This course provides a step-by-step pathway from beginner to advanced Hive skills, ensuring a solid foundation in HiveQL while introducing real-world scenarios that mirror enterprise big data challenges. Unlike generic SQL courses, this program is specifically tailored to Hive within the Hadoop ecosystem, highlighting its schema-on-read model, distributed query execution, and integration with Hadoop’s scalability. Learners will gain hands-on practice with query optimization, compression, and Hive architecture, making them confident in handling large-scale datasets. Upon completion, they will be able to analyze, transform, and optimize big data effectively, preparing for careers in data engineering, analytics, and Hadoop ecosystem management.

This module introduces Apache Hive and its core fundamentals, including databases, tables, partitions, and bucketing. Learners will explore how Hive enables SQL-like queries on Hadoop, manage datasets, and apply key commands for efficient data handling.
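To make the difference between partitions and buckets concrete, here is a minimal Python sketch of how rows might be laid out. It is an illustration only, not course material: the function names, the `sales` table, and the sample rows are hypothetical, and the modulo on an integer key is a simplified stand-in for Hive's bucketing hash, which depends on the column type and Hive version.

```python
from collections import defaultdict


def partition_dir(table, column, value):
    """Build a Hive-style partition directory name such as sales/country=US."""
    return f"{table}/{column}={value}"


def bucket_index(key, num_buckets):
    """Pick a bucket for an integer key (simplified stand-in for Hive's hash)."""
    return key % num_buckets


def lay_out(rows, table, partition_col, bucket_col, num_buckets):
    """Group rows by (partition directory, bucket number)."""
    files = defaultdict(list)
    for row in rows:
        directory = partition_dir(table, partition_col, row[partition_col])
        bucket = bucket_index(row[bucket_col], num_buckets)
        files[(directory, bucket)].append(row)
    return dict(files)


# Hypothetical sample data: partition by country, bucket by id into 2 buckets.
rows = [
    {"id": 1, "country": "US"},
    {"id": 2, "country": "US"},
    {"id": 3, "country": "FR"},
    {"id": 5, "country": "US"},
]
layout = lay_out(rows, "sales", "country", "id", num_buckets=2)
for (directory, bucket), members in sorted(layout.items()):
    print(directory, bucket, [r["id"] for r in members])
```

The partition column decides the directory, so a query filtering on `country` can skip whole directories, while the bucket number spreads rows within each partition into a fixed number of files.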

Included

13 videos, 5 assignments

13 videos (total 85 minutes)

Introduction to HIVE (11 minutes)
HIVE Data Base (10 minutes)
Load Data Command (6 minutes)
How to Replace Column (4 minutes)
External Table (6 minutes)
HIVE Metastore (3 minutes)
What is Hive Partition (10 minutes)
Creating Partition Table (9 minutes)
Insert Overwrite Table (4 minutes)
Dynamic Partition True (2 minutes)
Hive Bucketing (5 minutes)
Decomposing Data Sets (6 minutes)
Hive Joins (9 minutes)

5 assignments (total 70 minutes)

Getting Started with Hive (10 minutes)
Tables and Data Management Basics (10 minutes)
Partitions and Bucketing (10 minutes)
Dataset Operations and Decomposition (10 minutes)
Hive Fundamentals (30 minutes)

This module focuses on Hive joins, serialization and deserialization (SerDe), and user-defined functions (UDFs). Learners will practice how to extend HiveQL functionality and apply advanced data transformation techniques.
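As a rough picture of what a SerDe does, the Python sketch below deserializes one stored text line into a typed row and serializes it back. It is a conceptual stand-in, not Hive's Java SerDe interface: the function names, the `CASTS` table, and the sample schema are hypothetical. The Ctrl-A field delimiter and the `\N` null token follow Hive's defaults for delimited text tables.

```python
FIELD_DELIM = "\x01"  # Ctrl-A, the default field delimiter for Hive text tables
NULL_TOKEN = "\\N"    # default text representation of NULL

# Hypothetical mapping from a few Hive type names to Python converters.
CASTS = {"int": int, "double": float, "string": str}


def deserialize(line, schema):
    """Turn one stored line into a row dict, applying the schema at read time."""
    row = {}
    for (name, type_name), raw in zip(schema, line.split(FIELD_DELIM)):
        row[name] = None if raw == NULL_TOKEN else CASTS[type_name](raw)
    return row


def serialize(row, schema):
    """Turn a row dict back into one delimited line."""
    values = []
    for name, _type_name in schema:
        value = row[name]
        values.append(NULL_TOKEN if value is None else str(value))
    return FIELD_DELIM.join(values)


# Hypothetical schema and record.
schema = [("id", "int"), ("name", "string"), ("score", "double")]
line = "7" + FIELD_DELIM + "Ada" + FIELD_DELIM + NULL_TOKEN
row = deserialize(line, schema)
print(row)
print(serialize(row, schema) == line)
```

Because the schema is applied only when the line is read, the same stored bytes could be read with a different schema later, which is the schema-on-read idea the course overview refers to.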

Included

12 videos, 5 assignments

12 videos (total 88 minutes)

Hive Joins Continue (10 minutes)
Skew Join (3 minutes)
What is Serde (7 minutes)
Serde in Hive (9 minutes)
Hive UDF (10 minutes)
Hive UDF Continues (7 minutes)
More Hive UDF (7 minutes)
Maxcale Function (3 minutes)
Hive Example Use Case (12 minutes)
Introduction to Hive Concepts and Hands-on Demonstration (6 minutes)
Internal Table and External Table (6 minutes)
Inserting Data Into Tables (7 minutes)

5 assignments (total 70 minutes)

Advanced Joins (10 minutes)
Serialization and Deserialization (10 minutes)
Hive Functions and Use Cases (10 minutes)
Core Hive Demonstration (10 minutes)
Joins, SerDe, and UDFs (30 minutes)

This module covers Hive operations, functions, and expressions, along with advanced partitioning strategies. Learners will gain hands-on experience with sorting, joins, alter commands, and table sampling for data optimization.
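Among the join techniques in this module, the map join (map-side join) is easy to sketch: the small table is loaded into an in-memory hash table and the large table is streamed past it, so no shuffle of the large table is needed. The Python below is a minimal illustration under that assumption; the function name and the sample tables are hypothetical, and it covers only an inner join on a single key.

```python
def map_join(small_rows, large_rows, key):
    """Inner-join two row lists by building a hash table on the small side."""
    lookup = {}
    for row in small_rows:
        lookup.setdefault(row[key], []).append(row)
    # Stream the large side and probe the in-memory table for each row.
    for big in large_rows:
        for small in lookup.get(big[key], []):
            yield {**small, **big}


# Hypothetical data: a small dimension table and a larger fact table.
countries = [
    {"code": "US", "name": "United States"},
    {"code": "FR", "name": "France"},
]
orders = [
    {"order_id": 1, "code": "FR"},
    {"order_id": 2, "code": "US"},
    {"order_id": 3, "code": "DE"},  # no match, dropped by the inner join
]
joined = list(map_join(countries, orders, "code"))
for row in joined:
    print(row)
```

The approach only pays off when the small side fits in memory, which is why it is typically applied to a small lookup table joined against a large fact table.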

Included

12 videos, 5 assignments

12 videos (total 81 minutes)

Date and Mathematical Functions (9 minutes)
Conditional Statements (7 minutes)
Explode and Lateral View (8 minutes)
Sorting (6 minutes)
Join (9 minutes)
Map Join (2 minutes)
Static and Dynamic Partitioning (7 minutes)
More on Dynamic Partitioning (7 minutes)
Alter Command (6 minutes)
MSCK Command (9 minutes)
Bucketing (8 minutes)
Table Sampling (3 minutes)

5 assignments (total 70 minutes)

Functions and Expressions (10 minutes)
Sorting and Joins (10 minutes)
Partitioning and Alter Commands (10 minutes)
Commands, Bucketing, and Sampling (10 minutes)
Hive Operations and Partitioning (30 minutes)

This module explores Hive views, indexing techniques, and configuration of Hive variables. Learners will create reusable query structures, apply compact and bitmap indexes, and configure variable substitution for query optimization.
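Variable substitution can be pictured as a text replacement that happens before the query runs. The Python sketch below mimics that for references written as `${hiveconf:name}` and `${hivevar:name}`, which is the syntax Hive uses. Everything else is an assumption made for illustration: the function name, the sample query, and the choice to leave unknown references untouched.

```python
import re

# Matches ${hiveconf:name} and ${hivevar:name} references in query text.
_REF = re.compile(r"\$\{(hiveconf|hivevar):([A-Za-z0-9_.]+)\}")


def substitute(query, hiveconf=None, hivevar=None):
    """Replace variable references with their values before execution."""
    namespaces = {"hiveconf": hiveconf or {}, "hivevar": hivevar or {}}

    def replace(match):
        namespace, name = match.group(1), match.group(2)
        # Assumption of this sketch: unknown references are left as written.
        return namespaces[namespace].get(name, match.group(0))

    return _REF.sub(replace, query)


# Hypothetical query template and variable values.
template = "SELECT * FROM ${hivevar:tbl} WHERE year = ${hiveconf:yr}"
query = substitute(template, hiveconf={"yr": "2024"}, hivevar={"tbl": "sales"})
print(query)
```

Keeping the table name and filter value outside the query text lets the same script be reused from a shell wrapper with different settings.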

Included

12 videos, 5 assignments

12 videos (total 70 minutes)

Archiving (3 minutes)
Ranks (9 minutes)
Creating Views (9 minutes)
Advantages of views and Altering Views (7 minutes)
What is Indexing (6 minutes)
Compact and Bitmap Index Running Time (5 minutes)
Hive Commands in Bash Shell (5 minutes)
Hive Variables - Hiveconf (4 minutes)
Hive Variables - Hiveconf in Bash Shell (5 minutes)
Configuring a Hive Var Variable (9 minutes)
Variable Substitution (2 minutes)
Word Count (6 minutes)

5 assignments (total 70 minutes)

Archiving and Ranking (10 minutes)
Views in Hive (10 minutes)
Indexing and Commands in Hive (10 minutes)
Hive Variables and Substitution (10 minutes)
Views, Indexing, and Variables (30 minutes)

This module introduces Hive’s internal architecture, execution modes, and advanced features. Learners will explore SCDs, XML data handling, immutable tables, compression techniques, and performance configurations.
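The course page does not say which kind of Slowly Changing Dimension is taught, so the Python sketch below assumes Type 2, the variant that keeps history by closing the current record and appending a new version. The row layout (`key`, `attrs`, `start_date`, `end_date`) and the function name are hypothetical choices for this illustration.

```python
from datetime import date


def apply_scd2(dimension, key, new_attrs, effective):
    """Return a new dimension list with a Type 2 change applied for `key`."""
    result = []
    changed = True
    for row in dimension:
        is_current = row["key"] == key and row["end_date"] is None
        if is_current and row["attrs"] == new_attrs:
            changed = False  # nothing to do, attributes are identical
            result.append(row)
        elif is_current:
            result.append({**row, "end_date": effective})  # close old version
        else:
            result.append(row)
    if changed:
        result.append(
            {"key": key, "attrs": new_attrs,
             "start_date": effective, "end_date": None}
        )
    return result


# Hypothetical dimension with one customer who moves from Paris to Lyon.
dim = [
    {"key": 1, "attrs": {"city": "Paris"},
     "start_date": date(2020, 1, 1), "end_date": None},
]
updated = apply_scd2(dim, 1, {"city": "Lyon"}, date(2024, 6, 1))
for row in updated:
    print(row)
```

A row with `end_date` set to `None` is treated as the current version, so a query for the current state filters on that, while historical queries compare a date against the start and end columns.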

Included

23 videos, 5 assignments

23 videos (total 141 minutes)

Hive Architecture (3 minutes)
Parallelism in Hive (6 minutes)
Table Properties in Hive (6 minutes)
Null Format Properties (6 minutes)
Null Format Properties Continues (4 minutes)
Purge Commands in Hives (5 minutes)
Slowing Changing Dimension (7 minutes)
Implement the SCD (9 minutes)
Example of the SCD (4 minutes)
How to Load XML Data in Hive (5 minutes)
How to Load XML Data in Hive Continue (9 minutes)
No Drop and Offline in Hive (8 minutes)
Immutable Table (9 minutes)
How to Create Hive RC File (9 minutes)
Multiple Tables (6 minutes)
Merging Hive Created Files and Function rLike (6 minutes)
Various Configuration Settings in Hive (9 minutes)
Various Configuration Settings in Hive Continues (3 minutes)
Compressing Various Files in Hive (6 minutes)
Different Modes in Hive (4 minutes)
File Compression in Hive (6 minutes)
Type of Mode in Hive (4 minutes)
Comparison of Internal and External Table (8 minutes)

5 assignments (total 70 minutes)

Architecture and Table Properties (10 minutes)
Storage and Data Dimensions (10 minutes)
XML and Table Management (10 minutes)
Configuration, Compression, and Modes (10 minutes)
Hive Architecture and Advanced Features (30 minutes)

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructor

EDUCBA

1,366 courses, 301,751 learners

Offered by

EDUCBA



Frequently Asked Questions

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If financial aid or a scholarship is available for your learning program selection, you’ll find a link to apply on the description page.