Is financial aid available?

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If financial aid or a scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

Can I take the course for free?

No, you cannot take this course for free. When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. If you cannot afford the fee, you can apply for financial aid.

Will I earn university credit for completing the Specialization?

This Specialization doesn't carry university credit, but some universities may choose to accept Specialization Certificates for credit. Check with your institution to learn more.

Real-Time, Real Fast: Kafka & Spark for Data Engineers Specialization

Real-Time Kafka & Spark Data Engineering. Build fault-tolerant streaming pipelines processing millions of events with Kafka & Spark.

4-course series

Gain in-depth knowledge of a subject

Intermediate level

Recommended experience

4 weeks to complete

at 10 hours per week

Flexible schedule

Learn at your own pace

What you'll learn

Design and optimize Kafka clusters for high throughput, low latency, and fault tolerance in production environments
Build end-to-end streaming pipelines with Spark Structured Streaming, exactly-once semantics, and schema evolution
Implement real-time dashboards, orchestration, and disaster recovery for enterprise streaming architectures

Skills you'll gain

Performance Tuning
Data Governance
Data Pipelines
Apache Kafka
Data Architecture
Data Transformation
Apache Spark
Scalability
Disaster Recovery
Prometheus (Software)
Docker (Software)
Data Processing
Fraud Detection
Event-Driven Programming
Operational Databases
System Monitoring
Grafana
Data Integrity
Real Time Data
PySpark

Details to know

Shareable certificate

Add to your LinkedIn profile

Taught in English

Recently updated!

January 2026

Advance your subject-matter expertise

Learn in-demand skills from university and industry experts
Master a subject or tool with hands-on projects
Develop a deep understanding of key concepts
Earn a career certificate from Coursera

Specialization - 4-course series

Learn the complete lifecycle of real-time data engineering with Apache Kafka and Spark through hands-on projects that mirror production challenges at companies like Netflix, LinkedIn, and Uber. This comprehensive specialization teaches you to design high-availability streaming architectures, optimize Kafka clusters for millions of events per second, implement exactly-once processing semantics, manage schema evolution without downtime, and build real-time dashboards that power instant business decisions. Starting with Kafka performance tuning and progressing through Spark Structured Streaming, CDC pipelines, and production orchestration, you'll gain the skills to architect, implement, and operate enterprise-grade streaming systems. Each course includes practical labs where you'll configure distributed systems, diagnose performance bottlenecks, handle failures gracefully, and deploy pipelines that transform high-velocity data into immediate business value.

Applied Learning Project

Throughout this specialization, you'll complete hands-on projects that simulate real-world streaming challenges: configure Kafka clusters for high availability, implement exactly-once processing pipelines, build CDC systems with schema evolution, create real-time fraud detection engines, develop live operational dashboards, and design multi-region recovery strategies. Each project progresses from foundational setup through production deployment, using Docker environments and cloud-ready architectures that you can immediately apply in professional settings.

Optimize Kafka for Speed & Availability

Course 1, 3 hours

What you'll learn

Configure Kafka topics with appropriate replication factors, partition counts, and durability settings to ensure high availability.
Diagnose performance bottlenecks using consumer lag metrics, broker health indicators, and throughput analysis.
Optimize producer and consumer configurations including batching, compression, and parallelism to maximize throughput while meeting latency SLAs.

Skills you'll gain

Apache Kafka
Prometheus (Software)
Real Time Data
Scalability
Performance Tuning
Command-Line Interface
Distributed Computing
Data Loss Prevention
Content Strategy
Process Optimization
Grafana
System Monitoring
System Configuration

Process Real-Time Data with Spark Streams

Course 2, 5 hours

What you'll learn

Explain the execution model of Spark Structured Streaming and build a simple pipeline from a file source to a console sink.
Develop streaming pipelines that integrate with Kafka, apply event-time processing with watermarks, and write reliable outputs to Delta Lake.
Build an end-to-end Spark streaming pipeline that can be deployed in real-world production environments.

Skills you'll gain

Apache Spark
Apache Kafka
Real Time Data
Data Transformation
Data Processing
Data Integrity
PySpark
Event Management
Event Monitoring
Data-Driven Decision-Making
Data Pipelines
Scalability
JSON

Build Real-Time Dashboards with Spark

Course 3, 4 hours

What you'll learn

Explain Spark’s streaming model and produce a dashboard-ready table from a simple file source.
Construct a real-time pipeline that ingests from Kafka, processes with Spark, and stores results in Delta using event-time windows and watermarks.
Operate a production-oriented dashboard with refresh policies, monitoring, and failure recovery.

Skills you'll gain

Data Persistence
Dashboard
Continuous Monitoring
JSON
Real Time Data
Data Pipelines
Apache Kafka
Scalability
Data Integrity
Business Intelligence
Business Metrics
Apache Spark
PySpark

Stream & Unify Data Schemas with CDC

Course 4, 5 hours

What you'll learn

Explain CDC fundamentals (binlog/WAL) and schema evolution strategies.
Configure a Schema Registry pipeline locally using Debezium and Kafka.
Use streaming SQL (Flink/ksqlDB) to map, cast, and merge divergent schemas into a canonical model.

Skills you'll gain

Data Validation
Data Storage Technologies
Apache Kafka
Data Modeling
Data Integrity
Data Pipelines
Data Capture
Continuous Monitoring
Database Design
PostgreSQL
Continuous Integration
Data Transformation
Software Versioning
Data Mapping
Real Time Data
SQL
Schematic Diagrams
Query Languages

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructors

Coursera

0 courses, 0 learners

Offered by

Coursera

Why do learners on Coursera choose us for their careers?

Felipe M.

Learner since 2018

"Being able to take courses at my own pace has been an extraordinary experience. I can learn whenever my schedule allows and according to my mood."

Jennifer J.

Learner since 2020

"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

Larry W.

Learner since 2021

"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

Chaitanya A.

"Learning isn't just about getting better at your job: it's so much more than that. Coursera lets me learn without limits."

Frequently Asked Questions

This course is completely online, so there’s no need to show up to a classroom in person. You can access your lectures, readings and assignments anytime and anywhere via the web or your mobile device.

If you subscribed, you get a 7-day free trial during which you can cancel at no penalty. After that, we don’t give refunds, but you can cancel your subscription at any time. See our full refund policy.

Yes! To get started, click the course card that interests you and enroll. You can enroll and complete the course to earn a shareable certificate. When you subscribe to a course that is part of a Specialization, you’re automatically subscribed to the full Specialization. Visit your learner dashboard to track your progress.

More questions

Visit the Learner Help Center
