• For Individuals
  • For Businesses
  • For Universities
  • For Governments
Degrees
​
Log In
Join for Free
  • Browse
  • Data Preparation Techniques Using Databricks

Results for "data preparation techniques using databricks"


  • Status: New
    New
    Status: Free Trial
    Free Trial
    C

    Coursera

    Optimizing Spark and Cloud Data Storage for Analytics

    Skills you'll gain: Cloud Security, Apache Spark, Transaction Processing, Data Lakes, PySpark, Data Security, Data Infrastructure, Performance Tuning, Cloud Computing, Cloud Computing Architecture, Amazon S3, Cloud Storage, Data Storage Technologies, Data Storage, Cloud Deployment, Data Warehousing, Data Management, Infrastructure Architecture, Data Integrity, Infrastructure as Code (IaC)

    Beginner · Course · 1 - 3 Months

  • Status: New
    New
    Status: Free Trial
    Free Trial
    C

    Coursera

    Building Automated Data Pipelines with Spark,dbt,and Airflow

    Skills you'll gain: Data Flow Diagrams (DFDs), Apache Airflow, Data Pipelines, Data Modeling, Data Integration, Data Architecture, Data Warehousing, Apache Spark, Extract, Transform, Load, Database Development, Data Processing, Data Transformation, Data Quality, Data Validation, Configuration Management, Enterprise Security

    Beginner · Course · 1 - 3 Months

  • Status: Free Trial
    Free Trial
    C

    Coursera

    Building Smarter Data Pipelines: SQL, Spark, Kafka & GenAI

    Skills you'll gain: Apache Kafka, Data Warehousing, Extract, Transform, Load, Microsoft SQL Servers, Snowflake Schema, Star Schema, Performance Tuning, Data Pipelines, Cloud Computing Architecture, Business Intelligence, Real Time Data, Apache Hadoop, Data Modeling, Data Quality, Responsible AI, Apache Spark, SQL, Generative AI, Data Governance, Quality Management

    4.3
    Rating, 4.3 out of 5 stars
    ·
    96 reviews

    Intermediate · Specialization · 1 - 3 Months

  • Status: New
    New
    Status: Free Trial
    Free Trial
    C

    Coursera

    Data Engineering & Pipeline Reliability for Machine Learning

    Skills you'll gain: Data Preprocessing, Data Cleansing, Data Pipelines, Feature Engineering, Data Quality, MLOps (Machine Learning Operations), Extract, Transform, Load, Data Transformation, Dataflow, Data Integrity, Apache Airflow, Data Validation, Version Control, Package and Software Management, Git (Version Control System), Quality Assurance, Exploratory Data Analysis, Cost Management, Resource Utilization, Virtual Environment

    Intermediate · Course · 1 - 3 Months

  • Status: Free Trial
    Free Trial
    I

    IBM

    Introduction to Big Data with Spark and Hadoop

    Skills you'll gain: Apache Hadoop, Apache Spark, PySpark, Apache Hive, Big Data, IBM Cloud, Kubernetes, Docker (Software), Scalability, Data Processing, Development Environment, Distributed Computing, Performance Tuning, Data Transformation, Debugging

    4.4
    Rating, 4.4 out of 5 stars
    ·
    478 reviews

    Intermediate · Course · 1 - 3 Months

  • Status: Free Trial
    Free Trial
    D

    Duke University

    Databricks to Local LLMs

    Skills you'll gain: Databricks, Model Deployment, Generative AI, Data Lakes, Extract, Transform, Load, MLOps (Machine Learning Operations), Data Transformation, Data Pipelines, Hugging Face, Large Language Modeling, Responsible AI, Analytics, Data Analysis, Data Processing, Data Science, Machine Learning

    3.9
    Rating, 3.9 out of 5 stars
    ·
    12 reviews

    Beginner · Course · 1 - 4 Weeks

  • Status: New
    New
    Status: Free Trial
    Free Trial
    C

    Coursera

    Automate, Optimize, and Benchmark Data Pipelines

    Skills you'll gain: Performance Analysis, Performance Testing, Performance Measurement, Benchmarking, Data Modeling, Data Processing, Extract, Transform, Load, Data-Driven Decision-Making, Statistical Analysis

    Advanced · Course · 1 - 4 Weeks

  • Status: Free Trial
    Free Trial
    M

    Microsoft

    Data Preparation and Evaluation with Copilot

    Skills you'll gain: Microsoft Copilot, Data Quality, Anomaly Detection, Generative Adversarial Networks (GANs), Data Ethics, Generative AI, Data Pipelines, Data Cleansing, Data Preprocessing, Data Synthesis, Data Validation, Responsible AI, Natural Language Processing

    4.8
    Rating, 4.8 out of 5 stars
    ·
    17 reviews

    Beginner · Course · 1 - 4 Weeks

  • Status: New
    New
    Status: Free Trial
    Free Trial
    P

    Pragmatic AI Labs

    Production Governance and MLOps on Databricks

    Skills you'll gain: Databricks, Role-Based Access Control (RBAC), MLOps (Machine Learning Operations), Data Lakes, Data Governance, GitHub, Model Deployment, Data Management, CI/CD, Data Quality, Git (Version Control System), Continuous Integration, Data Engineering, Continuous Monitoring, Python Programming, Command-Line Interface

    Intermediate · Course · 1 - 4 Weeks

  • Status: Free Trial
    Free Trial
    W

    Whizlabs

    Exam Prep DP-100: Microsoft Azure Data Scientist Associate

    Skills you'll gain: Model Deployment, Responsible AI, Statistical Modeling, Microsoft Azure, MLOps (Machine Learning Operations), Statistical Methods, Prompt Engineering, Data Science, Cloud Deployment, Retrieval-Augmented Generation, Artificial Intelligence and Machine Learning (AI/ML), Cloud Management, Model Evaluation, Data Management, AI Workflows, Azure Synapse Analytics, Cloud Computing, Data Pipelines, Continuous Monitoring, Machine Learning

    Intermediate · Specialization · 1 - 3 Months

  • Status: New
    New
    Status: Free Trial
    Free Trial
    C

    Coursera

    Transform, Analyze, and Optimize Your Data

    Skills you'll gain: Data Transformation, Operational Databases, Database Management, Azure Synapse Analytics, Database Design, Data Wrangling, Data Architecture, Apache Cassandra, Apache Hive, Amazon Redshift

    Advanced · Course · 1 - 4 Weeks

  • Status: Free Trial
    Free Trial
    D

    Duke University

    Applied Python Data Engineering

    Skills you'll gain: Data Visualization Software, PySpark, Data Visualization, Data Storytelling, Statistical Visualization, Site Reliability Engineering, Docker (Software), Databricks, Containerization, Interactive Data Visualization, Plot (Graphics), Plotly, Data Pipelines, Matplotlib, Kubernetes, Apache Spark, Apache Hadoop, Big Data, Data Science, Python Programming

    3.8
    Rating, 3.8 out of 5 stars
    ·
    120 reviews

    Intermediate · Specialization · 1 - 3 Months

1234…834

In summary, here are 10 of our most popular data preparation techniques using databricks courses

  • Optimizing Spark and Cloud Data Storage for Analytics: Coursera
  • Building Automated Data Pipelines with Spark,dbt,and Airflow: Coursera
  • Building Smarter Data Pipelines: SQL, Spark, Kafka & GenAI: Coursera
  • Data Engineering & Pipeline Reliability for Machine Learning: Coursera
  • Introduction to Big Data with Spark and Hadoop: IBM
  • Databricks to Local LLMs: Duke University
  • Automate, Optimize, and Benchmark Data Pipelines: Coursera
  • Data Preparation and Evaluation with Copilot: Microsoft
  • Production Governance and MLOps on Databricks: Pragmatic AI Labs
  • Exam Prep DP-100: Microsoft Azure Data Scientist Associate: Whizlabs

Other topics to explore

Arts and Humanities
338 courses
Business
1095 courses
Computer Science
668 courses
Data Science
425 courses
Information Technology
145 courses
Health
471 courses
Math and Logic
70 courses
Personal Development
137 courses
Physical Science and Engineering
413 courses
Social Sciences
401 courses
Language Learning
150 courses

Coursera Footer

Skills

  • Accounting
  • Artificial Intelligence (AI)
  • Cybersecurity
  • Data Analytics
  • Digital Marketing
  • Human Resources (HR)
  • Microsoft Excel
  • Project Management
  • Python
  • SQL

Professional Certificates

  • Google AI Certificate
  • Google Cybersecurity Certificate
  • Google Data Analytics Certificate
  • Google IT Support Certificate
  • Google Project Management Certificate
  • Google UX Design Certificate
  • IBM AI Engineering Certificate
  • IBM AI Product Manager Certificate
  • IBM Data Science Certificate
  • Intuit Academy Bookkeeping Certificate

Courses & Specializations

  • AI Essentials Specialization
  • AI For Business Specialization
  • AI For Everyone Course
  • AI in Healthcare Specialization
  • Deep Learning Specialization
  • Excel Skills for Business Specialization
  • Financial Markets Course
  • Machine Learning Specialization
  • Prompt Engineering for ChatGPT Course
  • Python for Everybody Specialization

Career Resources

  • Career Aptitude Test
  • CAPM Certification Requirements
  • CompTIA A+ Certification Requirements
  • CompTIA Security+ Certification Requirements
  • Essential IT Certifications
  • Free IT Certifications and Courses
  • High-Income Skills to Learn
  • How to Learn Artificial Intelligence
  • PMP Certification Requirements
  • Popular Cybersecurity Certifications

Coursera

  • About
  • What We Offer
  • Leadership
  • Careers
  • Catalog
  • Coursera Plus
  • Professional Certificates
  • MasterTrack® Certificates
  • Degrees
  • For Enterprise
  • For Government
  • For Campus
  • Become a Partner
  • Social Impact
  • Free Courses
  • Share your Coursera learning story

Community

  • Learners
  • Partners
  • Beta Testers
  • Blog
  • The Coursera Podcast
  • Tech Blog

More

  • Press
  • Investors
  • Terms
  • Privacy
  • Help
  • Accessibility
  • Contact
  • Articles
  • Directory
  • Affiliates
  • Modern Slavery Statement
  • Do Not Sell/Share
Learn Anywhere
Download on the App Store
Get it on Google Play
Logo of Certified B Corporation
© 2026 Coursera Inc. All rights reserved.
  • Coursera Facebook
  • Coursera Linkedin
  • Coursera Twitter
  • Coursera YouTube
  • Coursera Instagram
  • Coursera TikTok