Apache Hadoop

Apache Hadoop is an open-source software framework used for distributed storage and processing of large datasets across clusters of computers. Coursera's Apache Hadoop catalogue teaches you about the core concepts and components of this powerful framework. You'll learn about Hadoop's architecture, its key components like Hadoop Distributed File System (HDFS) and MapReduce, as well as advanced topics such as data ingestion with tools like Flume and Sqoop. You will also delve into data processing using Hive and Pig, and explore scalable machine learning algorithms. By mastering Apache Hadoop, you will be equipped to handle big data challenges, contributing to business insights and decision making.
29credentials
2online degrees
72courses

Explore the Hadoop Course Catalog

  • Status: Free Trial

    Skills you'll gain: Big Data, Apache Hadoop, Data Infrastructure, Data Processing, Analytics, Data Science, Distributed Computing, Linux, Software Installation, Scalability, System Configuration

  • Status: Free Trial

    Skills you'll gain: Data Pipelines, Apache Hadoop, Extract, Transform, Load, Data Transformation, Apache Hive, Data-Driven Decision-Making, Big Data, Data Warehousing, Apache Spark, Data Integration, Data Processing, Data Management, Data Analysis, Scalability

  • Status: New
    Status: Free Trial

    Skills you'll gain: Apache Spark, Apache Hadoop, Data Lakes, Big Data, Linux Commands, Linux, File Systems, Data Management, Command-Line Interface, Data Processing, Software Installation, Distributed Computing, System Configuration

  • Status: Free Trial

    Skills you'll gain: Apache Kafka, Apache Spark, Apache Hadoop, Distributed Computing, Dataflow, Java Programming, Java, Middleware, Scala Programming, Data Structures, System Programming, Programming Principles, Servers, Application Frameworks, Debugging, Algorithms, Performance Tuning, Network Protocols, Computer Science

  • Status: Free Trial

    Skills you'll gain: Data Visualization Software, PySpark, Data Visualization, Snowflake Schema, Data Storytelling, Site Reliability Engineering, Docker (Software), Databricks, Containerization, Interactive Data Visualization, Plotly, Data Pipelines, Kubernetes, Matplotlib, Dashboard, Apache Spark, Apache Hadoop, Big Data, Data Science, Python Programming

  • Skills you'll gain: Apache Spark, Managed Services, Google Cloud Platform, Big Data, Apache Hadoop, Data Management, Servers

  • Status: Free Trial

    Skills you'll gain: Apache Spark, Machine Learning, Generative AI, PySpark, Applied Machine Learning, Supervised Learning, Apache Hadoop, Data Pipelines, Unsupervised Learning, Feature Engineering, Data Processing, Extract, Transform, Load, Predictive Modeling, Data Transformation, Regression Analysis

  • Status: Free Trial

    Skills you'll gain: Apache Airflow, Data Modeling, Data Pipelines, Data Storage, Data Architecture, Requirements Analysis, Data Processing, Data Warehousing, Query Languages, Apache Hadoop, Extract, Transform, Load, Data Lakes, Amazon Web Services, File Systems, Apache Spark, Database Systems, Feature Engineering, Data Integration, AWS Kinesis, Data Management

  • Status: Free Trial

    University of California San Diego

    Skills you'll gain: Big Data, Apache Hadoop, Scalability, Data Processing, Data Science, Distributed Computing, Unstructured Data, Data Infrastructure, Data Analysis

  • Status: Free

    Amazon Web Services

    Skills you'll gain: Apache Hadoop, Apache Spark, Big Data, Amazon Web Services, Business Intelligence, Amazon S3, Command-Line Interface, Data-Driven Decision-Making, Data Processing, Analytics

  • Status: New
    Status: Preview

    O.P. Jindal Global University

    Skills you'll gain: Big Data, Apache Spark, Apache Hadoop, Apache Hive, Databases, Analytics, Data Storage Technologies, Data Mining, NoSQL, Applied Machine Learning, Real Time Data, Distributed Computing, SQL, Data Processing, Query Languages, Scripting Languages

  • Status: New
    Status: Free Trial

    Skills you'll gain: Apache Hadoop, Big Data, Data Infrastructure, Cloud Management, Data Processing, Data Storage, Java, Operating System Administration, Systems Administration, Distributed Computing, Command-Line Interface