Johns Hopkins University
Large-Scale Database Systems Specialization
Johns Hopkins University

Large-Scale Database Systems Specialization

Master Distributed Databases and Cloud Analytics. Gain advanced skills in distributed database systems, cloud computing, data reliability, and machine learning to design and optimize large-scale data solutions.

David Silberberg

Instructor: David Silberberg

Access provided by PSGR Krishnammal College for Women

Get in-depth knowledge of a subject
Intermediate level

Recommended experience

12 weeks to complete
at 5 hours a week
Flexible schedule
Learn at your own pace
Get in-depth knowledge of a subject
Intermediate level

Recommended experience

12 weeks to complete
at 5 hours a week
Flexible schedule
Learn at your own pace

What you'll learn

  • Master database systems, including transaction management, query optimization, and data warehousing principles for large-scale environments.

  • Develop proficiency in cloud computing concepts, using Hadoop and Accumulo for data processing and storage in distributed systems.

  • Apply machine learning techniques such as clustering and collaborative filtering to analyze big data and enhance system reliability.

Details to know

Shareable certificate

Add to your LinkedIn profile

Taught in English

See how employees at top companies are mastering in-demand skills

 logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Advance your subject-matter expertise

  • Learn in-demand skills from university and industry experts
  • Master a subject or tool with hands-on projects
  • Develop a deep understanding of key concepts
  • Earn a career certificate from Johns Hopkins University

Specialization - 3 course series

What you'll learn

  • Understand distributed database architectures and translate SQL queries into relational algebra expressions for optimization.

  • Apply horizontal partitioning techniques to enhance database performance and scalability based on query patterns.

  • Implement vertical partitioning strategies to improve query response times and manage data more efficiently in distributed systems.

Skills you'll gain

Category: Distributed Computing
Category: Database Design
Category: Database Architecture and Administration
Category: Database Systems
Category: Scalability
Category: Relational Databases
Category: Performance Tuning
Category: Data Integrity
Category: Database Management Systems
Category: SQL
Category: Database Management
Category: Query Languages
Category: Databases

What you'll learn

  • Learners will gain skills in implementing database security through views, dynamic authorization, and semantic integrity rules.

  • Students will understand query optimization techniques, cost evaluation, and the creation of optimized query plans to enhance database performance.

  • Learners will master distributed query optimization, using cost models, MapReduce, and HDFS for efficient data storage, compression, and processing.

Skills you'll gain

Category: Distributed Computing
Category: Data Storage Technologies
Category: Big Data
Category: Data Access
Category: Query Languages
Category: Apache Hadoop
Category: Data Processing
Category: Algorithms
Category: Performance Tuning
Category: Data Integrity
Category: SQL
Category: Database Architecture and Administration
Category: Databases

What you'll learn

  • Learn transaction management principles, including ACID properties, concurrency control, and deadlock management techniques for distributed systems.

  • Explore reliability protocols, recovery algorithms, and commit protocols like ARIES, ensuring data consistency and durability.

  • Understand cloud computing with Hadoop, utilizing MapReduce for large-scale data processing, and apply machine learning techniques like clustering.

Skills you'll gain

Category: Transaction Processing
Category: Apache Hadoop
Category: Cloud Computing
Category: Scalability
Category: Machine Learning
Category: Data Warehousing
Category: Big Data
Category: Data Processing
Category: Database Architecture and Administration
Category: Distributed Computing
Category: Disaster Recovery
Category: Relational Databases
Category: Data Integrity
Category: Database Systems

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructor

David Silberberg
Johns Hopkins University
3 Courses722 learners

Offered by

Why people choose Coursera for their career

Felipe M.
Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
Jennifer J.
Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
Larry W.
Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
Chaitanya A.
"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."