Distributed Databases

Farida Aliyeva
3 min readDec 27, 2020

Servus, my dear loyal fans, and today we are going to talk about Distributed Databases (DDBMS). The key concept of DDBMS is that there are a number of databases that may be geographically distributed all over the world. A distributed DBMS manages the distributed database in a manner so that it appears as one single database to all its users. Later on, we will discuss its advantages.

A distributed database is a collection of multiple interconnected databases, which are spread physically across various locations that communicate via a computer network. A centralized distributed database management system integrates data logically in a way that it can be managed as if it were all stored in the same location. The DDBMS synchronizes all the data periodically and ensures that data updates and deletes performed at one location will be automatically reflected in the data stored elsewhere. However, a centralized database consists of a single database file located at one site using a single network.

Features

To better understand the features of DDBMS we compared them to centralized databases. And here are the results:

DDBMS is used to create, retrieve, update, and delete distributed databases, ensures that the modified data at any site is universally updated. It is used in application areas where large volumes of data are processed and accessed by numerous users simultaneously. Mainly designed for heterogeneous database platforms and maintains confidentiality and data integrity of the databases.

Advantages

Modular Development

DDBMS is capable of modular development, which means that systems can be expanded by adding new computers and local data to the new site and connecting them to the distributed system without interruption in the current functions.

Reliability

In DDBMS when a component fails in the system will continue to function at reduced performance until the error is fixed. However, when failures occur in centralized databases, the system comes to a complete stop.

Better Response

A Faster response can be achieved if data is distributed in an efficient manner which implies the user requests to be met from local data itself. On the other hand, in centralized systems, all queries have to pass through the central computer for processing, which increases the response time.

Low Communication Cost

Administrators can achieve lower communication costs for distributed database systems if the data is located close to where it is used the most. However, in centralized systems, this is not possible

Why move to DDBMS

Need to share the data

Multiple organizations/ departments often need to communicate with each other and share their data and resources. This demands common databases or replicated databases that should be used in a synchronized manner. And DDBMS is the best choice for that.

Supports both OLTP and OLAP

Online Transaction Processing (OLTP) and Online Analytical Processing (OLAP) work with diversified systems that may have common data. Distributed database systems help both these processes by providing synchronized data.

Database Recovery

One of the common techniques used in DDBMS is the replication of data across different sites. Replication of data automatically helps in data recovery if the database in any site is damaged. Users can access data from other sites while the damaged site is being reconstructed. Thus, database failure may become almost invisible to users.

Thank you for your precious time:)
References:
https://www.tutorialspoint.com/distributed_dbms/distributed_dbms_databases.htm

--

--

Farida Aliyeva

Data Scientist at SDH | MS Graduate in Computer Science and Data Analytics