20 lines
978 B
Markdown
20 lines
978 B
Markdown
---
|
|
title: Fault Tolerance
|
|
---
|
|
## Fault Tolerance
|
|
|
|
Fault tolerance is the property that enables a system to continue its intended operation, possibly at a reduced level, rather than failing completely, when some portion of the system fails.
|
|
|
|
A **database** is fault-tolerant when it can access a secondary shard when the primary is unavailable.
|
|
|
|
This is achieved through:
|
|
|
|
* Database Replication
|
|
* Fault Detection & Failover
|
|
|
|
A database that maintains multiple copies of all data in different physical nodes located across independent physical sub-systems, such as server racks and network routers, has a higher probability to continue operating when the primary copy of the data is unavailable due to its ability to read data from multiple replications.
|
|
|
|
In large-scale distribution systems, it becomes increasingly important to have reliable failure detection systems that can identify failing storage drives and provide failover units in order to maximise service uptime.
|
|
|
|
|