become our partner

Failures are prohibited: how we made automatic failover

The Program Committee has not yet taken a decision on this talk

Photo
Igor Zolotarev

VK, Tarantool

Abstracts

Cartridge is a tool for managing distributed applications based on Tarantool NoSQL Database. One of the main requirements for our applications is fault tolerance. The load from fallen nodes in the cluster should be switched to live ones. This does not seem like a difficult task, but there are many pitfalls in practice.

In Cartridge, this problem is solved by an automatic failover. I will talk about the history of its development and implementation features, describe the problems we encountered during its maintenance, and share valuable ideas on the development of similar systems.