
Gossip Protocol
- Shreyas Mishra
- Distributed systems
- March 10, 2025
Gossip
Introduction to gossip protocol
Why do we need a gossip protocol?
Centralized Coordination Overhead
- Centralized coordinators create bottlenecks and single points of failure. A leader failure paralyzes the system until a new leader is elected.
Inefficient Failure Detection
- Heartbeat-based monitoring requires constant communication, which consumes excessive bandwidth and can overload the network. Sending heartbeats less often saves bandwidth but slows the detection of node failures, which leads to stale data and service disruptions.
Unreliable Multicast Communication
- Multicast protocols struggle with packet loss, network partitions, and scalability in large clusters. A multicast message to 1,000 nodes could overload the network and still fail to reach all nodes.
Slow State Synchronization
- Manual or slow periodic reconciliation of node states causes delays and inconsistencies. Nodes that operate on outdated data risk decision-making errors (e.g., duplicate transactions).
How the gossip protocol works
- Nodes communicate with randomly selected peers.
- Each node operates with limited local knowledge of the system.
- Communication occurs at regular intervals.
- The size of transmitted data is limited per gossip round.
- The protocol assumes network paths may fail.
- Interactions are infrequent to reduce overhead.
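The properties above can be illustrated with a small simulation. This is only a sketch under stated assumptions: it models push-style gossip in which every informed node forwards the update to a fixed number of random peers per round, and the names `gossip_round`, `simulate`, and `fanout` are made up for this example rather than taken from any particular implementation.

```python
import random


def gossip_round(informed, n, fanout, rng):
    """Run one gossip round.

    Every informed node pushes the update to `fanout` randomly
    selected peers (never itself). Returns the new informed set.
    Assumes n > fanout.
    """
    newly_informed = set()
    for node in sorted(informed):
        # Draw one extra candidate so we can drop `node` if it was picked.
        candidates = rng.sample(range(n), fanout + 1)
        peers = [p for p in candidates if p != node][:fanout]
        newly_informed.update(peers)
    return informed | newly_informed


def simulate(n=1000, fanout=3, seed=42, max_rounds=100):
    """Spread one update from node 0 and count rounds until all nodes have it."""
    rng = random.Random(seed)  # seeded so runs are reproducible
    informed = {0}
    rounds = 0
    while len(informed) < n and rounds < max_rounds:
        informed = gossip_round(informed, n, fanout, rng)
        rounds += 1
    return rounds, len(informed)


if __name__ == "__main__":
    rounds, reached = simulate()
    print(f"reached {reached} nodes in {rounds} rounds")
```

No node needs a global view here: each one only picks random peers, and the per-round cost per node is bounded by `fanout`. In this model the informed set can grow by at most a factor of `fanout + 1` per round, so spreading to all nodes takes at least a logarithmic number of rounds.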
Types of gossip protocol
- Anti-entropy
  - Nodes periodically compare their entire dataset with other nodes to identify and repair inconsistencies.
- Rumor mongering
  - Nodes share only the latest updates instead of the full dataset.
  - Frequent gossip cycles might flood the network.
- Aggregation Gossip Protocol
- Dissemination Protocol Variants:
  - Event Dissemination Protocols: Nodes report events through gossip, but the gossip itself runs periodically; an event does not directly trigger a gossip round.
  - Background Data Dissemination Protocols: Nodes continuously gossip about node-associated information, which suits environments where propagation latency isn't critical.
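The difference between anti-entropy and rumor mongering can be sketched in a few lines. This is a minimal illustration, not a reference implementation: it assumes each key carries a monotonically increasing version number and that the higher version wins, and the names `State`, `merge`, `anti_entropy`, and `rumor` are hypothetical.

```python
from typing import Dict, Tuple

# Assumed data model: key -> (version, value); a higher version is newer.
State = Dict[str, Tuple[int, str]]


def merge(local: State, remote: State) -> State:
    """Return a copy of `local` updated with every newer entry from `remote`."""
    merged = dict(local)
    for key, (version, value) in remote.items():
        if key not in merged or version > merged[key][0]:
            merged[key] = (version, value)
    return merged


def anti_entropy(a: State, b: State) -> Tuple[State, State]:
    """Exchange the full datasets of two nodes; both end up identical."""
    merged = merge(a, b)
    return merged, dict(merged)


def rumor(recent_updates: State, receiver: State) -> State:
    """Apply only the sender's latest updates instead of its full dataset."""
    return merge(receiver, recent_updates)


if __name__ == "__main__":
    node_a = {"x": (1, "old"), "y": (2, "keep")}
    node_b = {"x": (3, "new"), "z": (1, "extra")}
    print(anti_entropy(node_a, node_b))
    print(rumor({"x": (3, "new")}, node_a))
```

Anti-entropy ships the whole dataset, so it repairs any divergence at a cost proportional to the data size. Rumor mongering ships only the recent updates, which is cheaper per exchange but cannot fix entries that were never included in an update.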
Epidemic-based protocols
- SWIM (Scalable Weakly-consistent Infection-style Process Group Membership)
- Serf (by HashiCorp, built on top of SWIM)