Whitepapers

What to read if you actually want to understand Paxos

Not every paper here solves the same problem. Some help you understand Paxos itself, some help once you start thinking about real systems, and some explain why consensus is such a pain in the first place.

Suggested order

Read by purpose, not by publication date

If you read these in the wrong order, Paxos gets harder than it needs to be.

Recommended order

If this is your first real pass through Paxos, start here instead of jumping straight into Lamport and wondering why everyone pretends the original papers are easy reads.

  1. 01
    Paxos Made Moderately Complex

    Get the roles, message flow, and the shape of the system.

  2. 02
    Paxos Made Simple

    Come back once the moving parts already make sense.

  3. 03
    The Part-Time Parliament or FLP

    Read one for history, the other for the theory underneath it.

01

Paxos papers

Lamport's own papers. Important, but not where I would tell most people to start.

Original text Historical

The Part-Time Parliament

Leslie Lamport

The original Paxos paper. Historically important, full of the parliament metaphor, and much harder to learn from than its reputation would suggest.

Read it for: the source material, the metaphor, and the history.

Read the original ↗
Shorter Proof-first

Paxos Made Simple

Leslie Lamport

The shorter retelling. Cleaner than the original, but it still leaves a lot of the intuition up to the reader.

Read it for: the stripped down statement of the algorithm once you already know the cast.

Read the “simple” version ↗
02

From theory to practice

The useful stuff once you stop asking "what is Paxos?" and start asking "how would this actually run?"

Start here Implementation bridge

Paxos Made Moderately Complex

Robbert van Renesse and Deniz Altinbuken

The paper I would hand most people first. It turns Paxos from an abstract proof object into something you can actually picture running.

Read it for: the implementation view.

Read the bridge paper ↗
Operational Failure-focused

Paxos Made Practical

David Mazières

A pragmatic look at what starts breaking once Paxos stops being a paper exercise and turns into an actual deployed system.

Read it for: operations, failure modes, and the annoying bits the cleaner papers skip.

Read the practical take ↗
03

Neighboring algorithms and extensions

Good comparison papers if you want to see what is special about Paxos and what is just distributed systems showing up again.

Contrast Understandability

In Search of an Understandable Consensus Algorithm (Raft)

Diego Ongaro and John Ousterhout

The obvious contrast case. It was written to be understandable on purpose, which makes it useful even if you are here to learn Paxos, not switch sides.

Read it for: a comparison point built around clarity.

Read Raft ↗
Sibling Replication lineage

Viewstamped Replication

Brian Oki and Barbara Liskov

Another foundational replication line. Useful for separating what is specifically Paxos from what is common to this whole family of systems.

Read it for: broader context around replication.

Read Viewstamped Replication ↗
Extension Membership changes

Reconfiguring a State Machine

Leslie Lamport, Dahlia Malkhi, and Lidong Zhou

A focused tutorial on one of the harder practical problems: changing cluster membership without breaking safety.

Read it for: reconfiguration.

Read about reconfiguration ↗
04

Foundational antecedents

The theory paper here because consensus did not become painful by accident.

Bedrock Impossibility

Fischer-Lynch-Paterson (FLP) Impossibility

Michael Fischer, Nancy Lynch, and Michael Paterson

The impossibility result under the whole field. It explains why consensus papers keep making timing assumptions and awkward tradeoffs instead of just "solving" the problem cleanly.

Read it for: the theoretical background.

Read FLP ↗