Papers

Foundational papers in distributed systems, databases, and computer science that I've read and found worth annotating.

Serverless Computing: One Step Forward, Two Steps Back

Hellerstein et al.

A balanced critique of the serverless model. Made me think harder about when Lambda is the right tool and when it isn't.

serverless, systems
UC Berkeley, 2018
In Search of an Understandable Consensus Algorithm (Raft)

Diego Ongaro, John Ousterhout

Required reading after getting my CKA. Understanding Raft made etcd and Kubernetes leader election finally click.

consensus, distributed
Stanford, 2014
Dynamo: Amazon's Highly Available Key-value Store

DeCandia et al.

The paper that inspired DynamoDB, which borrows its ideas but is a separate system. Explains consistent hashing, vector clocks, and eventual consistency better than any textbook I've found.

systems, distributed, storage
Amazon, 2007
MapReduce: Simplified Data Processing on Large Clusters

Jeffrey Dean, Sanjay Ghemawat

The paper that made large-scale parallelism feel tractable. Fundamental reading for anyone working with cloud-scale data pipelines.

systems, distributed
Google, 2004