Back to thoughts

The Blue-Red Button Problem

The blue-red button problem looks, at first, like a test of nerve.

Everyone privately chooses a button. If more than half choose blue, everyone lives. If not, the blue voters die and the red voters survive.

So red looks like the safe choice. It guarantees your own survival. Blue looks like a beautiful but reckless vote of trust.

I think that framing is wrong.

The problem is not mainly about courage, altruism, or whether strangers deserve your faith. It is a problem about scale. Once you look at it that way, blue stops being the sentimental answer and starts being the mathematically sane one.

The three-person version

Start with only three people: you and two others. Assume the other two each choose red or blue with equal probability.

If you press red, there are four equally likely worlds:

  • RR: 0 blue voters, 3 red voters. Nobody dies.
  • RB: 1 blue voter, 2 red voters. 1 person dies.
  • BR: 1 blue voter, 2 red voters. 1 person dies.
  • BB: 2 blue voters, 1 red voter. Nobody dies.

Expected deaths: 0.5.

If you press blue:

  • RR: 1 blue voter, 2 red voters. You die. 1 death.
  • RB: 2 blue voters, 1 red voter. Nobody dies.
  • BR: 2 blue voters, 1 red voter. Nobody dies.
  • BB: 3 blue voters. Nobody dies.

Expected deaths: 0.25.

Already, with only three people, blue produces fewer expected deaths. That is the tiny version of the whole story.

The cost of blue is bounded

The argument for red usually goes like this: if enough people choose red, blue voters die, so choosing blue risks your life.

True. But notice what the risk can cost.

In the worlds where red wins no matter what you do, pressing blue adds exactly one death: yours. That is bad, obviously. But it is bounded. It does not become ten deaths, or a million deaths, just because the group gets larger.

Under a neutral 50:50 prior, red wins without you in roughly half of all worlds. So the expected personal cost of choosing blue stays around 0.5 deaths. It does not scale with the size of the population.

The benefit of blue scales

Your vote matters only in the pivotal world: the world where everyone else is exactly tied.

That sounds rare, and it is. But when it happens, your vote does not save one person. It flips the majority. It turns a red-win world, where every blue voter dies, into a blue-win world, where everyone lives.

For N people, that means the benefit in the pivotal case is roughly N/2 lives.

The chance of being pivotal shrinks as N grows. For N-1 other voters choosing independently at 50:50, the probability that they are exactly tied is:

P(pivotal) = C(N − 1, (N − 1) / 2) · (1 / 2)(N − 1)

By Stirling's approximation, this behaves like:

√(2 / (πN))

That probability gets smaller, but only like 1/√N. The payoff in the pivotal case grows like N. Multiply them together and the expected gross benefit grows like √N.

That is the part our intuition hates. One vote feels powerless in a huge crowd. But the tiny chance of decisiveness is attached to an enormous consequence.

The net calculation

For odd N, the expected advantage of pressing blue instead of red is approximately:

(N / 2) · P(pivotal) − 0.5

The first term is the chance that your vote flips the outcome, multiplied by the number of lives saved when it does. The -0.5 term is the bounded expected cost of joining the losing side.

Some concrete exact values:

  • N = 3: blue saves 0.25 expected lives.
  • N = 5: blue saves 0.44 expected lives.
  • N = 11: blue saves 0.85 expected lives.
  • N = 101: blue saves about 3.5 expected lives.
  • N = 10,001: blue saves about 40 expected lives.

The cost stays roughly constant. The benefit keeps growing.

Why red feels more rational than it is

Red feels rational because it is individually safe. It gives you a clean guarantee: no matter what happens, you survive.

But guarantees are not the same thing as good decisions. A guaranteed personal survival strategy can still be a bad expected-value strategy when it predictably increases the number of deaths.

The red argument also quietly treats your vote as causally irrelevant. If the crowd chooses blue, you live anyway. If the crowd chooses red, you would have died by choosing blue. So why not free-ride?

Because sometimes the crowd is not already decided. Sometimes the margin is exactly where you stand. In that world, red is not harmless self-protection. It is the vote that makes everyone who trusted blue die.

What if we do not know the distribution?

The 50:50 assumption is clean, but it is also generous to the symmetry of the problem. What if we think people are somewhat biased toward red?

Let p be the probability that any other person chooses blue. For N = 2r + 1 total people, there are 2r other voters. If p is known, the expected advantage of blue over red is:

r · P(K = r) − P(K < r)

where K is the number of blue voters among the other 2r people.

If p itself is uncertain, we average that expression over our prior:

Ep[r · P(K = r | p) − P(K < r | p)]

Now suppose we are pessimistic and assume p is uniformly distributed between 0.4 and 0.5. In other words: we give red an advantage in almost every possible world, and merely allow that blue might be close to tied.

Blue still wins in expected lives:

  • N = 3: blue saves about 0.19 expected lives.
  • N = 5: blue saves about 0.34 expected lives.
  • N = 11: blue saves about 0.65 expected lives.
  • N = 101: blue saves about 1.59 expected lives.
  • N = 10,001: blue saves about 1.52 expected lives.

The key is that the prior still puts real probability mass near the tied case. Red can be favored overall, but the rare worlds near 50:50 are exactly where your vote has enormous leverage.

Even with a wider pessimistic prior, p uniformly distributed between 0.3 and 0.5, blue remains positive in the same calculation: about 0.11, 0.20, 0.33, 0.35, and 0.26 expected lives saved for N = 3, 5, 11, 101, and 10,001.

This does not mean blue wins under every imaginable prior. If you are almost certain that hardly anyone will press blue, red can become the expected-value choice. But that is a much stronger claim than "people might be selfish." You need a prior that pushes substantial probability far away from the pivotal region.

If your uncertainty includes serious probability near a tie, even while favoring red, blue is still the logical choice.

The moral intuition

The math does not say you are a monster if you press red. Fear is real. Wanting to live is not irrational.

But it does say that the slogan "red is the only logical choice" is false. Red is the dominant choice only if you care exclusively about your own survival and treat everyone else's death as zero cost.

If deaths count as deaths, regardless of whose they are, blue wins very quickly.

That is why the problem is interesting. It exposes the gap between personal safety and collective rationality. Red is safe in the narrowest possible sense. Blue is risky for you, but it is the choice that makes the system survivable.

The real lesson

The blue-red button problem is not asking whether humans are nice.

It is asking whether we can reason correctly when our individual incentive points one way and the collective expected value points the other.

In small groups, blue already has the edge. In large groups, the case becomes overwhelming. The probability that you are pivotal shrinks, but the stakes when you are pivotal grow faster.

So yes, pressing blue requires trust. But it is not blind trust.

It is trust backed by arithmetic.