No One Wins Forever

Why chaos, information, and evolution make permanent dominance expensive and fragile

Based on the paper Limits of Superintelligent Control in Chaotic Evolutionary Systems .

The standard story about artificial general intelligence ends the same way: a system gets smart enough, and then it wins for good. It out-predicts and out-coordinates everyone else and settles into a stable global equilibrium with itself on top.

The story rests on an assumption that more intelligence converts directly into more prediction and more control, without limit. This article is about why that assumption breaks against three structural features of the real world, and where it survives.

My claim is more limited than “takeover is impossible”: permanent, fine-grained, centralized dominance is information-theoretically expensive and dynamically fragile. Where dominance reduces to controlling a small, slow, low-entropy summary of the world, it can be cheap and durable. Where it does not, no amount of compute makes it stable. Which regime a given world falls into is an empirical question, and most of the work is in answering it.

Limit 1: chaos caps the prediction horizon

Start with prediction, because every control story is built on it. Many real systems — weather, ecosystems, markets, geopolitics — are nonlinear and chaotic: nearby trajectories diverge exponentially. If two world-states differ initially by δ0\delta_0, that gap grows roughly as

δxtδ0eλt,\lVert \delta x_t \rVert \approx \delta_0\, e^{\lambda t},

where λ>0\lambda > 0 is the largest Lyapunov exponent. Solve for the time at which the error reaches some actionable tolerance Δ\Delta and you get a predictability horizon:

Tp1λln ⁣Δδ0.T_p \approx \frac{1}{\lambda}\ln\!\frac{\Delta}{\delta_0}.

The logarithm does the damage: a billion-fold improvement in measurement precision buys only about 20/λ20/\lambda extra units of foresight.

The dependence on initial error is logarithmic, which is what makes this binding. To double your forecast horizon you must square your precision. An agent with a thousand times better sensors and models than you gets a small additive constant of extra foresight, not a thousand times more. And TpT_p stays finite for any nonzero δ0\delta_0, which physical measurement guarantees. The limit lives in the dynamics; the agent’s quality only sets δ0\delta_0.

This is why “it will simply simulate the future and pick the winning move” is not a plan. Beyond TpT_p, the variance of any long-horizon plan’s value swamps its mean, and plan comparison becomes noise. Receding-horizon replanning doesn’t rescue you: the dynamics regenerate fresh uncertainty every step.

Limit 2: control has a minimum information rate

Prediction limits don’t forbid control. You can steer a system you can’t forecast, using feedback. But chaos imposes a cost on feedback too, and this is where the dominance argument is decided.

A chaotic system produces information. As trajectories diverge, states that were indistinguishable become distinguishable, and a controller has to keep resolving those new distinctions to hold the system in place. The rate of this production is the Kolmogorov–Sinai entropy. For well-behaved (SRB) measures, Pesin’s formula makes it exact:

hKS=λi(x)>0λi(x)dμ(x).h_{\mathrm{KS}} = \int \sum_{\lambda_i(x) > 0} \lambda_i(x)\, d\mu(x).

A caveat the original draft of this got wrong: Pesin’s equality needs an absolutely continuous (SRB) measure. In general you only have Ruelle’s inequality hKSλi+h_{\mathrm{KS}} \le \sum \lambda_i^+. For the control argument that’s enough — you only need h>0h > 0.

Control theory then sets a floor. To stabilize an unstable or chaotic system through a finite-rate feedback channel, the channel’s information rate RR must exceed the system’s entropy production. For linear systems this is the data-rate theorem; the nonlinear generalization uses topological feedback entropy hTFEh_{\mathrm{TFE}}:

R>hTFE.R > h_{\mathrm{TFE}}.

Below that rate stabilization is impossible, and the floor is set by the dynamics rather than by the controller’s design. A system with positive entropy demands a strictly positive feedback rate, sustained indefinitely. Interrupt it with a sensing gap, a delay, or a severed link, and exponential divergence resumes within a few divergence times.

Local, temporary control still works: the Ott–Grebogi–Yorke scheme stabilizes unstable periodic orbits inside a chaotic attractor with tiny nudges. But it’s fragile, needs constant state monitoring, and doesn’t scale to high-dimensional, nonstationary worlds. Centralization is the worst case for it, because one controller has to out-rate the entropy production of the entire system it wants to dominate.

Measuring the floor

To make the claim concrete I took a Lorenz-96 system, the standard tunable-dimension chaos benchmark, and tried to hold it near an unstable target using feedback limited to RR bits per control interval. The threshold RR^\star is computed independently from the system’s unstable modes. The measured quantity is TdomT_{\mathrm{dom}}: how long control lasts before the state escapes.

Dominance duration versus control rate for a Lorenz-96 system. Duration rises through a transition region and saturates at the simulation cap once the control rate exceeds the entropy threshold R-star.
Figure 1. The entropy gap bounds dominance duration. Below the independently computed threshold RR^\star, control is short-lived; as RR crosses RR^\star the achievable duration climbs steeply and then saturates (control is effectively indefinite). Intelligence enters only through RR and cannot move the threshold. PDF

The transition is smooth rather than knife-edge, because sub-bit-per-mode quantization smears it, but the shape matches the theory. There is a rate below which control cannot be maintained, and that rate is a property of the world, not of the agent.

Limit 3: evolution has no terminal state

Suppose an agent affords the feedback rate and stabilizes its slice of the world. It still isn’t secure, because it isn’t alone. Real environments are competitive and adaptive, and the natural model is replicator dynamics: the frequency xix_i of strategy ii grows with its fitness relative to the population average,

x˙i=xi(fi(x)fˉ(x)).\dot{x}_i = x_i\big(f_i(\mathbf{x}) - \bar{f}(\mathbf{x})\big).

Dominance is a monomorphic state, one strategy at frequency 1\approx 1. For it to last it must be evolutionarily stable: no rare mutant can invade. But fitness here is endogenous and frequency-dependent. The act of dominating reshapes the payoff landscape and opens niches for counter-strategies. Add mutation and innovation, a diffusion term that keeps reinjecting variety, and the dominant vertex stops being absorbing. Holding it requires active defense against a constant influx of novelty.

Left: strategy frequencies over time showing the dominant share repeatedly spiking and collapsing. Right: histogram of dominance-spell lengths, roughly exponential.
Figure 2. Dominance is recurrent rather than permanent. With a chaotically drifting environment plus mutation, the leading strategy's share spikes above the dominance threshold and then collapses, repeatedly; spell lengths are roughly exponentially distributed. No monomorphic state is absorbing. PDF

This panel uses a sigmoidal “hold probability” distilled from Figure 1 rather than re-simulating the controller inside the evolutionary loop. It’s illustrative of the mechanism, not a closed derivation.

Dominance, then, is a spell rather than a fixed point. An agent can stay on top for a while, sometimes a long while, but a shifting environment and open-ended innovation keep displacing incumbents.

Reducible vs. irreducible dominance

This is where the impossibility claim has to give. Everything above assumes the agent wants to control the full, fine-grained state, and usually it doesn’t.

Dominance only requires controlling whatever observable defines dominance: resource share, institutional control, compute. The relevant quantity is therefore not the entropy of the whole world but the entropy of the cheapest sufficient projection:

hc=minΠ sufficient for dominancehc(Π).h_c^\star = \min_{\Pi\ \text{sufficient for dominance}} h_c(\Pi).

A capable agent picks the lowest-entropy summary that still counts as winning. That choice splits the world into two regimes:

  • Reducible dominance: a low-entropy sufficient statistic exists, such as slow modes or aggregates. Control is cheap and can be durable.
  • Irreducible dominance: every dominance-relevant observable is itself chaotic. Then hch_c^\star exceeds any achievable rate, and dominance has a finite half-life regardless of the agent’s capability.

The same Lorenz-96 setup shows the gap directly. Controlling a coarse aggregate (the mean) holds for the full horizon at about 1 bit per interval, while controlling the full state needs roughly 5–6 bits.

Dominance duration versus control rate for coarse (mean) control versus fine full-state control. The coarse controller saturates at a far lower rate.
Figure 3. Coarse dominance is cheap. Holding a low-entropy aggregate saturates at a fraction of the rate needed to hold the full chaotic state — the operational meaning of hc=minΠhc(Π)h_c^\star = \min_\Pi h_c(\Pi). The cost of dominance is set by the cheapest sufficient observable, not by the world's total complexity. PDF

The distinction cuts against both extremes. It explains why “it can never dominate” is wrong, since plenty of dominance targets are reducible. It also explains why “it will dominate permanently and completely” is wrong, since fine-grained control of a chaotic world is the irreducible case and is unaffordable.

Implications

Reframing the limits doesn’t make powerful agents safe. It changes the shape of the risk.

  • The threat is transient. A capable agent can do severe, irreversible damage inside a single dominance spell. That no one wins forever is little comfort to whatever gets broken before the spell ends.
  • Centralization is the fragility. A single controller trying to out-rate the entropy of everything is a single point of failure with a continuous and unbounded information requirement. Distributed control is cheaper and more robust, at the cost of the unified objective the takeover story assumes.
  • Resilience beats prevention. If terminal lock-in isn’t the generic outcome, governance should aim at limiting the damage of any single failure, preserving diversity and redundancy, and keeping the system adaptive, rather than at averting one final move.

In a world governed by chaos and evolution, no agent wins forever. But many can do lasting damage before they lose, and that is the problem worth working on.


The figures come from a small Lorenz-96 + quantized-feedback + replicator simulation. Caveats stated inline: the entropy threshold is exact for the unstable fixed point but the measured transition is smooth, and Figure 2 uses a distilled hold-probability rather than control-in-the-loop.

Sources

Chaos and predictability

Information limits of control

Evolutionary and ecological dynamics