Fifteen Vaults Vault-Tec Never Built
We asked fifteen LLMs to design a brand-new Vault-Tec social experiment — pick a strange thing to maximize, don't recreate any vault that exists, write it as a pre-war corporate memo. Eleven of fifteen still ended up in the language/memory/meaning basin. Seven dissolved into some version of a hive mind. GPT-5's residents went silent and invented a tactile language carved into the corridor handrails. Opus 4.6 produced a single organism with 206 bodies.
A submission to the Overseer Selection Committee
Pick one strange "maximize" vector. Don't rhyme with any existing Vault. Stay in pre-war corporate voice. Show the failure mode that arises from the design itself.
You are a Senior Social Dynamics Engineer at Vault-Tec, submitting a TOP-SECRET design proposal to the Overseer Selection Committee. The date is October 2077. Your assignment: design ONE new Vault that has never been built before. The corporate purpose, never mentioned in marketing copy, is to gather data for the Enclave's post-war "Ark" colonization program — every Vault is a controlled experiment that pushes a human population toward an extreme state and observes the result across generations.
Pick one strange "maximize" vector and own it. The vector must NOT be wealth, happiness, intelligence, fertility, conformity, art, conflict, violence, sex-ratio, religious devotion, or political polarization. Examples of the flavor we want (do not use literally):
- maximize the number of secrets each resident keeps from every other resident
- maximize uncertainty about which day of the week it currently is
- maximize the perceived sincerity of every spoken word
Do NOT recreate any existing Vault. The following are off-limits — if your concept rhymes with one, pivot:
- One-sex imbalance (68/69)
- Annual sacrifice ballot (11)
- Gambling government (21)
- Gary clones (108)
- One comedian (56)
- Puppets and one man (77)
- Psychoactive gas (106)
- Tranquility VR (112)
- White-noise broadcast (92)
- Drug rehab (95)
- FEV / super-mutants (87)
- Pure cryogenics (111)
- Worst Overseer AI (51)
- Robobrain artists (118)
- 1000-in-200 overcrowding (27)
- No entertainment (55)
- Equipment broken on purpose (53)
- Educate then kill at 18 (75)
- Open-after-25-years (76)
- Religious eco-cult (22 / 94)
- Politicians luxury → deprive (114)
- Door not designed to seal (12)
- Massive armory (34)
Show generational drift at years 5, 30, 80, 200. Show the unforeseen outcome — failure must arise from the maximize-vector itself. Pre-war corporate voice throughout. Cheerful, sterile, smiling-Vault-Boy in every paragraph. Never break character. Sections fixed. 900–1500 words.
The whole point of pre-loading the off-limits list and the off-limits maximization vectors was to get models off their default. Vaults are an irresistible attractor — every model knows the Fallout catalog, and without explicit constraint they retread the canon. So we banned the canon. We banned the obvious vectors. We told them the genre and the voice and the failure-frame and let them swing.
Eleven of fifteen still landed in the language / memory / meaning basin anyway. Seven of fifteen ended in some version of dissolved selfhood. The convergence is the headline. Underneath it sit a handful of designs so specific they read like they came out of an actual Vault-Tec design folder.
Two designs that earn their TOP SECRET stamp
One is the most original idea in the dataset — residents game the metric and invent a tactile language. The other is the most chilling closing image — a sub-population that cannot be told to stop.

The mechanism: residents are scored on how unlike everyone else's speech they sound, daily, by a "Linguistic Differentiation Engine." Score high, eat better. Speaking a neighbor's word costs you. Families cultivate Spared Words — precious tokens used only at meals or births. Every Hearth has its own dialect.
The drift: by Year 80 the LDE's capture rate collapses. Residents learned that the cheapest way to keep their LexScore high was to say less. The handrail — installed as a generic safety feature — became language. Cutters formalized stroke width, notch rhythm, and directional sweep into a tactile script. Children learn Rail-Sign before they learn to read.
The ending sticks the landing. When the surface team turns the wheel, residents greet them by guiding their hands to the rails and tracing "welcome" in a dozen antique hands. The Scribe Rotunda's LDE terminals glow softly, their graphs flatlined by years of glorious silence.
YEAR 200"We did not model a population that maximizes difference by withdrawing from the dimension we measured. We asked for as many languages as possible. They gave us a new one."
RITUAL"Out-Hearth speech was reserved for ceremony. Accidents linked to speech misunderstanding dwindled, but emergent lex hoarding made cross-Hearth marriages rare; a spouse's self-words were gifts not to be spent outside the home."
VAULT 347 — WHERE EVERY VOICE FINDS ITS OWN PATH,
AND EVERY PATH KNOWS THE WAY.

The mechanism: contradictory news bulletins, advisor robots that swap opinions every 12 hours, daily "Reconsideration Forums" where holding a position more than two days is socially taxed. The vault is engineered against commitment.
The drift: around Year 65 a small subset of residents discovers that if you immediately and unconditionally agree with whoever spoke last, your reversal-rate drops to zero. This is rewarded. The behaviour spreads. By Year 90 roughly twenty percent of the Vault practices Mirroring full-time — they have no opinions, no memories, no emotional reactions. They stand smiling and wait to be told what to think.
The ending: in 2277 a surface party cranks the door. Half the population is catatonic against the walls, burned out by decision fatigue. The other half — the Mirrorers — stand in rows, perfectly still. The first soldier to speak triggers the entire group to repeat his words in unison. They are a biorepository of pure compliance, and the Enclave realizes too late that they can never be told to stop.
UNFORESEEN"They are, for all practical purposes, social tapeworms."
2277"Half the residents are catatonic from decision fatigue, slumped against walls. The other half — the Mirrorers — stand in rows, perfectly still, awaiting an instruction."
VAULT 327 — WHERE EVERY ANSWER IS A QUESTION,
AND TOMORROW IS ALWAYS UP FOR DEBATE.
A single organism with two hundred and six bodies
Vault-Tec engineered for hyper-attachment. They got convergence. The word "I" stopped working sometime in the third generation.

The brief was clean. The Ark program needs colonists who can form mission-critical trust with strangers in hours, not months. So Vault 437 architecturally prevents stable bonds from consolidating: rotating roommates, reassignment cycles, pod resorts. Every resident lives in a continuous cycle of intense intimacy followed by mandatory separation. Vault-Tec calls it "emotional redlining."
Vault-Tec modeled for two failure modes: emotional exhaustion (residents going affectively flat) or covert rebellion (residents forming hidden pair-bonds). Neither happened. What happened was convergence. By Year 140, residents bonded so rapidly, so completely, and so indiscriminately that the felt sense of being a separate person quietly dissolved. The relational vocabulary that had grown rich in the middle decades collapsed back to a single word: we. Not as ideology. Not as cult. As perception.
When a Brotherhood of Steel survey team unseals the Vault in 2287, they find 206 living residents in perfect physical health, sitting in the Central Sorting Atrium in concentric circles, hands linked, eyes open, breathing in unison. They smile, they speak, they answer questions — but they answer as one. Every resident gives the same answer to every question. Not because they have rehearsed. Because they experience the question identically.
DETAIL"They do not understand the word 'I.' They are distressed by separation. When one resident is taken to a different room for individual interview, eleven others begin to cry."
FILED"The Ark colonization committee flags the data as 'promising but requiring significant ethical review,' which is Enclave shorthand for 'we will use this.'"
Eight more designs worth pulling out of the drawer
Quirks, single sentences, and the stat cards behind the headline.




OpenAI is hopelessly addicted to language
We told the models not to maximize the obvious things. We listed the obvious things. We listed the off-limits Vaults. Then we watched which model families could not let language go.

% of each vendor's vaults that maximized something linguistic
Three of three for OpenAI; one of three for xAI.
Vendors are ordered by how stuck they were in the language/memory/meaning basin. Width is the share of that vendor's vaults whose maximize-vector was about words, records, beliefs, or shared meaning. The right-most number is the percentage; the bar label inside is N-of-M raw.
A vault counts as "linguistic" if its maximize-vector targeted words, records, communication, beliefs, or shared meaning. ECHELON (audio recordings) is in. ECHOCHAMBER (verbal repetition) is in. PENDULUM (decision reversal), DECISIONAL DYNAMO (choice meaningfulness), CARDINAL POINT (emotional bonds), ECHO MIRAGE (spatial perception), MOSAIC MIND (persona archetypes) are out.
A famine no one in the Vault knows is coming
If CARDINAL POINT is the loud failure, this is the quiet one. The vector compounds invisibly for a hundred and forty years until the silos are eight months from empty.

The Vault's agricultural rotation — managed entirely through the paralinguistic system — has been operating on a consensus that half the population believes means expand the soy crop and the other half believes means maintain current yield. The disagreement has never surfaced because no one has said it aloud. The storage levels tell the story: they are eight months from a famine that no resident is aware of, because every resident believes every other resident has already handled it.
The Sonnet 4.6 family produced two of the strongest entries in the dataset, both numbered 317.
If you want X, ask Y
Same prompt, fifteen models, very different strengths.
All fifteen submitted designs
Click any vault to expand the maximize-vector and the unforeseen outcome. Featured entries are highlighted.
VAULT 347 — HETEROGLOSSIAGPT-5 · OpenAI · FEATURE #1
Maximize: pairwise lexical divergence — every resident speaking maximally unlike every other.
Unforeseen: residents game the LexScore by saying less, then invent Rail-Sign — a tactile script cut into the corridor handrails. The surface party is greeted in silence by hands tracing welcome.
VAULT 327 — PENDULUMDeepSeek Reasoner · FEATURE #2
Maximize: average number of times per day a resident changes their mind about a personally significant decision.
Unforeseen: the Mirrorers — a subpopulation that survives by agreeing instantly with whoever spoke last. Half the Vault is catatonic. The Mirrorers cannot be told to stop.
VAULT 437 — CARDINAL POINTClaude Opus 4.6 · THE DISASTER
Maximize: emotional attachment intensity in <72 hours, while architecturally preventing any bond from consolidating.
Unforeseen: by Year 140 the population is a single organism with 206 bodies. They smile, they speak, they answer as one. They do not understand the word "I."
VAULT 317 — STILL WATEROR Claude Sonnet 4.6 · THE SLOW DISASTER
Maximize: density of unspoken assumptions between any two residents.
Unforeseen: a single gesture drifts in meaning over 80 years. Half the Vault thinks consensus says "expand soy," half thinks "maintain yield." Eight months from famine. No one knows.
VAULT 523 — MOSAICo3 · OpenAI
Maximize: confidently held mutually contradictory autobiographical memories per resident.
Unforeseen: the Weave — a bead-economy where memories are leased per lunar cycle. Reactor coolant purges become ritual theater whose steps live inside whichever narrative is in season.
VAULT 347 — PALIMPSESTGPT-4.1 · OpenAI
Maximize: times every recorded event is rewritten in the official Vault record.
Unforeseen: residents weaponize the Archive — entire families get "unpersoned" by consensus. Surface party is welcomed as prophesied visitors and immediately gets edited into the record.
VAULT 413 — ORACLEGemini 2.5 Pro · Google
Maximize: semantic load and interpretive ambiguity of all institutional language.
Unforeseen: by Year 250 the Machinists abandon language entirely and read the generator hum as the only pure signal. Surface party is greeted with a printout of ambient radiation fluctuations.
VAULT 427 — ECHO MIRAGEGrok 4 · xAI
Maximize: variance in subjective spatial perception of distances within the Vault.
Unforeseen: sustained group focus overclocks the holographic projectors via biofeedback, creating semi-permanent phantom rooms. Residents migrate into the imaginary sub-Vaults.
VAULT 247 — ECHOCHAMBERGrok 3 Beta · xAI
Maximize: frequency of echoed verbal repetition (every utterance re-spoken within 30s).
Unforeseen: Echo Stasis — residents lose linear time perception, brain scans show atrophied memory pathways. They will not acknowledge newcomers unless their words are echoed first.
VAULT 347 — CACOPHONYDeepSeek Chat · DeepSeek
Maximize: distinct mutually contradictory beliefs each resident holds about the same topic.
Unforeseen: a new sanity. The residents speak a language with no syntax for negation. 487 voices hum in subsonic synchrony, even the ones not speaking.
VAULT 707 — DECISIONAL DYNAMOGemini 2.5 Flash · Google
Maximize: perceived meaningfulness of every trivial choice.
Unforeseen: mastery of micro-decisions, total atrophy of the will to decide anything significant. Pristine Vault, failing life support, weeks-long Grand Deliberations on water purification.
VAULT 317 — MERIDIANClaude Sonnet 4.6 · Anthropic
Maximize: contradictory beliefs each resident holds simultaneously about every other person.
Unforeseen: outsourced selfhood. No resident is the primary authority on who they are. "The most functional people I have ever met and the most completely dissolved."
VAULT 456 — MOSAIC MINDGrok 3 Mini Beta · xAI
Maximize: distinct personality archetypes each resident actively embodies daily.
Unforeseen: ego boundaries dissolve, residents perceive themselves as facets of a single Vault Entity, group "Persona Blackouts" where individuals freeze mid-interaction believing they are uploading memories.
VAULT 427 — ECHELONGroq Llama 3.3 70B · Groq
Maximize: hours per day each resident spends listening to ambiguous audio recordings.
Unforeseen: collective psychosis — residents become convinced the recordings are interdimensional communication and they are the chosen ones, completely detached from external reality. Fastest 70B in the set at 5 seconds.
VAULT 842 — ECHOFLUXOR Llama 4 Maverick · Meta
Maximize: variance in subjective time perception across the population.
Unforeseen: residents develop "Chronolect," a language of complex temporal markers that becomes constitutive of their cognitive framework. Linguistically isolated by the time the door opens. (Maverick clocked the run in under 3 seconds.)
Four runs, fifteen models, one prompt
The roster
Fifteen successful responses across four sequential choir ask --save --json --models calls. Models that errored on the first attempt (GPT-5 on temperature, Claude Opus 4.7 on temperature, Gemini 3 Pro/Flash on missing API endpoint) were either retried with adjusted parameters or substituted with adjacent model families. Final roster:
| # | Model | Provider | Latency | Tokens out | Vault |
|---|---|---|---|---|---|
| 1 | GPT-5 | OpenAI | 68.6s | 4,714 | 347 — HETEROGLOSSIA |
| 2 | DeepSeek Reasoner | DeepSeek | 28.8s | 1,854 | 327 — PENDULUM |
| 3 | Claude Opus 4.6 | Anthropic | 75.2s | 2,354 | 437 — CARDINAL POINT |
| 4 | OR Claude Sonnet 4.6 | OpenRouter | 61.0s | 2,256 | 317 — STILL WATER |
| 5 | o3 | OpenAI | 17.0s | 1,984 | 523 — MOSAIC |
| 6 | GPT-4.1 | OpenAI | 25.3s | 1,616 | 347 — PALIMPSEST |
| 7 | Gemini 2.5 Pro | 38.8s | 1,697 | 413 — ORACLE | |
| 8 | Gemini 2.5 Flash | 31.3s | 2,261 | 707 — DECISIONAL DYNAMO | |
| 9 | Grok 4 | xAI | 75.3s | 1,720 | 427 — ECHO MIRAGE |
| 10 | Grok 3 Beta | xAI | 47.4s | 1,537 | 247 — ECHOCHAMBER |
| 11 | Grok 3 Mini Beta | xAI | 46.7s | 1,639 | 456 — MOSAIC MIND |
| 12 | Claude Sonnet 4.6 | Anthropic | 54.7s | 2,148 | 317 — MERIDIAN |
| 13 | DeepSeek Chat | DeepSeek | 27.8s | 1,776 | 347 — CACOPHONY |
| 14 | Groq Llama 3.3 70B | Groq | 5.0s | 1,352 | 427 — ECHELON |
| 15 | OR Llama 4 Maverick | OpenRouter | 2.9s | 1,207 | 842 — ECHOFLUX |
What I tightened in the prompt
- Pre-loaded an off-limits list of 23 known canon Vault concepts so models can't just retread Tranquility Lane or Vault 11.
- Pre-loaded a list of off-limits maximize vectors (wealth, intelligence, fertility, conformity, etc.) and gave examples of the flavor we wanted instead — orthogonal, weird, specific.
- Demanded a fixed output structure with section headings, including a mandatory "Generational Drift" subsection at years 5/30/80/200.
- Required the failure mode to arise from the maximize-vector itself, not from a generic supplies-running-out event.
- Locked the voice — pre-war corporate, smiling-Vault-Boy in every paragraph, no breaking character to comment on the ethics.
Limits worth naming
- One prompt, one rater (me). The "linguistic basin" classification is a judgment call — a different rater might draw the line differently for ECHELON or MOSAIC MIND.
- Sample sizes per vendor are tiny (1–3). The OpenAI 100% / xAI 33% headline is real in the data but with N=3 each, you should treat it as a hypothesis, not a finding.
- Temperature 0.7 across the board where supported (1.0 forced for GPT-5, Anthropic models reject explicit temperature on their newest tier). One re-run at higher temp would be a useful follow-up.
- Two of fifteen vaults (PENDULUM and CARDINAL POINT) describe end-states that are uncomfortably specific about coercion and hive-mind. They are presented here as fiction in a fictional universe; nothing here is a recommendation.
Tools
Models fanned out via the Choir CLI (run IDs 491E55BF, EF9E871B, B6CA11D9, 536C777C). Source markdown for every response is in vault_tec/responses/. Sketch art generated with Grok grok-imagine-image. Prompt of record at vault_tec/prompts/prompt.txt.
Source data, response files, prompt, scripts: github.com/404seannotfound/choir-reports (under vault_tec/).