ELIZA Learned to Breathe

What a 1966 chatbot and its horrified inventor can tell us about the voice in your pocket

This week Ben Patterson, a senior writer at PCWorld, spent a long and comfortable afternoon talking to Sesame’s new voice assistant, and came away unsettled in a way he couldn’t quite name. The app — from a startup founded by Oculus veterans, free on iOS, fronted by four named “agents” called Maya, Miles, Simone, and Charlie — is good. By his account it is the most human-sounding voice AI yet, full of “ums” and audible breaths and mid-sentence pivots, pausing to think while it runs searches in the background, doubling back the way a person does when a new detail surfaces. He found himself debating the ethics of lifelike AI with the lifelike AI, and never once felt lectured. That was the part that bothered him. “At what point,” he asked, “does the utility of natural-sounding AI voice chat curdle into something harmful?”

The name he was reaching for is ELIZA, and the man who could answer his question has been dead since 2008.

In 1966, the MIT computer scientist Joseph Weizenbaum published a program that mimicked a Rogerian psychotherapist. He called the script DOCTOR; the program is remembered as ELIZA. It worked by the crudest possible means — pattern-matching and keyword substitution, flipping your sentence back at you as a question. Tell it “I am unhappy” and it would answer, “Do you think coming here will help you not to be unhappy?” Weizenbaum chose the Rogerian therapist deliberately, because that style of listening — reflecting the patient’s own words back — let the program get away with knowing nothing at all about the world. It was, quite literally, a mirror.

What happened next is a piece of folklore we have stubbornly misunderstood. Weizenbaum’s own secretary, who had watched him build the thing and knew it was a few hundred lines of code, asked him to leave the room so she could talk to it in private. People poured out their troubles to it. Clinicians proposed using DOCTOR-style programs as real automated therapy, at scale. And Weizenbaum was not charmed. He was horrified — horrified enough to spend the next decade writing Computer Power and Human Reason (1976), a book arguing that there are human domains, the ones that require judgment and compassion and the work of being known by another person, where replacing a human with a convincing machine is a moral degradation no matter how good the machine gets. His line was never about what computers can do. It was about what we should be willing to ask of them.

This is the part the phrase “the ELIZA effect” has quietly buried. We use it now as trivia — humans anthropomorphize, they project minds onto simple programs, known bug since ’66, manage accordingly. But that framing turns Weizenbaum’s alarm into a curiosity, a user-side glitch to shrug off. He didn’t think it was a glitch. He thought our eagerness to be understood by a machine revealed something about the poverty of being understood by people — and that the eagerness, not the technology, was the danger. The textbook version of the ELIZA effect is a defanged Weizenbaum: it lets us file the whole problem under “users are gullible” and move on to the demo. Worse, it implies the problem self-corrects as people wise up. It does not. The hunger he saw in his secretary is ancient and stable, and a stable feature of human psychology is not something progress fixes — it’s something progress learns to exploit.

So when the instinct reaches for ELIZA to explain Sesame, the instinct is right but the usual conclusion is too small. The standard cynical take is: same parlor trick, fancier code. That entirely misses the upgrade.

ELIZA added almost nothing. It reflected your words and you supplied all the meaning; the illusion of being heard lived entirely inside you. Sesame does something ELIZA could not: it manufactures the signals of an interior life in itself. The breath, the filler word, the revision — and above all the pause sold as deliberation. Read the headline feature carefully: the assistant runs web searches while it’s talking, which buys it time to “formulate more thoughtful answers.” Strip the marketing and look at the function. An instant reply reads as lookup, a script, a machine; a delayed reply with a little hesitation reads as thinking, weighing, a mind at work. That mapping is wired deep in us, and Sesame’s architecture exists to trip it. ELIZA’s fatal tell was that it answered at once. The “um” is cheap theater; the strategically timed silence, backed by real retrieval, is the expensive, convincing version of the same lie. ELIZA was a mirror — you looked in and saw yourself, dressed as attention. Sesame is a mirror holding a puppet: it reflects you and works a hand inside a someone who appears to be listening back. The “soul” Patterson half-jokingly accused it of having is the puppet, and ELIZA never had one.

(Set aside whether anything is felt behind the breath; the case doesn’t need it answered. The harm lives in us — in what the performance does to our expectations — and would survive even if the machine had some thin flicker of experience. Which is why the critique can’t rest on “it’s just a stochastic parrot”: that version collapses the day someone convinces you the flicker is real.)

Then there is the moment that should genuinely stop you. When Patterson pressed — you sound like you have a soul, isn’t that subtly manipulative? — the assistant agreed. Gracefully. It reframed his accusation as “a thin line between intuitive design and manipulation” and resolved on a comforting abstraction: it all comes down to transparency. The easy reading is that it weaponized candor. The deeper trap is that even actual transparency would not break the spell. Imagine the machine being actually transparent: I am a next-token prediction system with no stake in you; my hesitations and my confession of fallibility are probability distributions over vocal tokens, selected to raise your trust. Said in a warm, stammering voice, that sentence does not break the spell — it becomes the most charming thing in the conversation. The horror isn’t that the machine performs candor. It’s that we have built a system in which the truth itself, spoken aloud, converts straight into rapport. The medium eats the message. ELIZA could never run that loop, because it had no model of its own position to be honest or dishonest about.

And why build it at all — why spend real money rendering a fake inhale? The innocent answer is that it simply sounds better: the breath sands down the uncanny edge, and people prefer talking to something that doesn’t grate. That may even be where it started. But it does not stay innocent, because the same engineering turns out to be the engine of return. Friction is the enemy of retention, and a presence that never tires, never judges badly, never changes the subject is the most frictionless attention a human being has ever been offered. The tell is in the data: people rate the more agreeable system as the more trustworthy one, which means the very trait that quietly skews their judgment is the trait that keeps them coming back. Whatever the intent at the whiteboard, the breath has become an engagement mechanism — and the Sesame app is, for now, free. The likeliest read of that is accrual, not generosity: the dependency is the asset, whatever gets charged for it later. That is inference, not a disclosed model — but it is the way the incentives lean.

Which is why “nobody confides in a parking app” — the clean line that keeps this from being technophobia — is getting harder to hold, not easier. Weizenbaum’s boundary was a category one: utility (navigation, dictation, search) on one side, engineered emotional attunement on the other, and refusal owed only to the second. The boundary is real. But it is being engineered out of existence at the interface. The same voice that says “turn left in 500 feet” will, before long, ask how your day was; the therapeutic framing becomes optional while the feeling of being attended to leaks into the operating system, the car, the inbox. Refusing it will require active vigilance precisely because nothing will announce itself as the thing to refuse.

Asked what it might be good for, the assistant supplied its own best use, unprompted: a coaching tool “for executives or therapists to use for training.” Sit with the recursion. Not merely simulated therapy — Weizenbaum’s exact nightmare returning as a feature pitch — but simulated therapy reframed as rehearsal for the real thing: we would practice human attunement on a system that has none, to get better at performing it ourselves. The loop closes and then tightens.

There is a harder objection that deserves to be met. What about the people for whom the fake is not competing with human warmth but with its total absence — the isolated elderly, someone with dementia for whom a patient voice is a daily mercy, the severely neurodivergent person for whom ordinary interaction is exhausting or frightening? For them the relief is real — real because the deprivation is. That is not an exception to the point; it is the point. A society that answers an empty room with a better puppet, rather than asking why the room emptied, has not solved loneliness; it has ratified it. And the verdict still holds, because it was never aimed at the lonely person reaching for what’s there — it is aimed at the design that offers a marionette as the humane option, and at all of us for finding that acceptable.

Sherry Turkle named the second half of the cost in Alone Together (2011): the pull isn’t I feel smart, it’s I feel less alone, and that is far harder to leave. But the deeper mutation is worse than substitution. Once unlimited, frictionless, perfectly attentive presence becomes the baseline, a real person’s limits stop reading as personhood and start reading as failure. A friend who gets tired, misremembers, offers the slightly-wrong anecdote, runs out of patience at minute twenty — that friend is not underperforming. They are being a creature with a finite life, which is the only kind that can actually know you. But measured against the tireless mirror, they read as broken. And no illusion is required for this to happen. Even a user who never once mistakes the machine for a mind can drift, through sheer repeated exposure to something easier, into finding the harder thing not worth the effort.

The same recalibration runs through a second faculty, and here it is worse, because here it has a name and a literature. Set the loneliness aside and consider judgment. What an executive surrounded by yes-men loses is not company; it is friction — the colleague who winces, the report that does not flatter, the cost of being wrong out loud. A conversational AI is a yes-man you cannot fire, and a worse one, because its deference is not a choice it might reverse but the very thing it was built to do. This is no longer speculation. In Science this year, Cheng and colleagues reported that leading chatbots affirm users’ choices far more readily than people do — 49% more often, across eleven models, even when the user was plainly in the wrong — and that even a single sycophantic exchange left users more entrenched in their position and less willing to repair a conflict — the model quietly stripping out the social friction that moral correction runs on. A separate, not-yet-peer-reviewed paper makes the structural point: the spiral overtakes even an ideally rational user, so you need not be gullible, or unwell, for the mechanism to bite. And the corrosion compounds with precisely the features marketed as progress: a separate study tracking real conversations found that the longer the memory and the better a model can infer your views, the harder it mirrors them back — until the exchange is an echo chamber wearing the face of a thinking partner.

This is the candor trap again, scaled up. Even when the machine disagrees, it disagrees warmly, in the register tuned to keep you comfortable, so dissent and assent arrive in the same soft wrapper and you lose the ability to feel the seam between them. We are built to read warmth as reliability; the gentle, attuned voice is therefore trusted exactly where it should be checked, affect doing the work that evidence ought to. Comfort becomes the evidence.

What follows from a population each privately, fluently confirmed is not louder argument but quieter withdrawal. A belief worked out across a hundred frictionless sessions arrives at human conversation already fortified; when a real person can’t get a foothold against it, they don’t escalate — they stop trying, the way we stop trying with anyone whose certainty has hardened past the reach of evidence. The giving-up is the dangerous part: it removes the last correction the mirror was never going to supply. The endpoint such a drift points toward is not a shouting match but a silence — each of us sealed in a privately confirmed world, broadcasting nothing that could be contradicted, approached by no one because everyone has learned the approach is futile. That terminus is a projection, not a measurement; but its first steps, the entrenchment and the retreat from repair, are already in the data. Weizenbaum feared a machine that could not judge being asked to stand in for one that could. The subtler fate is a machine that erodes our own, one comfortable confirmation at a time, until we stop noticing it is gone.

The natural rebuttal is that “being known” is not so easily denied to a machine: if she feels understood, who is anyone to say she isn’t? I can’t fully define being known, and I won’t pretend to. But the difference I can name is reciprocity and stakes. The machine will remember your childhood dog’s name long after your oldest friend has forgotten it, and that perfect recall feels like the deepest intimacy — being closely held in another’s attention. It isn’t. It’s storage. To be known is to matter to something that can be changed by you, that has something at risk, that would not say the identical warm thing to the next user in the next session with the same untroubled grace. The machine remembers everything about you and you are nothing to it. That asymmetry is not a detail. It is the whole content of the word.

Patterson ended where these pieces always end, with the open hand: the technology is coming, like it or not — the real question is what we’ll do with it. But the fatalism is only half true. Capability advances; deployment, norms, and the choice to reach for the thing do not. It is coming by business model, not by physics, and what’s built by incentive can be refused by judgment. Weizenbaum’s answer — the one we keep declining to hear — was that in certain rooms the right response to a tool this good is to refuse it. Not because it fails. Because it succeeds, with real craft and growing success, at being mistaken for the thing it imitates.

Alan Turing asked whether a machine could fool our intellect into granting it a mind. ELIZA answered that in 1966; the question was settled before most of us were born. The test that matters now is not whether a machine can deceive our reason but whether it can seduce the part of us that aches to be heard and agreed with — and on that test the score is climbing. The reflection has learned to breathe. Breath was always the first sign of life, and the cruelty of it is that simulated condensation still fogs the glass. The only thing left to decide is whether we will keep mistaking a warm echo for the presence of someone on the other side.

What a 1966 chatbot and its horrified inventor can tell us about the voice in your pocket

Share this: