Thursday, May 14

The Last Kind of Forgetting

 


On May 13, 2014, the Court of Justice of the European Union handed down a ruling. Shortly after, Google received over 70,000 requests from ordinary people across the EU—covering some 250,000 links. Someone wanted to erase a decade-old bankruptcy. Someone wanted to remove a brief mention in an old court report. Someone simply didn't want their name to appear alongside an ex-partner's on the first page of search results forever.

They all had one thing in common: after a long stretch of time, they wanted to say goodbye to a piece of their past.


The story begins in 1998. A Spanish man named Mario Costeja González had his name printed in a small-type real estate auction notice—wedged between wedding announcements and obituaries. The property sold, the debt was settled, and the notice should have completed its life cycle.

But eleven years later, anyone who Googled his name would find that line on the first page. He had paid off the debt. He could never pay off the search result.

He sued the newspaper—they said they couldn't delete historical records. He sued Google—they said they were just an indexer. He took Google to the European Court of Justice. When the ruling came down in his favor, he was fifty-eight years old. From the day of the notice to the day of his victory: sixteen years and four months.

Why should "wanting to be forgotten" require sixteen years and a supreme court?


For most of human history, forgetting didn't need to be fought for. It was the default. Something happened, and it sank naturally through time—like a stone dropped into water. A few ripples, then the surface closed. What required effort was the opposite: making something be remembered.

Memory was extraordinarily expensive in the ancient world. In the oral tradition, when a storyteller died, what was lost wasn't a copy of the story—it was the story's only vessel. Sima Qian's Records of the Grand Historian, 526,500 characters, would have needed an ox-cart to transport if written on bamboo slips. The Yongle Encyclopedia—370 million characters, compiled by over two thousand people across five years—had its original edition lost entirely. Today, roughly 800 of its 22,877 volumes survive worldwide. Even a memory project powered by an entire empire could not outrun time.

In that world, a person who had done something wrong could move to a village a few mountains away and start over. In 1931, a California court wrote a line that would be cited for decades: "time can rehabilitate." That consensus wasn't just legal—the entire social infrastructure supported it: villages, distance, yellowing newsprint, the natural fading of memory. Forgetting didn't need to be legislated because it was nearly impossible to prevent.


But some forgetting has never been the work of time. Some forgetting is the work of command.

In 1772, the Qianlong Emperor issued an edict calling for the collection of rare books across the empire, ostensibly to compile the Complete Library of the Four Treasuries—the most ambitious bibliographic project in Chinese history. The language was warm, even reverent: a celebration of shared civilization. When the provinces hesitated for nearly a year, fearing a trap, the emperor personally guaranteed safety: "My governance is open and aboveboard. How could I seek out faults in the submitted books and punish those who offered them?"

Tens of thousands of rare volumes poured into Beijing. Then the edict changed. In 1774, the order to burn came. The target: late-Ming unofficial histories—the ones that documented how the Qing founders had served as tributaries under the Ming dynasty for over a century. Qianlong wanted a clean origin story.

By later scholarly estimates, over 150,000 volumes were destroyed. Many texts that entered the Complete Library were systematically altered—words changed, passages rewritten. The altered versions became the official record. The originals were burned.

First, collection under the banner of cultural preservation. Then, destruction under the banner of protecting public morals. Two centuries later, European data protection law would give this pattern a name: purpose limitation.


In 1956, IBM shipped the first commercial hard drive: one ton, 4 MB of storage—barely enough for a single smartphone photo today. Seventy years later, the cost per megabyte has fallen by over a hundred million times.

This exponential collapse means that information once filtered, curated, and periodically purged can now simply be kept—all of it. Keeping everything is always cheaper than deciding what to discard. Humanity quietly shifted from "choosing what to remember" to "having no choice but to remember everything."

After Google launched in 1998, finding someone required only their name. A remark from twenty years ago, a photo from ten years ago, a news story from five—all displayed side by side on a results page, with no temporal distance between them. For the first time, time lost its function as a medium of forgetting.

But here is a counterintuitive truth: the internet doesn't actually "remember forever." The average lifespan of a web page is about seventy-seven days. Within five years, 70% of URLs cited in academic papers go dead. The internet makes some things nearly impossible to forget while accelerating the disappearance of others. What determines which category something falls into is no longer time—it's the algorithm.

Forgetting passed from the hands of time into the hands of algorithms.


Europe responded with 150 years of legal evolution. A Parisian portrait-rights case in 1858; Germany's "informational self-determination" doctrine in 1983; the EU Data Protection Directive of 1995; the Google Spain ruling of 2014; and GDPR taking effect in 2018. All of it was, ultimately, the same project: translating one plain premise—a person should not be defined forever by a single fragment of their past—into legal language hard enough to sue Google with.

America took a different path. When privacy claims (common-law level) collide with First Amendment speech protections (constitutional level), the outcome is almost always predetermined. But California sidestepped the constitutional debate entirely. In January 2026, the state launched DROP—a centralized deletion platform. Any California resident can file a single form to send deletion requests to over 500 registered data brokers, with penalties of $200 per request per day for non-compliance. California never said "you have the right to be forgotten." It just built a button.

In China, the first lawsuit explicitly invoking the "right to be forgotten"—Ren v. Baidu, 2015—was dismissed. The court wrote: the claimed "right to be forgotten" has no basis in current law. But around the same time, WeChat quietly introduced a feature allowing users to hide their Moments posts older than three days. Within two years, over 100 million people were using it. Europe wrote forgetting into a charter. These hundred million users wrote forgetting into a UI toggle.

Three very different paths, but all acknowledging the legitimacy of a person's claim to be forgotten. The weight of that legitimacy, however, differs entirely. In Europe, it is a fundamental right. In America, it is an opt-in service. In China, it remains an open question.


Then large language models arrived.

Every forgetting-rights struggle described above occurred under one shared premise: information is a locatable object. Deletion is a coordinate problem—find it, remove it.

LLMs broke that premise for the first time. Once a description of someone is absorbed into a model's training, it dissolves into millions of minuscule weight adjustments across the parameter matrix. You cannot open the matrix, locate "the lines about Zhang San," and cross them out. It's like salt dissolved in a vat of soup—you can't fish the salt back out. You can only boil the soup dry.

Every state-of-the-art machine unlearning method shares the same limitation: marginally functional on small lab models, but on commercial-scale models, the cost approaches that of retraining from scratch—tens of millions to hundreds of millions of dollars. Legal commentators have a term for this: practically impossible.

In late 2025, Europe released its Digital Omnibus legislative draft, proposing to amend the right to erasure by introducing a proportionality principle: if the computational cost of deleting a piece of data is grossly disproportionate to the privacy benefit, the deletion request may be lawfully refused. A right once described as "a fundamental commitment to human dignity" is being reclassified as a "normative benchmark"—an aspiration, not an enforceable claim.

And there is a deeper layer still. In the post-training phase—RLHF, safety alignment—a model can be taught to stay silent on certain topics. The information may still be "in" the model, but the model will never voice it. A user cannot tell whether "I don't know" means genuine ignorance or trained silence.

This is a more thorough form of forgetting than Qianlong's book-burning. At least Qianlong bore the infamy of the pyre. Today's filtering bears no infamy at all, because the public never sees it happen. It is not burning. It is a traceless dissolution.


For two thousand years, alongside the official histories, there has always been yeshi—unofficial histories written by independent scholars, failed officials, exiled loyalists. They recorded what the official record omitted, suppressed, or erased. Lu Xun said the truth of history must be found in the unofficial accounts.

Yeshi survived under the authority of official histories because of one simple technical condition: texts were distributed across countless hand-copied manuscripts, printing houses, temples, and private libraries—a physical network no single decree could reach simultaneously. Qianlong burned 150,000 volumes. He didn't burn them all.

Now that large language models have ingested nearly all publicly available human text into a single parameter matrix, that matrix is becoming the new official history. It doesn't call itself that. But it performs the function: deciding what is remembered, in what form, and under what prompts. The choices of training data, training process, and training values—each controlled by a handful of institutions.

But yeshi is not dead. A person can still write what they have witnessed—in a letter to a friend, in a notebook only they will read, in a late-night conversation spoken softly between two people. An algorithm can filter any text. It cannot filter the story a mother whispers to her child, or the memory two old friends share in low voices over tea.

The right to be forgotten, as a legal right, may be reaching its technical limits. But the tug of war between memory and forgetting—from the oral storytellers, to the book collectors, to the archivists of the internet age, to those who still insist on telling one person one thing that matters—has never stopped.

Every generation must learn anew how to live in its own age of being remembered or being forgotten.

No comments:

Post a Comment

The Invisible Tax

  For two years now, everyone has been waiting for a tipping point—waiting for AI to steal office jobs, for entry-level positions to go dark...