In the spring of 2026, an AI staffing platform called Mercor was breached through a supply chain vulnerability. The attackers claimed to have stolen roughly 4 TB of data. Meta paused its partnership; several frontier AI labs scrambled to assess their exposure. The incident barely made the news — markets were watching oil prices, Pentagon procurement announcements, and the latest valuation rumors.
But the breach surfaced a question that had been hiding in plain sight: where, exactly, does the reasoning ability of today's frontier models come from? The answer, it turned out, was specific and human. Tens of thousands of doctors, lawyers, bankers, social workers, and PhDs, spread across multiple continents, were being paid through Mercor — a company founded by a handful of twenty-somethings just three years earlier — to feed their clinical judgment, legal reasoning, and financial intuition into the models you use every day.
A hundred and sixty-four years ago, another signature redirected the flow of knowledge. Nobody paid much attention to that one, either.
The United States has, three times in its history, made a deliberate institutional choice to turn knowledge into a public good.
The first was the Morrill Act of 1862. During the Civil War, with Southern opposition removed from Congress, the federal government handed over the proceeds of thirty million acres of public land to the states — on one condition: the money had to fund universities that taught agriculture, mechanical arts, and military science. Not classical education for elites, but practical training for farmers' sons. Lincoln signed it. The federal government gave up the most valuable asset in the West and got back a distributed network of public higher education, from Cornell to Tuskegee, spanning every state.
The second was Vannevar Bush's 1945 report. Bush — not the presidents, but an MIT engineering professor who ran the wartime Office of Scientific Research and Development — submitted Science: The Endless Frontier to President Truman. His argument: basic research is "scientific capital" that only universities will produce, because industry focuses on applying existing knowledge rather than expanding its frontiers. The federal government should fund it permanently, with researchers choosing their own topics, peer review as the filter, and open publication as the norm. NIH was reorganized in 1948; the National Science Foundation was established in 1950. The system Bush designed produced the transistor, the internet, CRISPR, and mRNA vaccines.
The third was the Bayh-Dole Act of 1980, which transferred patent rights on federally funded inventions to universities and small businesses. It privatized the application layer — but left the upstream structure of federal funding, peer review, and open publication intact.
Three institutional moments, three property-rights inflection points, all following the same logic: make knowledge public first, then negotiate privatization.
The commons is now disappearing — not because someone fenced it off, but because no one is planting anything in it anymore.
At the peak of the Cold War in 1964, federal R&D accounted for roughly two-thirds of all U.S. research spending. By 2026, that share had fallen below one-fifth. The private sector now accounts for three-quarters. The FY2026 White House budget proposal called for cutting NSF by more than half and NIH by about 40 percent. Congress is unlikely to adopt cuts that extreme, but the trajectory has been running for six decades: with federal debt approaching 100 percent of GDP, interest and mandatory spending consume the budget before science gets a turn.
What makes this worse is the lag effect. When you cut research funding, papers don't disappear immediately — today's publications were paid for five to seven years ago. Between 2002 and 2023, NIH alone was acknowledged in nearly two million peer-reviewed papers; NSF in over 900,000. Federally funded papers consistently score well above the global average on relative citation indices. The 2026 budget cliff won't show up in the data until around 2029 — and once the talent pipeline breaks, rebuilding takes a generation.
Meanwhile, in the same month, Mercor's estimated annual revenue reached $1 billion. Anthropic's reported budget for reinforcement learning environments alone was $1 billion. China's total R&D spending surpassed $1 trillion in 2024. South Korea's R&D-to-GDP ratio hit 5.1 percent. Among major OECD economies, the United States is the only one where the government is systematically withdrawing from basic research.
It's not that others are catching up. It's that America is stepping back.
Back to the 4 TB. Mercor is not last-generation Amazon Mechanical Turk — paying cents per task to draw bounding boxes. It is an AI-driven expert matching and dispatch platform that recruits high-end professionals at $60 to $200 per hour and places them on RLHF pipelines for frontier labs, where they evaluate model outputs line by line, feeding human judgment into the training loop.
Every one of those doctors, lawyers, and bankers on the platform was trained by a public education system built over more than a century.
Mercor didn't build a university. It isn't a reverse Morrill Act. It's a reverse Vannevar Bush.
Morrill traded public land for public capability. Bush traded public funding for public knowledge. Bayh-Dole traded limited exclusivity for commercialization — but none of them touched the upstream. Mercor uses private contracts to divert expert judgment out of the knowledge stream that would otherwise produce papers, textbooks, and peer review — and channels it into proprietary training data for closed-weight models.
Consider a specific person: an attending endocrinologist, trained at an NIH-funded teaching hospital, with years of clinical experience. If she stays in academia, her clinical reasoning becomes papers, teaching materials, peer review — public knowledge that future medical students can read and other hospitals can reproduce. If she signs a Mercor contract, her diagnostic reasoning trains a closed-source model under NDA. She can't publish it. It can't be peer-reviewed. It can't be reproduced. It becomes an invisible, unauditable part of a weight matrix.
There is a counter-narrative worth taking seriously: the federal government isn't retreating, it's adapting. NAIRR is building public compute; the NDAA requires intelligence agencies to share model weights; the White House action plan promotes open-weight models; researchers at NeurIPS 2025 have called for labs to release small analog models for academic study. All true — but the combined scale of these efforts doesn't match the annual revenue of a single AI staffing platform. The scale has already tipped.
The pharmaceutical industry walked this road first. In 2017, IQVIA — the world's largest contract research organization — sued Veeva Systems over proprietary data accumulated through outsourced clinical trials. The core question: when I do the research for you, who owns the data? The litigation lasted eight years before settling in 2025.
Today's AI labs are the new pharma companies. Mercor and Scale AI are the new CROs. Thousands of NDA-bound experts are generating RLHF data right now. Who owns it — the lab that pays, the platform that organizes, or the person who writes the judgment?
The Mercor breach wasn't an accident. It was the trailer for a lawsuit that hasn't been filed yet.
The last land-grant university established under the Morrill Act is still enrolling students this year.
The NSF's 2026 grant application deadline has been postponed to the next fiscal year.
Mercor's job board reportedly lists an opening for an attending endocrinologist: $200 per hour, remote, requiring years of clinical experience. The job description asks you to "evaluate model reasoning using real clinical cases."
There's an NDA in the contract appendix.
Your diagnostic reasoning won't appear in the New England Journal of Medicine.
No comments:
Post a Comment