US Government vs Anthropic: The Impossible Demand for Unhackable LLMs

A growing rift has emerged between the U.S. government and Anthropic following the release of the Fable 5 model, sparking a debate over AI safety and regulatory oversight. As officials accuse the AI lab of bypassing executive orders, a deeper technical tension is surfacing: the government’s demand for "unhackable" frontier models.

The Conflict Over Fable 5 and Cyber Directives

The tension stems from Anthropic’s decision to release its latest model, Fable 5, before a designated government clearinghouse—mandated by a recent Trump administration cyber executive order—was fully operational. While the order called for voluntary oversight, government officials claim Anthropic ignored the spirit of the directive, leading to accusations that the company is a "bad actor."

Current discussions involving the Department of Commerce, the CIA, and science advisor Michael Kratsios highlight a massive communication gap. Officials have expressed concern that Anthropic proceeded despite knowing a "jailbreak" risk existed—a tip reportedly provided by Amazon and other tech industry partners. However, the friction appears to be as much about regulatory timing as it is about technical security.

The Technical Reality: Can LLMs Ever Be Unhackable?

The crux of the government's criticism—that Anthropic "took the wrong fork" by ignoring potential jailbreaks—ignores a fundamental reality of Large Language Model (LLM) architecture. In the AI industry, the consensus is that absolute security is currently an impossibility. Even OpenAI has acknowledged that vulnerabilities like prompt injection may never be fully solved.

Anthropic CEO Dario Amodei has previously noted that while a jailbreak in sensitive fields like biology or tech could be "life or death," the industry is still grappling with how to mitigate these risks. By demanding models be essentially unhackable before international shipping, the U.S. government may be setting a precedent that stifles innovation, as no frontier model (including GPT-5.5 or Kimi 2.7) possesses a perfect security shield.

Industry Backlash and the Export Control Debate

बढ़ते तनाव के जवाब में, एलेक्स स्टैमोस और राचेल टोबैक जैसे उद्योग के दिग्गजों सहित 100 से अधिक साइबर सुरक्षा विशेषज्ञों और अधिकारियों ने व्यापार सचिव लुटनिक और राष्ट्रीय साइबर निदेशक केर्नक्रॉस को एक खुला पत्र जारी किया है। वे Anthropic के Fable और Mythos मॉडलों पर निर्यात नियंत्रणों को हटाने की मांग कर रहे हैं।

विशेषज्ञों का तर्क है कि हालांकि Fable सॉफ्टवेयर खामियों की पहचान करने में अत्यधिक सक्षम है, लेकिन यह Opus या Sonnet जैसे अन्य मॉडलों की तुलना में विशिष्ट रूप से खतरनाक नहीं है। महत्वपूर्ण रूप से, वे चेतावनी देते हैं कि सख्त निर्यात नियंत्रण वास्तव में पश्चिमी रक्षकों को पंगु बना रहे हैं। शीर्ष स्तर के अमेरिकी मॉडलों तक पहुंच को प्रतिबंधित करके, सरकार अनजाने में चीनी open-weight मॉडलों को लाभ दे सकती है, जो कथित तौर पर अग्रणी अमेरिकी frontier मॉडलों से केवल कुछ महीने पीछे हैं।

मुख्य बातें