US Government vs Anthropic: The Impossible Demand for Unhackable LLMs

📅4 hours ago⏱3 min read


A growing rift has emerged between the U.S. government and Anthropic following the release of the Fable 5 model, sparking a debate over AI safety and regulatory oversight. As officials accuse the AI lab of bypassing executive orders, a deeper technical tension is surfacing: the government’s demand for "unhackable" frontier models.

The Conflict Over Fable 5 and Cyber Directives

The tension stems from Anthropic’s decision to release its latest model, Fable 5, before a designated government clearinghouse—mandated by a recent Trump administration cyber executive order—was fully operational. While the order called for voluntary oversight, government officials claim Anthropic ignored the spirit of the directive, leading to accusations that the company is a "bad actor."

Current discussions involving the Department of Commerce, the CIA, and science advisor Michael Kratsios highlight a massive communication gap. Officials have expressed concern that Anthropic proceeded despite knowing a "jailbreak" risk existed—a tip reportedly provided by Amazon and other tech industry partners. However, the friction appears to be as much about regulatory timing as it is about technical security.

The Technical Reality: Can LLMs Ever Be Unhackable?

The crux of the government's criticism—that Anthropic "took the wrong fork" by ignoring potential jailbreaks—ignores a fundamental reality of Large Language Model (LLM) architecture. In the AI industry, the consensus is that absolute security is currently an impossibility. Even OpenAI has acknowledged that vulnerabilities like prompt injection may never be fully solved.

Anthropic CEO Dario Amodei has previously noted that while a jailbreak in sensitive fields like biology or tech could be "life or death," the industry is still grappling with how to mitigate these risks. By demanding that models be essentially unhackable before they ship internationally, the U.S. government may be setting a precedent that stifles innovation, since no frontier model (including GPT-5.5 or Kimi 2.7) is perfectly secure.

Industry Backlash and the Export Control Debate

In response to the growing tension, more than 100 cybersecurity experts and senior executives, including industry veterans such as Alex Stamos and Rachel Tobac, have published an open letter to Commerce Secretary Lutnick and National Cyber Director Cairncross. They call for lifting the export restrictions on Anthropic's Fable and Mythos models.

The experts argue that while Fable is highly capable of identifying software flaws, it is not uniquely dangerous compared with other models such as Opus or Sonnet. Crucially, they warn that strict export restrictions actually hamper Western defenders. By limiting access to top-tier American models, the government may inadvertently hand an advantage to Chinese open-weight models, which are reportedly only a few months behind the leading American frontier models.

Key Takeaways

Regulatory friction: Anthropic is under fire for releasing Fable 5 before the government's voluntary oversight mechanism was in place.
The security paradox: Government demands for "unhackable" AI collide with the technical reality that prompt injection and jailbreaking are inherent risks in current LLM architectures.
Geopolitical risks: Industry experts warn that aggressive export restrictions on models like Fable could weaken U.S. cyber defense while failing to slow the rapid progress of Chinese AI.
