Cybersecurity Experts Protest US Ban on Anthropic’s Fable and Mythos Models
A coalition of 76 leading cybersecurity veterans is sounding the alarm against a recent U.S. government export control order targeting Anthropic’s most advanced AI models. The group argues that by restricting access to these tools, the government is inadvertently disarming digital defenders while adversaries continue to advance.
The Conflict: National Security vs. Defensive Capability
The tension began when the U.S. government issued an export control order on Anthropic’s Fable and Mythos models, citing undisclosed national security concerns. In compliance with the order, Anthropic has suspended worldwide access to these models.
The Mythos model was originally designed with such high-level capabilities for vulnerability discovery that Anthropic initially limited access to a select group of roughly 150 organizations across 15 countries. The public-facing version, Fable, was intended to include strict guardrails to prevent misuse in biology, chemistry, and cybersecurity. However, the government's intervention has effectively halted the deployment of these powerful tools for the very people tasked with securing the internet.
The "Jailbreak" Controversy and the Amazon Paper
Anthropic suggests the White House order may stem from concerns regarding "jailbreaking"—methods used to bypass safety guardrails. This concern reportedly originates from a non-public research paper by Amazon researchers.
The paper suggested that users could bypass Fable’s security restrictions to access Mythos-level capabilities. However, cybersecurity experts, including Katie Moussouris (founder of Luta Security), argue this is a fundamental misunderstanding of AI utility. Moussouris contends that the "jailbreak" described was simply the model performing its intended function: fixing open-source code containing known vulnerabilities.
According to Moussouris, asking an AI to fix a bug, explain the patch, and write a test is not a security breach; it is the "find, fix, and test loop" that defines modern defensive security. Attempting to block these behaviors would fundamentally cripple the model's ability to protect software.
High-Stakes Signatories and Industry Implications
Barua hiyo ya wazi imungwa mkono na watu mashuhuri katika jamii ya usalama, wakiwemo mkuu wa zamani wa usalama wa Facebook Alex Stamos, mwanzilishi wa Bugcrowd Casey Ellis, na mtaalamu maarufu wa kriptografia Jon Callas. Hoja yao imejikita katika kutokuwepo kwa usawa muhimu: ikiwa walinzi watanyimwa ufikiaji wa mifumo ya kisasa ya LLM wakati maadui wanatumia mifumo isiyo na vikwazo, hali ya usalama ya kimataifa itadhoofika.
Wataalamu pia walibainisha kuwa udhaifu unaodhaniwa katika Fable si wa kipekee kwa Anthropic. Barua hiyo inadokeza kuwa "udhaifu" kama huo unaweza kujirudia kwenye GPT-5.5 ya OpenAI, Claude Opus 4.8 na Sonnet za Anthropic, na hata mifumo ya kimataifa kama Kimi 2.7.
Kikundi hicho kinatoa wito wa mchakato wa udhibiti wa kidemokrasia unaozingatia sayansi, unaotegemea utafiti wa wazi badala ya marufuku pana na za kukurupuka ambazo zinaweza kuleta madhara zaidi kuliko faida.
Mambo Muhimu ya Kuzingatia
- Upunguzaji wa Silaha za Ulinzi: Wataalamu wa usalama wa mtandao wanaonya kuwa kupiga marufuku mifumo ya Fable na Mythos ya Anthropic kunawaondoa walinzi zana muhimu zinazohitajika kutambua na kurekebisha udhaifu wa programu.
- Utendaji dhidi ya Usalama: Wakosoaji wanahoji kuwa wasiwasi wa "jailbreaking" uliowasilishwa na watafiti unachanganya kazi halali za uandishi wa kodi za ulinzi na matumizi mabaya ya mifumo.
- Wito wa Uwazi: Viongozi wa sekta wanadai mchakato wa kisayansi na wa kidemokrasia wa kutunga sheria kwa ajili ya udhibiti wa usafirishaji wa AI ili kuhakikisha kanuni hizo zinalenga mahali husika na zina ufanisi.