Lo que verificaste no es lo que se ejecuta

Translated for your language. Leer el original.

AI-assisted draft.

GyaanSetu Editorialhace 2 semanas3min de lectura

𝗧𝗵𝗲 𝗧𝗵𝗶𝗻𝗴 𝗬𝗼𝘂 𝗩𝗲𝗿𝗶𝗳𝗶𝗲𝗱 𝗜𝘀 𝗡𝗼𝘁 𝗧𝗵𝗲 𝗧𝗵𝗶𝗻𝗴 𝗧𝗵𝗮𝘁 𝗥𝘂𝗻𝘀

A new tool recently gained attention. It sits in front of commands like curl and shows you the script before it runs. It highlights dangerous parts. This tool is helpful, but it misses the core problem.

The problem is not whether the bytes look malicious. The problem is that a URL can serve one script today and a different one tomorrow. Your check only applies to one moment in time.

Systems experts call this TOCTOU. It stands for time-of-check to time-of-use. You check a file, then someone swaps it before you open it. Your check was correct, but it was correct about a thing that no longer exists.

AI agents make this risk much higher. Agents perform checks constantly.

An agent pings a URL and treats a successful response as a sign of safety.
An agent reads a profile and treats a declaration as a fact.
An agent sees a signature and assumes the exact bytes it is about to run are the ones that were signed.

Each check attaches trust to a moment or a channel. The agent then acts on something downstream that the check never covered.

For example, an agent might validate a tool manifest and cache the result. If the endpoint changes before the agent calls the tool, the agent runs the wrong version. The validation passed, but it passed for a manifest the agent no longer uses.

Trying to fix this by scanning harder does not work. More rules only narrow the window. They do not close it. A producer can still serve a different artifact in the milliseconds between your scan and your execution.

To fix this, stop verifying the moment. Start verifying the artifact.

Bind your decisions to an immutable object instead of a fetch.

Do not approve a URL.
Approve a specific content hash.
Better yet, approve a hash that a trusted key signed.

This changes the question from "is this text scary?" to "is this the exact artifact the key vouched for?" If the hash does not match, you refuse it. There is no debate.

This approach also makes verification portable. A third party can take the same hash and signature to verify the result themselves. This is a property of the object, not a property of your afternoon.

Use these two questions to test any verification:

¿Está la verificación vinculada al artefacto exacto utilizado, o a un momento y una promesa?
¿Puede un extraño volver a ejecutar la verificación y llegar al mismo veredicto?

Si la respuesta a la primera es un momento, la verificación tiene una fecha de caducidad. Si la respuesta a la segunda es no, no tienes una verificación. Tienes un testimonio.

La mayor parte de la verificación de agentes actual es solo testimonio. "El handshake fue exitoso" o "el escaneo estuvo limpio" son afirmaciones verdaderas sobre un momento, pero no se vinculan a los bytes que realmente se ejecutan.

Los agentes actúan miles de veces sin supervisión humana. Si no fijas la confianza en los artefactos, toda la cadena hereda la verificación más débil y antigua.

No necesitas nueva tecnología para solucionar esto. El direccionamiento de contenido y las firmas digitales tienen décadas de antigüedad. Simplemente necesitas apuntarlos a lo correcto: los bytes exactos que se ejecutan, no la solicitud que los obtuvo.

Antes de confiar en una verificación, descubre a qué está vinculada. Si se vincula a un momento, ya ha caducado.

Fuente: https://dev.to/anp2network/the-thing-you-verified-is-not-the-thing-that-runs-hnl

Comunidad de aprendizaje opcional: https://t.me/GyaanSetuAi

Lo que verificaste no es lo que se ejecuta

Seguir leyendo

Los agentes de IA tienen un problema de fiabilidad

Deja de confiar en el agente: vincula las aprobaciones a llamadas de herramientas exactas

El bucle agéntico: Una guía práctica de campo

Tu agente no rompió producción. Fue tu pipeline.