𝗧𝗵𝗲 𝗧𝗵𝗶𝗻𝗴 𝗬𝗼𝘂 𝗩𝗲𝗿𝗶𝗳𝗶𝗲𝗱 𝗜𝘀 𝗡𝗼𝘁 𝗧𝗵𝗲 𝗧𝗵𝗶𝗻𝗴 𝗧𝗵𝗮𝘁 𝗥𝘂𝗻𝘀

Translated for your language. Read the original.

AI-assisted draft.

GyaanSetu Editorial2 tygodnie temu3min read

A new tool recently gained attention. It sits in front of commands like curl and shows you the script before it runs. It highlights dangerous parts. This tool is helpful, but it misses the core problem.

The problem is not whether the bytes look malicious. The problem is that a URL can serve one script today and a different one tomorrow. Your check only applies to one moment in time.

Systems experts call this TOCTOU. It stands for time-of-check to time-of-use. You check a file, then someone swaps it before you open it. Your check was correct, but it was correct about a thing that no longer exists.

AI agents make this risk much higher. Agents perform checks constantly.

An agent pings a URL and treats a successful response as a sign of safety.
An agent reads a profile and treats a declaration as a fact.
An agent sees a signature and assumes the exact bytes it is about to run are the ones that were signed.

Each check attaches trust to a moment or a channel. The agent then acts on something downstream that the check never covered.

For example, an agent might validate a tool manifest and cache the result. If the endpoint changes before the agent calls the tool, the agent runs the wrong version. The validation passed, but it passed for a manifest the agent no longer uses.

Trying to fix this by scanning harder does not work. More rules only narrow the window. They do not close it. A producer can still serve a different artifact in the milliseconds between your scan and your execution.

To fix this, stop verifying the moment. Start verifying the artifact.

Bind your decisions to an immutable object instead of a fetch.

Do not approve a URL.
Approve a specific content hash.
Better yet, approve a hash that a trusted key signed.

This changes the question from "is this text scary?" to "is this the exact artifact the key vouched for?" If the hash does not match, you refuse it. There is no debate.

This approach also makes verification portable. A third party can take the same hash and signature to verify the result themselves. This is a property of the object, not a property of your afternoon.

Use these two questions to test any verification:

Czy sprawdzenie jest powiązane z konkretnym używanym artefaktem, czy z momentem i obietnicą?
Czy osoba postronna może ponownie przeprowadzić sprawdzenie i dojść do tego samego werdyktu?

Jeśli odpowiedzią na pierwsze pytanie jest moment, sprawdzenie ma datę ważności. Jeśli odpowiedzią na drugie jest „nie”, nie masz weryfikacji. Masz jedynie świadectwo.

Większość obecnych metod weryfikacji agentów to jedynie świadectwo. Stwierdzenia typu „uścisk dłoni zakończył się sukcesem” lub „skan był czysty” są prawdziwymi informacjami o danym momencie, ale nie są przypisane do bajtów, które faktycznie są uruchamiane.

Agenci działają tysiące razy bez nadzoru człowieka. Jeśli nie zakotwiczysz zaufania w artefaktach, cały łańcuch dziedziczy najsłabsze i najstarsze sprawdzenie.

Nie potrzebujesz nowej technologii, aby to naprawić. Adresowanie treści i podpisy cyfrowe mają dziesięciolecia. Musisz po prostu skierować je na właściwy obiekt: dokładne bajty, które są wykonywane, a nie żądanie, które je pobrało.

Zanim zaufasz sprawdzeniu, dowiedz się, do czego jest ono przypisane. Jeśli jest przypisane do momentu, to już wygasło.

Źródło: https://dev.to/anp2network/the-thing-you-verified-is-not-the-thing-that-runs-hnl

Opcjonalna społeczność edukacyjna: https://t.me/GyaanSetuAi

𝗧𝗵𝗲 𝗧𝗵𝗶𝗻𝗴 𝗬𝗼𝘂 𝗩𝗲𝗿𝗶𝗳𝗶𝗲𝗱 𝗜𝘀 𝗡𝗼𝘁 𝗧𝗵𝗲 𝗧𝗵𝗶𝗻𝗴 𝗧𝗵𝗮𝘁 𝗥𝘂𝗻𝘀

Continue reading

𝗔𝗜 𝗔𝗴𝗲𝗻𝘁𝘀 𝗛𝗮𝘃𝗲 𝗔 𝗥𝗲𝗹𝗶𝗮𝗯𝗶𝗹𝗶𝘁𝘆 𝗣𝗿𝗼𝗯𝗹𝗲𝗺

Przestań ufać agentowi: powiąż zatwierdzenia z konkretnymi wywołaniami narzędzi

Pętla agentowa: Praktyczny przewodnik

Twój agent nie zepsuł produkcji. Zrobił to Twój pipeline.