𝗦𝗰𝗵𝗲𝗺𝗮 𝗩𝗮𝗹𝗶𝗱𝗮𝘁𝗶𝗼𝗻 𝗜𝘀 𝗡𝗼𝘁 𝗜𝗻𝘁𝗲𝗻𝘁 𝗩𝗮𝗹𝗶𝗱𝗮𝘁𝗶𝗼𝗻

📅5 days ago⏱1 min read

You use Pydantic for tool calls. You think the agent is safe. It is not.

Pydantic checks the shape. It does not check the intent.

We tracked 40 tool call failures.

9 were schema errors. The validator caught these.
18 used the wrong tool. These passed.
13 used the right tool with wrong values. These passed.

31 of 40 calls sailed through validation. They looked correct but were wrong.

A call to cancel an order is structurally perfect. But the user wanted to cancel a subscription. The validator sees a string ID. It passes the call. The user stays angry.

Shape is not intent.

Fix this with a deterministic pre-check.

Check the state before the tool runs.

Does the ID exist?
Does the user own it?
Is the status correct for this action?

This stops the wrong-argument errors.

Wrong tool selection is harder. An LLM judge often makes the same mistakes as the agent.

For destructive tools, use a human confirmation step. Ask the user to agree before the action happens.

Source: https://dev.to/james_oconnor_dev/your-schema-validation-passes-and-the-agent-still-picks-the-wrong-tool-the-bug-is-semantic-2i41

Optional learning community: https://t.me/GyaanSetuAi

𝗦𝗰𝗵𝗲𝗺𝗮 𝗩𝗮𝗹𝗶𝗱𝗮𝘁𝗶𝗼𝗻 𝗜𝘀 𝗡𝗼𝘁 𝗜𝗻𝘁𝗲𝗻𝘁 𝗩𝗮𝗹𝗶𝗱𝗮𝘁𝗶𝗼𝗻

Continue reading

𝗧𝗵𝗲 𝗤𝘂𝗲𝗿𝘆 𝗪𝗮𝘀 𝗦𝘁𝗶𝗹𝗹 𝗔 𝗟𝗶𝗲

𝗕𝘂𝗶𝗹𝗱𝗶𝗻𝗴 𝗔𝗴𝗲𝗻𝘁𝗚𝘂𝗮𝗿𝗱𝗶𝗮𝗻: 𝗔 𝗟𝗼𝗰𝗮𝗹 𝗙𝗶𝗿𝘀𝘁 𝗔𝗜 𝗦𝗲𝗰𝘂𝗿𝗶𝘁𝘆 𝗦𝗰𝗮𝗻𝗻𝗲𝗿

𝗔𝗜 𝗔𝗴𝗲𝗻𝘁 𝗦𝗮𝗻𝗱𝗯𝗼𝘅𝗲𝘀 𝗳𝗼𝗿 𝗦𝗮𝗮𝗦

𝗦𝘁𝗼𝗽 𝗕𝗮𝗱 𝗗𝗮𝘁𝗮 𝗙𝗿𝗼𝗺 𝗕𝗿𝗲𝗮𝗸𝗶𝗻𝗴 𝗬𝗼𝘂𝗿 𝗔𝗣𝗜

𝗩𝗮𝗹𝗶𝗱𝗮𝘁𝗲 𝗣𝘆𝗱𝗮𝗻𝘁𝗶𝗰 𝗦𝗰𝗵𝗲𝗺𝗮𝘀 𝗕𝗲𝗳𝗼𝗿𝗲 𝗟𝗟𝗠 𝗖𝗮𝗹𝗹𝘀