๐—” ๐—™๐—ถ๐—ฒ๐—น๐—ฑ ๐—š๐˜‚๐—ถ๐—ฑ๐—ฒ ๐˜๐—ผ ๐— ๐˜‚๐—น๐˜๐—ถ-๐—”๐—ด๐—ฒ๐—ป๐˜ ๐—™๐—ฎ๐—ถ๐—น๐˜‚๐—ฟ๐—ฒ ๐— ๐—ผ๐—ฑ๐—ฒ๐˜€

Stop saying agents got confused. Vague words do not help you fix things. You need a clear list of failures.

Cemri et al. studied 1,642 traces from 7 frameworks. They found 14 failure modes in 3 groups.

Group 1: Specification failures. These happen at design time. The agent follows a bad setup.

Fix these first. They are cheap.

Group 2: Coordination failures. These only happen with multiple agents.

Fix: Share full execution traces. Do not share only messages.

Group 3: Verification failures. These have high impact.

Fix: Add a verification step. This raised success by 15.6%.

Finding the bug is hard. Zhang et al. found low accuracy for failure attribution. Agent accuracy was 53.5%. Step accuracy was 14.2%.

Mistakes cascade. One early error causes a later crash.

Your plan:

Source: https://dev.to/tuomo_pisama/a-field-guide-to-multi-agent-failure-modes-59on