𝗧𝗵𝗲 𝗠𝗼𝘀𝘁 𝗗𝗮𝗻𝗴𝗲𝗿𝗼𝘂𝘀 𝗟𝗶𝗻𝗲 𝗼𝗳 𝗔𝗜 𝗖𝗼𝗱𝗲

📅6 days ago⏱1 min read

I pushed a breaking change to production. All tests passed. CI was green. The system did what I told it to do. It still broke.

I asked an AI agent to clean up a response. The agent removed a null phone number field. The payload looked cleaner. An old Android app crashed. It needed the field to exist.

Here is the problem. AI often writes the code and the test in one go. The test no longer guards the code. The test mirrors the code.

If the agent changes a field, it updates the test to match. The test passes because the behavior changed. I call these yes-man tests. They give you a green checkmark but no safety.

Breaking changes happen at the boundary between systems. The AI only sees your repo. It does not see the customer app. It does not see the partner API.

Stop relying on PR reviews for this. Humans miss missing keys in large diffs. Computers do not.

Change your safety layer:

Use a frozen contract. Diff output against a fixed spec.
Separate test writing from code changes.
Move breaking-change detection to CI.

Writing code is no longer the bottleneck. Knowing if you broke a user is the bottleneck. Focus on verification.

Source: https://dev.to/deepaksatyam/the-most-dangerous-line-of-code-your-ai-agent-writes-is-the-test-that-passes-23ko

𝗧𝗵𝗲 𝗠𝗼𝘀𝘁 𝗗𝗮𝗻𝗴𝗲𝗿𝗼𝘂𝘀 𝗟𝗶𝗻𝗲 𝗼𝗳 𝗔𝗜 𝗖𝗼𝗱𝗲

Continue reading

𝗠𝗮𝗸𝗲 𝗬𝗼𝘂𝗿 𝗖𝗼𝗱𝗲𝗯𝗮𝘀𝗲 𝗪𝗼𝗿𝗸 𝗙𝗼𝗿 𝗔𝗜 𝗔𝗴𝗲𝗻𝘁𝘀

𝗬𝗼𝘂𝗿 𝗧𝗲𝘀𝘁 𝗦𝘂𝗶𝘁𝗲 𝗜𝘀 𝗟𝘆𝗶𝗻𝗴 𝗧𝗼 𝗬𝗼𝘂

𝗦𝗶𝘅 𝗠𝗼𝗻𝘁𝗵𝘀 𝗢𝗳 𝗔𝗜 𝗪𝗿𝗶𝘁𝗶𝗻𝗴 𝗠𝘆 𝗧𝗲𝘀𝘁𝘀

𝗤𝗔 𝗘𝘅𝗽𝗲𝗿𝗶𝗺𝗲𝗻𝘁𝘀 𝗧𝗵𝗮𝘁 𝗔𝗰𝘁𝘂𝗮𝗹𝗹𝘆 𝗠𝗮𝘁𝘁𝗲𝗿

𝗪𝗵𝗮𝘁 𝗛𝗮𝗽𝗽𝗲𝗻𝗲𝗱 𝗪𝗵𝗲𝗻 𝗜 𝗧𝗼𝗹𝗱 𝗖𝗼𝗱𝗲𝘅 𝘁𝗼 𝗖𝗮𝗹𝗺 𝗗𝗼𝘄𝗻