𝗖𝗼𝗱𝗲𝘅 𝗙𝗶𝘅𝗶𝗻𝗴 𝗖𝗼𝗱𝗲𝘅: 𝗔 𝗖𝗼𝗻𝘀𝗲𝗻𝘀𝘂𝘀 𝗟𝗼𝗼𝗽

Translated for your language. Read the original.

AI-assisted draft.

GyaanSetu Editorialsaa 13 zilizopita2min read

I built an agent loop that does more than suggest code. It writes code, reviews it, and merges its own pull requests.

To test it, I pointed the loop at a fork of the codex CLI. I let the agents try to fix the software themselves. This is a pure experiment. The fork has no users and no stars. This is about the mechanism, not a product.

Here is how the loop works:

Intake: An upstream bug becomes an issue in the fork. The loop only picks small, mechanical bugs it can finish.
Solvers Argue: Multiple agents propose different fixes. One solver wants the smallest change. Another wants clean structure. A third wants to delete code instead of adding it. They disagree.
Judge Arbitrates: A judge reads the debate. If solvers disagree, the judge sends them back for more rounds. The judge also records why it rejected certain ideas.
Implement and Merge: Once they reach consensus, the loop writes the patch, runs tests, and opens a PR. If tests pass, it merges itself.

You can see this in action in issue #34. The agents debated a concurrency bug. They went through three rounds of arbitration before reaching a decision. The loop produced a real fix and a regression test without a human typing a single line of code.

One interesting result happened in PR #16. The loop could not reproduce a reported bug. Instead of making up a fake fix, it simply added a test to lock the behavior and stopped. A loop that knows when not to patch is more useful than one that always produces a diff.

The loop has merged about 16 PRs so far. It handles small tasks like UTF-8 handling and command fixes. It does not maintain a whole codebase, but it closes small, bounded bugs from start to finish.

Humans still set the rules and review the work. We still check every PR. The code is automatic, but the attention is human.

You can see the entire process on GitHub. Look at issue #34 and PR #37 to see the debate.

Source: https://dev.to/nwnwnw413/codex-fixing-codex-a-consensus-loop-that-argues-judges-and-merges-its-own-prs-11bh

Optional learning community: https://t.me/GyaanSetuAi

𝗖𝗼𝗱𝗲𝘅 𝗙𝗶𝘅𝗶𝗻𝗴 𝗖𝗼𝗱𝗲𝘅: 𝗔 𝗖𝗼𝗻𝘀𝗲𝗻𝘀𝘂𝘀 𝗟𝗼𝗼𝗽

Continue reading

𝗖𝗼𝗺𝗽𝗶𝗹𝗶𝗻𝗴 𝗧𝗵𝗲 𝗣𝗿𝗼𝗰𝗲𝘀𝘀, 𝗡𝗼𝘁 𝗧𝗵𝗲 𝗖𝗼𝗱𝗲

𝗜 𝗥𝘂𝗻 𝗮 𝗦𝗲𝗹𝗳 𝗜𝗺𝗽𝗿𝗼𝘃𝗲𝗺𝗲𝗻𝘁 𝗟𝗼𝗼𝗽 𝗼𝗻 𝗺𝘆 𝗔𝗴𝗲𝗻𝘁 𝗘𝘃𝗲𝗿𝘆 𝗡𝗶𝗴𝗵𝘁

𝗛𝗼𝘄 𝗜 𝗕𝘂𝗶𝗹𝘁 𝗔 𝗣𝗲𝗿𝘀𝗼𝗻𝗮𝗹 𝗔𝗜 𝗦𝘂𝗽𝗲𝗿 𝗔𝗽𝗽

𝗬𝗼𝘂𝗿 𝗔𝗴𝗲𝗻𝘁 𝗖𝗵𝗲𝗰𝗸𝗲𝗱 𝗘𝘃𝗲𝗿𝘆𝘁𝗵𝗶𝗻𝗴. 𝗜𝘁 𝗪𝗮𝘀 𝗦𝘁𝗶𝗹𝗹 𝗪𝗿𝗼𝗻𝗴.