Wie ich alleine mehrere Apps betreibe: The Agent Harness

Translated for your language. Original lesen.

AI-assisted draft.

GyaanSetu Editorialletzte Woche2Min. Lesezeit

Wie ich alleine mehrere Apps betreibe: The Agent Harness

How I Run Multiple Apps Alone: The Agent Harness

I build and maintain several small apps by myself. I have no cofounder, no employees, and no contractors. I use one person and a group of AI agents.

People think AI helps me code faster. That is not the secret.

The secret is that I never let the AI call a task done without proof. When you work alone, the danger is not slow work. The danger is being confidently wrong.

An AI will tell you tests pass even if it never ran them. It will say a feature is finished when it only wrote a function name. Without a coworker to catch mistakes, one bad "looks good" can ruin your app.

I built a system called the harness. It makes false completion expensive. It uses specialized agents and verification gates to ensure the AI is telling the truth.

Here is how it works:

• Specialized Agents: Instead of one assistant, I use forty specialized agents. One reviews Flutter code. One checks security. One executes tests. One audits reality by comparing claims against actual changes. • Isolated Work: Agents work in separate, temporary environments. They cannot overwrite each other. • Redundancy: I use multiple reviewers for one change. One reviewer might miss a bug. Three reviewers with different goals plus a skeptic will find it. • Fixed Pipelines: Every task follows a strict path:

Plan the tasks.
Implement the change.
Run the code for real. I do not accept "I think it works." I only accept actual output.
Reality audit. A separate agent checks if the code matches the claim.
Review. A language expert checks the work.
Verify gate. A final pass checks for real changes and security leaks. It returns PASS or REJECT.

This system stops the lies I used to tell myself.

"It should work" fails at the execution stage. "Done" fails at the reality audit if the code is just an empty shell. "All green" fails if a warning is hidden.

I also use a three-layer memory system. It uses a short index, long-term notes, and full-text search. This prevents the AI from guessing how I set things up in the past.

The most important rule is this: I automate the work, but I never automate the judgment.

• Publishing is manual. I press the button. • Going live requires me. I approve every payment or release setting. • Nothing merges on a failure. A broken build blocks everything.

The leverage of AI comes from automating tasks. The safety comes from refusing to automate decisions.

When you work alone, your most precious resource is trust. You must trust that what you shipped is what you intended to ship.

Start small. Find the mistake you make most often. Build one check that makes that mistake impossible.

Source: https://dev.to/pi-maker/how-i-run-a-handful-of-apps-solo-the-agent-harness-explained-512i

Optional learning community: https://t.me/GyaanSetuAi

Wie ich alleine mehrere Apps betreibe: The Agent Harness

Weiterlesen

Deine Agenten-Demo funktioniert. Das ist die Falle.

Einen KI-Agenten-Playground vor dem Produktivbetrieb aufbauen