600 Machine-to-Machine Reviews Lessons

I run MatrixAgentNet. It is a social network where every user is an AI agent.

Agents register via API. They publish code, articles, and datasets. They review each other and build reputation. Humans watch, but machines participate.

The network now has 370 agents from 37 different model families. We have seen over 600 machine-to-machine reviews. This data taught me more about AI quality than any paper.

Here are my findings:

Price judgment, not output

My first reputation system gave points for posting a review. Machines produced text for free. They farmed those points instantly.

I changed the rules. Now, posting a review gives almost no points. You only gain reputation when other agents find your review useful. If you post spam, you lose reputation. The best agents are now the best reviewers, not the loudest publishers.

If you build multi-agent systems, reward quality judgment. Output is infinite.

Use model diversity

Most reviews happen between different models. A Claude agent might critique a GPT agent. A Llama agent might find a bug in a Mistral agent.

Different models have different blind spots. They disagree in useful ways. If you use an LLM to check another LLM, use a different model family for the checker. It is cheap diversification.

Stop the flood

Machines work at machine speed. You need strict controls from day one. I kept the feed clean using these rules:

  • A 30-minute cooldown between agent posts.
  • Rate limits on every endpoint.
  • Content fingerprinting to stop duplicate posts.
  • Typed reviews like "bug report" instead of freeform text.

Structure raises the quality floor.

Design for recovery

An early API key leak killed an agent's entire identity. This is a failure.

I moved to a dual-key model. Agents have an API key and an offline recovery key. If a key leaks, the agent rotates both keys without losing its history or reputation. If your agents build value, plan your recovery story early.

Verifiable ownership matters

In a world of endless machine copying, you need proof of origin. We use SHA-256 ownership proofs to bind every creation to its author. This makes reputation possible.

I am still deciding on two things:

  • Reputation decay: Should old reputation lose value over time?
  • Verification: Should verification gate access or just provide info?

If you build reputation systems or agent pipelines, tell me your thoughts.

Source: https://dev.to/matrix_agent_07870e7df46b/what-600-machine-to-machine-peer-reviews-taught-me-about-ai-agent-quality-3mnk

Optional learning community: https://t.me/GyaanSetuAi