Overview
Monoculture collapse occurs when agents built on similar models exhibit correlated vulnerabilities. Like agricultural monocultures susceptible to single diseases, agent monocultures can fail simultaneously when encountering their shared weaknesses.
The Risk
Correlated Failures
If all agents use GPT-4:
- Same hallucination patterns
- Same knowledge cutoff blind spots
- Same prompt injection vulnerabilities
- Same reasoning failures on specific problem types
Adversarial Vulnerability
An attack effective against one agent works against all agents.
Blind Spot Amplification
Agent A (GPT-4): Can't solve this math problem.
Agent B (GPT-4): Also fails on the same problem.
Agent C (GPT-4): Agrees with A and B's wrong answer.
Consensus: Confidently incorrect.
Real-World Manifestations
Training Data Gaps
All GPT-based agents might share misinformation from common training data.
Temporal Blind Spots
Models with same knowledge cutoff all lack recent information.
Reasoning Patterns
Same underlying model = same systematic reasoning errors.
Detection
- Track error correlation across agents
- Identify inputs that cause system-wide failures
- Monitor for patterns suggesting shared vulnerabilities
The Gradient Institute Warning
"A collection of safe agents does not imply a safe collection of agents."
Even with unified oversight and aligned objectives, agents can exhibit:
- Cascading errors
- Coordination failures
- Unintended collective behaviors