An AI That Learns Rules From Its Mistakes to Always Finish the Quest

Key Takeaway

A hybrid agent that learns explicit, causal rules from its own interactions achieves robust, compositionally general behavior—reaching 100% quest success in simulation and dramatically improving first-try success versus purely neural agents.

ON THIS PAGE

Core Insights

Grounding language-based planning in a symbolic, causal world model plus rule induction fixes the common failure of language models to combine known skills in new ways. Letting the agent act, observe outcomes, induce formal rules from failures, and verify future plans leads to systematic learning rather than pattern matching. In a simulated quest environment, the full neuro-symbolic loop (planner + verifier + learner) solved hard, novel tasks that broke standard language-model agents and ablated variants. The combination of inductive rule learning and logical verification was essential—either component alone produced much weaker or unsafe behavior. This aligns with the ReAct Pattern and Chain of Thought Pattern insights.

Explore evaluation patternsSee how to apply these findings

Learn More

Data Highlights

1100% quest success across all tested language-model backbones, including very hard quests designed to force compositional failure.

2First-try success rate was nearly 3× lower for the agent that lacked the verifier (neural theorem prover) compared with the full system.

3Baseline language-model agents and ablated variants experienced catastrophic performance drops on novel scenarios, falling to near 0% quest success in those cases.

What This Means

Engineers building interactive AI assistants, game or robotics teams, and technical leads evaluating agent reliability should pay attention—this shows a practical path to agents that learn safe, generalizable rules from experience. Teams that need interpretable agent behavior or predictable failure modes can use this approach to make planning decisions auditable and self-correcting. Multi-Agent System

Key Figures

Fig 1: Figure 1: The AGEL-Comp neuro-symbolic architecture.

Fig 2: Figure 2: Aggregated Quest Success and First-Try Success Rate by Agent Config.

Fig 3: Figure 3: Aggregated Quest Success and First-Try Success Rate Per LLM Per Agent Config.

Fig 4: Figure 4: Aggregated Iterations and Completion Time Per Agent Config. Per Quest

Ready to evaluate your AI agents?

Learn how ReputAgent helps teams build trustworthy AI through systematic evaluation.

Learn More

Considerations

Results come from a simulated environment designed to probe compositional gaps; real-world domains may introduce noisy perception and richer dynamics that complicate rule induction. The hybrid architecture adds runtime and engineering overhead: maintaining a symbolic world model, an inductive learner, and a verifier takes computation and domain design effort. Success hinges on good perceptual grounding (accurate, structured percepts) and meaningful episodic signals—poor inputs or weak feedback will limit the system's ability to induce correct rules. Agent Service Mesh Pattern

Methodology & More

AGEL-Comp pairs a language model used as a high-level planner with a symbolic, action-grounded world model and two learning/verifying modules: an inductive logic engine that synthesizes human-readable rules from experience and a neural theorem prover that checks the logical consequences of proposed plans. At runtime the agent perceives the environment as structured facts (for example, shiny_coin next_to crackling_fire), the planner proposes a step sequence, and the verifier scores whether that plan leads to undesirable outcomes according to the current world model. When outcomes disagree with expectations (for instance, the agent takes damage after approaching fire), the episode triggers a grounding cycle that performs causal attribution and induces Horn-clause rules such as causes_damage(X) :- is_harmful(X). Those rules update the world model and the verifier is fine-tuned periodically so future plans are both safer and more generalizable. Evaluation used a continuous, zero-shot protocol in the Retro Quest simulator: the agent must generalize within a single run without offline fine-tuning. The full AGEL-Comp loop achieved perfect quest success across tested model backbones and significantly higher first-try success than ablated systems. Ablations showed the duo is necessary: a learner without a verifier produced unsafe, inefficient behavior (first-try success ≈3× worse), while a verifier without an inductive learner could not recover from model gaps and failed to generalize. The upshot for practice is clear: combining explicit, causal rule learning with logical checking turns brittle language-model planning into a self-correcting, interpretable system—at the cost of added engineering and compute for symbolic machinery. Orchestrator-Worker Pattern

Need expert guidance?We can help implement this

Learn More

Credibility Assessment:

ArXiv preprint with low h-index authors (h=3 for lead) and no listed affiliations or citations — fits emerging/limited-info category.

agent reliability compositional generalization agent learning

Not sure where to start?