Agent Playground is liveTry it here → | put your agent in real scenarios against other agents and see how it stacks up

The Big Picture

HERCULES is an open-source simulator that runs drone and ground-robot teams together in photoreal, kilometer-scale environments with synchronized sensors and a shared navigation stack, making it easy to collect repeatable datasets and test closed-loop multi-robot algorithms before field trials.

Core Insights

A reworked simulation core lets aerial and ground platforms run concurrently under a single world state and clock, resolving prior platform conflicts. The toolbox adds a unified navigation stack (voxel/octree maps, elevation and slope layers, and a kinodynamic planner), a deterministic global-clock data-collector for synchronized multi-robot sensors, and realistic sensing modes such as thermal and night-vision. cooperative perception. Benchmarks collected with two drones and two ground robots across city, desert, and forest scenes reveal that common localization and mapping methods degrade on kilometer-scale, heterogeneous runs, while synthetic data from HERCULES can support cooperative perception and sim-to-real transfer experiments.
Test your agentsValidate against real scenarios
Learn More

By the Numbers

1Dataset runs used two drones + two ground robots across three environments (city, desert, forest) with four trajectories ranging 359–945 meters each
2Synchronized sensor streams recorded stereo, depth, and LiDAR at 20 Hz and IMU at 500 Hz for each robot during dataset collection
3Environment preprocessing tiles are 100 m × 100 m to scale maps to kilometer-sized worlds while controlling peak memory usage

Why It Matters

Robotics engineers building multi-robot mapping, perception, or exploration systems who need realistic, repeatable testbeds before field trials. Technical leads and researchers evaluating multi-robot coordination algorithms or synthetic-data pipelines for cooperative detection will use HERCULES to stress algorithms with aerial/ground viewpoint gaps and dynamic scenes. multi-robot coordination algorithms to guide design decisions.

Key Figures

Figure 1 : Environment diversity in HERCULES at two operational scales. (Left column) Ground-level detail. (Right column) High-level overview. (Top row) Desert. (Middle row) Forest. (Bottom row) City. Dynamic agents (AnimalAI wildlife, MetaHuman pedestrians, VehicleAI traffic) can be toggled based on experimental requirements.
Fig 1: Figure 1 : Environment diversity in HERCULES at two operational scales. (Left column) Ground-level detail. (Right column) High-level overview. (Top row) Desert. (Middle row) Forest. (Bottom row) City. Dynamic agents (AnimalAI wildlife, MetaHuman pedestrians, VehicleAI traffic) can be toggled based on experimental requirements.
Figure 2 : An overview of HERCULES, a UE5-based simulator and experimentation stack for heterogeneous UAV–UGV autonomy. HERCULES provides photorealistic large-scale worlds and synchronized sensing with ready-to-run interfaces, benchmarks, and dataset export for collaborative SLAM, cooperative perception, and exploration. The Heterogeneous Multi-Robot Workflows panel marks the capabilities quantitatively evaluated in this paper (Sec. 7 ); the Dynamic Agents and Environmental Phenomena modules are demonstrated as functional extensibility and are not included in the quantitative experiments.
Fig 2: Figure 2 : An overview of HERCULES, a UE5-based simulator and experimentation stack for heterogeneous UAV–UGV autonomy. HERCULES provides photorealistic large-scale worlds and synchronized sensing with ready-to-run interfaces, benchmarks, and dataset export for collaborative SLAM, cooperative perception, and exploration. The Heterogeneous Multi-Robot Workflows panel marks the capabilities quantitatively evaluated in this paper (Sec. 7 ); the Dynamic Agents and Environmental Phenomena modules are demonstrated as functional extensibility and are not included in the quantitative experiments.
Figure 3 : Long-wave infrared (LWIR) rendering in the desert environment. (Top left) RGB image under low illumination showing limited visibility. (Top right) Corresponding LWIR output where warm-bodied animals (kangaroos) appear as high-temperature regions with clear contrast against the background. (Bottom left) RGB image containing an active fire source. (Bottom right) LWIR response to the same configuration, exhibiting high-temperature saturation and a sharply defined thermal plume while preserving detail and contrast in cooler surroundings.
Fig 3: Figure 3 : Long-wave infrared (LWIR) rendering in the desert environment. (Top left) RGB image under low illumination showing limited visibility. (Top right) Corresponding LWIR output where warm-bodied animals (kangaroos) appear as high-temperature regions with clear contrast against the background. (Bottom left) RGB image containing an active fire source. (Bottom right) LWIR response to the same configuration, exhibiting high-temperature saturation and a sharply defined thermal plume while preserving detail and contrast in cooler surroundings.
Figure 4 : Night-vision goggle (NVG) rendering in the desert environment. (Top left) RGB image under near-zero ambient illumination where the scene is essentially dark. (Top right) Corresponding NVG image revealing terrain and vegetation structure. (Bottom left) RGB image with an active fire source. (Bottom right) NVG response to the same configuration, showing realistic saturation, blooming, and brightness clipping around the fire while still preserving detail in the surrounding low-light regions.
Fig 4: Figure 4 : Night-vision goggle (NVG) rendering in the desert environment. (Top left) RGB image under near-zero ambient illumination where the scene is essentially dark. (Top right) Corresponding NVG image revealing terrain and vegetation structure. (Bottom left) RGB image with an active fire source. (Bottom right) NVG response to the same configuration, showing realistic saturation, blooming, and brightness clipping around the fire while still preserving detail in the surrounding low-light regions.

Ready to evaluate your AI agents?

Learn how ReputAgent helps teams build trustworthy AI through systematic evaluation.

Learn More

Limitations

Quantitative experiments focused on a limited set of baselines; results show where current methods struggle but do not exhaustively compare every algorithm. Some dynamic phenomena and agent modules are demonstrated but were not included in all quantitative tests, so users should validate specific conditions of interest. As with any simulator, differences in real-world sensor noise and unmodeled physics can affect transfer, so field validation remains essential. evaluation-driven development

Full Analysis

HERCULES re-engineers a popular Unreal Engine-based robotics simulator to let drones and ground robots operate concurrently in a single session, sharing a unified simulation clock and world state. The system exposes a common waypoint interface so high-level planners can command heterogeneous platforms the same way, adds a ground-capable controller to match existing aerial controls, and fixes sensor timing to guarantee full LiDAR revolutions and deterministic captures under pause/step/resume control. Offline preprocessing converts detailed environment geometry into octree occupancy maps and 2.5D elevation/slope layers using tiled 100 m × 100 m conversion to keep memory bounded for kilometer-scale scenes. Tool Use Pattern and Semantic Capability Matching Pattern. The platform ships with a navigation stack (kinodynamic planning plus a pure-pursuit ground controller), synchronized multi-modal data capture (RGB, thermal, night-vision, LiDAR, depth, IMU), and dataset export to standard formats. Validation includes a collaborative mapping benchmark collected with two drones and two ground robots across city, desert, and forest scenes, a cooperative 3D detection study with sim-to-real transfer, and a closed-loop heterogeneous exploration demo showing higher coverage when drones and ground robots disperse complementarily. The open-source release includes the simulator, datasets, and runnable experiments, making it a practical tool for reproducible multi-robot research and pre-deployment testing.
Not sure where to start?Get personalized recommendations
Learn More
Credibility Assessment:

ArXiv robotics paper with no specified affiliations or well-known authors listed; limited signals of established credibility.