How warehouse robots can finish up to 39% more orders in a day

The Big Picture

Choosing which robot, which identical item, and which storage spot to use all at once (a many-to-many assignment) can raise long-run warehouse throughput by up to 38.8% versus the prior best method, while keeping allocation fast enough for online use.

ON THIS PAGE

The Evidence

A scheduling approach called M2M jointly assigns robots, tasks, pickup spots, and delivery spots instead of treating each task as fixed; that keeps robots busier and boosts completed tasks over long runs. M2M begins with a fast suboptimal assignment, improves it with an anytime local-search routine, then plans collision-free robot paths—this balance of speed and global reasoning gives consistent gains. A variant that explicitly tried to spread items around for future benefit (M2M-wSKU) did not improve throughput over the basic M2M, which suggests straightforward many-to-many reasoning is the most practical win today. dynamic routing pattern

Not sure where to start?Get personalized recommendations

Learn More

Data Highlights

1Up to 22,000 more tasks completed on average over an 8-hour simulation—representing up to a 38.8% increase in throughput compared to the prior state-of-the-art.

2Mean throughput improvements vs. the baseline across three warehouse layouts (30% inventory density): +4.95% (restricted), +6.13% (open-top), +0.02% (open).

3Task-allocation was kept operationally practical: allocation steps were capped at 1 second; experiments used 40 robots on 27×50 maps with 4 task arrivals per timestep and 30 SKUs.

What This Means

Robotics engineers and system designers building multi-robot warehouse fleets can use these ideas to get higher sustained throughput, especially in dense storage layouts. Operations and planning leads evaluating scheduler choices will care because many-to-many allocation can make better use of identical inventory and reduce long-run inefficiency. Researchers working on multi-agent task allocation can adopt the decomposition and anytime improvement tricks for larger, online problems. Role-Based Agent Pattern

Key Figures

$Figure 1 : (a) Multi-location inventory management is common practice in warehouse settings, where multiple source and destination locations are available for storing and retrieving goods. In multi-agent pickup and delivery contexts, the choice of source and destination locations can significantly impact system performance (e.g., for agent a i a_{i} (center above) the sequence s 3 d 1 s 4 d 5 s_{3}d_{1}s_{4}d_{5} is far more efficient than s 1 d 3 s 4 d 4 s_{1}d_{3}s_{4}d_{4} ). (b) Current state-of-the-art methods focus on one-to-one allocation between agents and tasks, ignoring multi-location inventory options. (c) We introduce Many-to-Many Multi-Agent Pickup and Delivery, which accounts for agents, tasks, and multiple source and destination locations per item. Our computationally-efficient approach yields significant improvements in task throughput in long-horizon simulations across 3 warehouse layouts.$

Fig 1: Figure 1 : (a) Multi-location inventory management is common practice in warehouse settings, where multiple source and destination locations are available for storing and retrieving goods. In multi-agent pickup and delivery contexts, the choice of source and destination locations can significantly impact system performance (e.g., for agent a i a_{i} (center above) the sequence s 3 d 1 s 4 d 5 s_{3}d_{1}s_{4}d_{5} is far more efficient than s 1 d 3 s 4 d 4 s_{1}d_{3}s_{4}d_{4} ). (b) Current state-of-the-art methods focus on one-to-one allocation between agents and tasks, ignoring multi-location inventory options. (c) We introduce Many-to-Many Multi-Agent Pickup and Delivery, which accounts for agents, tasks, and multiple source and destination locations per item. Our computationally-efficient approach yields significant improvements in task throughput in long-horizon simulations across 3 warehouse layouts.

Fig 2: Figure 2 : Overview of M2M method and M2M-wSKU variant.

$Figure 3 : Partial view of our three simulated warehouse layouts. All non-obstacle regions are traversable by agents, subject to collision constraints. Each map is 27 × \times 50.$

Fig 3: Figure 3 : Partial view of our three simulated warehouse layouts. All non-obstacle regions are traversable by agents, subject to collision constraints. Each map is 27 × \times 50.

Figure 4 : Comparison of the rolling mean (solid) and std-dev (shaded region) of throughput (tasks/min.) metric reported across Restricted (a), Open-Top (b), and Open (c) maps. Our method, M2M (purple), and variant, M2M-wSKU (orange), maintain a higher mean throughput over the baseline LNS-PBS (green) across all map types. Between our two variants, M2M maintains the highest mean throughput over M2M-wSKU.

Fig 4: Figure 4 : Comparison of the rolling mean (solid) and std-dev (shaded region) of throughput (tasks/min.) metric reported across Restricted (a), Open-Top (b), and Open (c) maps. Our method, M2M (purple), and variant, M2M-wSKU (orange), maintain a higher mean throughput over the baseline LNS-PBS (green) across all map types. Between our two variants, M2M maintains the highest mean throughput over M2M-wSKU.

Ready to evaluate your AI agents?

Learn how ReputAgent helps teams build trustworthy AI through systematic evaluation.

Learn More

Keep in Mind

Results come from simulated warehouses that assume perfect localization, noiseless sensing, and no task failures—real-world noise and errors could reduce gains. Experiments used 40 robots and moderately sized maps; scalability to much larger fleets and environments needs further engineering work. The SKU-distribution variant (M2M-wSKU) added planning overhead without throughput benefits in these settings; different cost trade-offs or longer horizons might change that outcome. Inter-Agent Miscommunication

Methodology & More

Many real warehouses store multiple identical items at several locations, so a request can be satisfied by any matching item. Solving that 'many-to-many' assignment—deciding which robot, which item instance, and which drop-off spot to use—creates a four-way assignment problem that is computationally hard if handled naively. M2M tackles this by decomposing the huge four-dimensional cost structure into smaller matrices, building a quick initial allocation by repeatedly picking the best agent-task-start-destination tuple, and then improving the allocation with an anytime Large Neighborhood Search. The final assignments are fed to a priority-based multi-robot path planner to produce collision-free routes. A2A Protocol Pattern In 8-hour simulated warehouse runs across three map layouts and multiple inventory densities, M2M consistently matched or beat the prior state-of-the-art method (which converts many-to-many tasks to fixed single-pick tasks). Gains were largest in denser, more constrained layouts—up to 38.8% higher throughput (about 22,000 extra tasks over 8 hours). A variant that explicitly weighted item distribution to favor spreading inventory (M2M-wSKU) did not improve throughput in these experiments, suggesting that the main benefit comes from joint choice of pickup/delivery locations rather than active redistribution. Allocation steps were kept fast (1 second limit), making the approach practical for online deployment, though real-world validation under sensor noise and failures remains necessary. Planning

Avoid common pitfallsLearn what failures to watch for

Learn More

Credibility Assessment:

University-affiliated authors (University of New Hampshire) and at least one mid-career author (h-index ~11); solid but not top-tier venue or overwhelmingly senior author list.

many-to-many pickup warehouse robots multi-agent orchestration task allocation

Not sure where to start?