Planning and coordination of multiple agents in the presence of uncertainty and noisy sensors is extremely hard. A human operator who observes a multi-agent team can provide valuable guidance to the team based on her superior ability to interpret observations and assess the overall situation. We propose an extension of decentralized POMDPs that allows such human guidance to be factored into the planning and execution processes. Human guidance in our framework consists of intuitive high-level commands that the agents must translate into a suitable joint plan that is sensitive to what they know from local observations. The result is a framework that allows multi-agent systems to benefit from the complex strategic thinking of a human supervising them. We evaluate this approach on several common benchmark problems and show that it can lead to dramatic improvement in performance.
Funding
National Key R&D Program of China (Grant No. 017YFB1002204)
National Natural Science Foundation of China (Grant No. U1613216, Grant No. 61603368)
Guangdong Province Science and Technology Plan (Grant No. 2017B010110011)
History
Published in
PRIMA 2020: Principles and Practice of Multi-Agent Systems. 23rd International Conference Nagoya, Japan, November 18–20, 2020
Proceedings
Pages
182 - 198
Source
PRIMA 2020: Principles and Practice of Multi-Agent Systems