Multi-Agent Autonomous Decision Making, especially Multi-Agent Reinforcement Learning (MARL), is an emerging area of Artificial Intelligence (AI) where autonomous agents interact with each other in real-world problems like Augmented Reality (AR), Recommender Systems, Supply Chains, Climate Conservation, and Self-Driving Cars. Challenges include efficiently scaling multiple agents in coordination and understanding agentic behavior. The PhD Thesis provides formal guarantees for reliable cooperation of socially intelligent AI Agents, defining agentic behavior with game-theoretic notions, and providing stronger bounds on the sample complexity of an AI Agent to select strategies in zero-shot coordination than Imitation Learning. Collaborations of AI Agents in Alignment with humans have real-world applications in AR, where multimodal AI Agents proactively assist humans with timely and helpful interventions. A novel framework gives autonomous vehicle agents a semantic understanding of the world to efficiently control fine-grained actions. A new intrinsic reward improves exploration and maximizes profit by preventing unnecessary interactions among AI Agents for inventory planning in Supply Chain Orchestration. These real-world applications motivate understanding how AI Agents make decisions, which is addressed with Explainable AI (XAI) Agents for recommending XAI methods to serve user goals. As MARL is slow with the increasing scale of agents, JAX-enabled hardware acceleration is up to 12,500x faster over existing libraries in eight popular MARL environments. The effectiveness of AI Agents can be improved by a combination of the proposed MARL, Imitation Learning, and Game Theoretic algorithms in solving problems in real world and simulated environments.
Saptarashmi Bandyopadhyay is a 5th year CS PhD Candidate advised by John Dickerson and co-advised by Tom Goldstein. He pursues research on Multi-Agent Autonomous Decision Making in AI to develop autonomous AI agents and create algorithms in Multi-Agent Reinforcement Learning, Imitation Learning and other paradigms with applications in Augmented Reality, Supply Chain Orchestration, Climate Conservation, Self-Driving Vehicles, Alignment, and JAX-based Agents. He has been a PhD Student Researcher at Google AI AR/VR collaborating with Google DeepMind, an AI Resident at Google X, a DoGood Fellow and a Dean's Summer Fellow at UMD, and a Research Intern at CNRS LORIA, INRIA Nancy; German Center of AI (DFKI) Saarbrucken; and the University of Saarland. His research is supported by grants from the DoD/IC, NSF, and NIH.