27th EAAAI (EANN) 2026, 16 - 19 July 2026, Chania, Crete, Greece

Explaining Agents’ Interactions through their Causal Behavior and Counterfactuals

Islam Mir Riyanul, Barua Shaibal, Ahmed Mobyen Uddin, Begum Shahina

Abstract:

  Reinforcement learning (RL) agents often operate as black boxes, making it difficult to understand their decision-making in dynamic environments. This study proposes a novel framework for explainable RL based on structural causal models (SCMs). Here, the approach learns an SCM of the environment dynamics and reward process in a mobile network simulator (mobile-env ), and uses this causal model to generate counterfactual explanations and perform interventions to understand agent behavior. The approach demonstrates that the learned SCM can closely approximate the environment’s transition dynamics while remaining interpretable. By leveraging do-calculus and counterfactual reasoning, our framework explains the long-term effects of actions through causal chains and highlights key influential factors. Experiments on a wireless network control task show that our method provides meaningful explanations for agent decisions (e.g., why a given action yields a higher reward), with minimal loss in policy performance. The study also presents comparative evaluations against baseline explanation approaches and discusses how our SCM-based explanations improve transparency and trust in RL policies  

*** Title, author list and abstract as submitted during Camera-Ready version delivery. Small changes that may have occurred during processing by Springer may not appear in this window.