Fuzzy Centered Explainable Network for Reinforcement Learning

Publisher:
Institute of Electrical and Electronics Engineers (IEEE)
Publication Type:
Journal Article
Citation:
IEEE Transactions on Fuzzy Systems, 2024, 32, (1), pp. 203-213
Issue Date:
2024-01-01
File: Fuzzy Centered Explainable Network for Reinforcement Learning.pdf (Accepted version, Adobe PDF, 1.3 MB)
The explainability of reinforcement learning (RL) models has received a vast amount of interest as their applications have widened. Most existing explainable RL models focus on improving the explainability of an agent's observations rather than the relationships between agent states and actions. This study presents a fuzzy centered explainable network (FCEN) for RL tasks that interprets the relationships between agent states and actions. The proposed FCEN leverages the interpretability of fuzzy neural networks to establish if-then rules and a generative model to visualize the learned knowledge. Specifically, the FCEN includes if-then rules that formulate state-action mappings with human-understandable logic of the form 'IF Input is A THEN Output is B.' In addition, these rules connect to a generative model that concretizes states into human-understandable patterns (figures). Experimental results on four Atari games show that the proposed FCEN achieves a high level of performance in RL tasks and substantially boosts the explainability of RL agents both globally and locally. In other words, the FCEN maintains a high-level explanation of the agent's decision logic while allowing low-level analysis of each observation sample. This explainability boost does not undermine reward learning performance; humans can even use the provided explanations to enhance the agent's performance.
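To make the 'IF Input is A THEN Output is B' idea concrete, the sketch below shows one common way such a fuzzy rule layer can map states to action preferences. This is an illustrative reconstruction, not the paper's implementation: the Gaussian membership functions, product-based fuzzy AND, rule count, and normalization scheme are all assumptions made for the example.

```python
# A minimal fuzzy if-then rule layer for state-action mapping (illustrative
# sketch only; the FCEN's actual architecture is described in the paper).
import numpy as np

class FuzzyRuleLayer:
    def __init__(self, state_dim, n_rules, n_actions, seed=0):
        rng = np.random.default_rng(seed)
        # Each rule's antecedent "A" is a region of state space, here modeled
        # as a Gaussian with a learnable center and width per dimension.
        self.centers = rng.normal(size=(n_rules, state_dim))
        self.widths = np.ones((n_rules, state_dim))
        # Each rule's consequent "B" is a preference vector over actions.
        self.consequents = rng.normal(size=(n_rules, n_actions))

    def firing_strengths(self, state):
        # Gaussian membership per dimension, combined with a product (fuzzy AND).
        z = (state - self.centers) / self.widths
        memberships = np.exp(-0.5 * z ** 2)
        return memberships.prod(axis=1)

    def forward(self, state):
        w = self.firing_strengths(state)
        w = w / (w.sum() + 1e-8)  # normalize rule activations
        # Action scores are a firing-strength-weighted sum of rule consequents.
        return w @ self.consequents, w

layer = FuzzyRuleLayer(state_dim=8, n_rules=5, n_actions=4)
scores, rule_weights = layer.forward(np.zeros(8))
print("chosen action:", scores.argmax(), "most active rule:", rule_weights.argmax())
```

The per-rule activations `rule_weights` indicate which if-then rule drove a given action, which is the kind of local, per-observation explanation the abstract describes; in the FCEN, each rule is additionally tied to a generative model that renders its antecedent as a human-readable figure.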