Socially-Aware Robot Explanations: Inferring Needs from Human Facial Expressions
Introduction
Although AI systems can make decisions, their ability to explain those decisions remains limited, particularly in error-prone situations. This work focuses on interpretability in error detection and examines how different explanation types influence user behaviour. While prior research has explored the causes of robot errors, few studies have investigated how robots should explain error detection, especially when users express social cues indicating that something has gone wrong. We adopt a user-centred approach to transparency, proposing that human facial expressions guide the robot’s explanation behaviour. Specifically, we investigate whether users’ facial expressions can inform when and how robots should provide explanations during collaborative tasks. Using a deep-learning model trained on Facial Action Units from a public HRI dataset, a robot classified states of user confusion.
The model was deployed on a robot arm performing a pick-and-place task, where errors were introduced through random perturbations. Based on user behaviour, the robot detected potential errors and provided explanations to increase model transparency. We tested three XAI methods designed to answer how, why, and what-if questions, each illuminating a different aspect of the robot’s decision-making by explaining its failure detection model. In a user study, participants engaged in a robot-assisted pick-and-place task while receiving the different types of explanations. User responses were analysed through multimodal signals, alongside subjective measures of cognitive load, trust, and model understanding. The results showed that why-explanations were the most preferred and that what-if explanations required more vocal effort. This work demonstrates that facial expressions can be used to tailor when and how often explanations are given, supporting transparent and adaptive human-robot interaction.
Method & Results
We used post-hoc, interactive, and model-agnostic explanations, presented in a consistent format across all methods. Three explanation types were tested using established XAI techniques: How-explanations used global feature importance via SHAP, applying Shapley Value Sampling to show how features generally influenced the model’s output. Why-explanations provided local feature contributions using Kernel-SHAP, which approximates the decision boundary around a given input (based on the LIME framework) to highlight input-specific influence. What-if explanations used counterfactuals to show how minimal changes in two features could alter the model’s prediction, applying the method by Mothilal et al.
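As an illustration, the sketch below shows how these three explanation types could be produced with off-the-shelf libraries: the shap package for the Shapley-based attributions and dice_ml (DiCE, the implementation accompanying Mothilal et al.) for the counterfactuals. This is a minimal sketch rather than the study's actual pipeline; the classifier clf, the data objects X_background, x_query, train_df, query_df, and feature_names are hypothetical placeholders.

```python
# Minimal sketch (not the authors' code) of the three explanation types,
# assuming a trained scikit-learn confusion classifier `clf` over tabular
# facial Action Unit features. All data variables are hypothetical.
import numpy as np
import shap
import dice_ml

predict_confused = lambda X: clf.predict_proba(X)[:, 1]  # P(user confused)

# How: global feature importance via Shapley Value Sampling, averaged over a
# background sample to rank which features drive the model in general.
sampling_explainer = shap.SamplingExplainer(predict_confused, X_background)
global_importance = np.abs(
    sampling_explainer.shap_values(X_background, nsamples=200)
).mean(axis=0)

# Why: local feature contributions via Kernel SHAP (a LIME-style local
# approximation of the decision boundary) for the specific triggering input.
kernel_explainer = shap.KernelExplainer(predict_confused, X_background)
local_contributions = kernel_explainer.shap_values(x_query, nsamples=200)

# What-if: counterfactuals with DiCE (Mothilal et al.), restricted to two
# features to mirror the two-feature minimal changes described above.
data = dice_ml.Data(dataframe=train_df, continuous_features=feature_names,
                    outcome_name="confused")
model = dice_ml.Model(model=clf, backend="sklearn")
cf = dice_ml.Dice(data, model).generate_counterfactuals(
    query_df, total_CFs=1, desired_class="opposite",
    features_to_vary=feature_names[:2])
```

The resulting global importances, local contributions, and counterfactual feature changes can each be rendered as a bar chart, matching the presentation format used in the study.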
To trigger the error detection model, we injected random perturbations into the robot’s input and output, inducing uncertainty. The model, trained on a public dataset of HRI sessions, used weighted classification and softmax probabilities within a sliding window to detect user confusion signals. It then traced back to the estimated onset of the error, which served as the input for the explanation algorithms. Participants performed a voice-controlled pick-and-place task with the robot arm, and the explanations were displayed on an external monitor. The setup included facial expression and body pose tracking, as well as speech processing. Explanations were visualised using bar charts, which are commonly used for tabular data. The robot’s workspace contained PVC pipes, which participants used to assemble various structures.
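For concreteness, the following sketch (an assumption, not the deployed system) illustrates how a sliding window over per-frame softmax probabilities can trigger an explanation and trace back to an estimated error onset. The classifier model, the input stream au_frames, and the window size and threshold are all hypothetical values chosen for illustration.

```python
# Hypothetical sketch of the sliding-window trigger: per-frame softmax
# probabilities of "confused" are averaged over a window; when the mean
# crosses a threshold, the onset is traced back to the earliest frame in the
# window that already exceeded it. `model`, WINDOW, and THRESHOLD are assumed.
from collections import deque
import numpy as np
import torch
import torch.nn.functional as F

WINDOW = 30      # frames, roughly one second at 30 fps (assumption)
THRESHOLD = 0.7  # mean P(confused) needed to trigger an explanation (assumption)

def detect_confusion(model, au_frames):
    """Yield (trigger_frame, estimated_onset_frame) for each detected episode."""
    window = deque(maxlen=WINDOW)
    history = []  # per-frame P(confused), kept for tracing back the onset
    for t, au_vector in enumerate(au_frames):
        with torch.no_grad():
            logits = model(torch.as_tensor(au_vector, dtype=torch.float32).unsqueeze(0))
            p_confused = F.softmax(logits, dim=-1)[0, 1].item()
        window.append(p_confused)
        history.append(p_confused)
        if len(window) == WINDOW and np.mean(window) > THRESHOLD:
            start = t - WINDOW + 1
            onset = next((start + i for i, p in enumerate(history[start:t + 1])
                          if p > THRESHOLD), start)
            yield t, onset
            window.clear()  # avoid re-triggering on the same episode
```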
Several facial expressions were associated with participants’ reactions to different explanation types, with why-explanations eliciting greater expressiveness. What-if explanations, however, were linked to increased vocal effort. For each explanation, we assessed participants’ ability to correctly identify relevant features. We found that how-explanations, based on global feature importance, led to better understanding, likely due to their independence from user-specific input. Overall, participants expressed a clear preference for why-explanations, while what-if explanations were least preferred. Thematic analyses showed that how-explanations were generally seen as clear and easy to follow but lacking personalisation. In contrast, why-explanations were valued for their personalised and user-specific nature, though some found them harder to interpret. Reactions to what-if explanations were mixed: while some participants grasped the counterfactual logic, others found it confusing, which was reflected in their lower ratings.
Summary & Conclusion
Overall, how-explanations were seen as intuitive and easy to understand, but were perceived as impersonal. Why-explanations were the most preferred and perceived as personalised, though how-explanations led to better objective understanding.
What-if explanations, while expected to be straightforward, required greater vocal effort and elicited lower engagement and expressiveness, despite no increase in reported cognitive load. In summary, our findings offer empirical insights into which explanation types are most effective for error detection models based on facial expressions. We hope this work informs future research on robot error detection by emphasising the importance of multimodal cues and encouraging the integration of error causality with detection, placing multimodal HRI at the centre of XAI-driven interaction design.
Presentation: "Socially-Aware Robot Explanations: Inferring Needs from Human Facial Expressions", held at the 3rd TRR 318 Conference: Contextualizing Explanations, 18 June 2025, Bielefeld, Germany.