Context-dependent Effects of Explanations on Multiple Layers of Trust
The rise of highly performant deep learning approaches results in a growing number of possible applications in many domains. In many fields, such as medicine, human agency and oversight are crucial, that is, complex decision-making processes should be performed by human-AI teams. Since machine-learned models typically operate as opaque systems, they need to be complemented with methods that allow humans to evaluate whether such models can be relied upon. This requirement has inspired research on explainable AI (XAI) with a growing number of different methods. Increasing the transparency of systems has been considered an important means for users to calibrate their trust in AI systems, i.e., avoiding undertrust to ensure that the system is utilized effectively while also preventing overtrust to avoid blind reliance and to recognize potential errors.
However, recent research suggests that transparency measures do not guarantee trust calibration or better performance. The use of explanations can indeed improve users' confidence in the results of a system, reveal hidden biases, and help to improve the model, but it can also increase the users' cognitive workload or even undermine their trust. Transparency often falls short of interpretability: exposing a system's inner workings does not ensure that this information is meaningful and comprehensible to humans. Moreover, explanations can differ in their fidelity, i.e., they can be more or less consistent with the explained outcome of the system. The widely used post hoc explanation methods in particular lack a ground truth for the 'real' explanation and can be considered black boxes themselves, which also demand trust. However, most non-expert users might not become aware of this unless they are presented with multiple inconsistent explanations.
In addition, approaches to evaluating these explanations are themselves subject to uncertainty. Human judgments of explanation quality are inherently subjective, often biased, and tend to favor simplified explanations. Computational fidelity metrics have also been shown to exhibit inconsistencies and lack precision when applied to non-linear models. Thus, the evaluation of explanations yields results that themselves must be trusted.
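To make the notion of a computational fidelity metric concrete, the following is a minimal sketch of a deletion-style faithfulness score. The names (`model_predict`, `attribution`, `baseline`, `steps`) are hypothetical, and the choice of baseline value and deletion granularity are assumptions; in practice, exactly such design choices contribute to the inconsistencies noted above.

```python
import numpy as np

def deletion_fidelity(model_predict, x, attribution, baseline=0.0, steps=10):
    """Sketch of a deletion-based fidelity score (hypothetical interface).

    Features are removed (set to `baseline`) in order of decreasing
    attribution; a faithful explanation should make the model's score
    for the originally predicted class drop quickly. The mean score
    over the deletion curve is returned (lower = more faithful).
    """
    x = np.asarray(x, dtype=float)
    order = np.argsort(-np.asarray(attribution))       # most important feature first
    original_class = int(np.argmax(model_predict(x[None])[0]))
    x_pert = x.copy()
    chunk = max(1, len(order) // steps)
    scores = []
    for i in range(0, len(order), chunk):
        x_pert[order[i:i + chunk]] = baseline           # "delete" the next block of features
        scores.append(model_predict(x_pert[None])[0][original_class])
    return float(np.mean(scores))
```

Even in this simple form, the score depends on the baseline used to "delete" features and on the granularity of the deletion steps, so two equally reasonable configurations can rank the same explanations differently.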
We argue that every intervention or technique that aims to improve transparency or human oversight may itself introduce uncertainty and, therefore, adds another layer of required trust on top of trust in the system's outcomes: trust in the explanations and trust in the fidelity metrics used to evaluate explanations. Depending on the features of a system, other layers can become relevant, such as trust in appropriate corrigibility, i.e., the question of whether corrections are integrated in the intended way or whether the system can be corrupted by false or manipulated corrections, or trust in the correct adaptation of a learning system to the user's preferences, and so forth. Every layer can shift the focus to aspects other than the system's outcome, and inconsistencies between these layers can lead to a weighing of credibility and influence the users' perception of the system. In our view, whether these layers become apparent to the human and relevant for the perceived trustworthiness of a system depends strongly on contextual factors such as the expertise of the trustor (AI experts, domain experts, and non-experts), the presentation of the information, and the type, severity, and importance of the joint task.
From an ethical perspective, the sustainability of trust in the system is important. Cognitive dissonance may occur if users feel that the explanations nudge them into decisions rather than supporting reflected decisions that they own in human-AI teams. Such impressions, however, will not be reflected in behavioral data. If a decision affects other people, cognitive dissonance may even turn into ethical dissonance. This is the case when users perceive a gap between their internalized moral standard of making self-determined decisions and the experienced reality of having merely nodded through the system's explanation. The ethical distinction between manipulative and persuasive AI is instructive here. The literature on manipulation, which often focuses on the intentions of the manipulator, is not easily transferable to the realm of human-AI interaction. The explanations of manipulative AI may well induce as much trust as those of persuasive AI in the short run, but the long-term effects of the two might differ. Therefore, it seems important to complement behavioral measures that investigate reliance on a system's output and explanations by eliciting users' self-reflections on their decision-making autonomy.
The relationship between explanations, trust, reliance and human-AI team performance remains complex and requires further research. Open questions persist around evaluating XAI methods with regard to their fidelity and their impact on trust and performance. This underscores the need for controlled empirical studies with different user groups considering their individual information needs and the layers of trust that are associated with them.
Explanations and other transparency measures should be presented in a way that makes them as beneficial as possible for the user. Since they can introduce uncertainties, increase cognitive workload, or induce cognitive dissonance, they should be used and implemented carefully so that the benefits outweigh these costs. What is beneficial for the user is, to a substantial degree, subjective and may depend on the context and the users' perceptions of their role in the human-AI team under specific XAI methods.
Presentation: "Context-dependent Effects of Explanations on Multiple Layers of Trust", held at the 3rd TRR 318 Conference: Contextualizing Explanations, 17 June 2025, Bielefeld, Germany.