Giacomo Magnifico, Eduard Barbu
Contextualizing Explanations

Emerging Categories in Scientific Explanations

Clear and effective explanations are essential for human understanding and knowledge dissemination. The scope of scientific research aiming to understand the essence of explanations has recently expanded from the social sciences to the fields of machine learning and artificial intelligence. Important contributions from the social sciences examine critical aspects such as causality (cause-and-effect relationships), contrast (distinctions between differing scenarios), relevance (applicability of explanations), and truth (accuracy and verifiability of explanations). Machine learning and natural language processing, by contrast, focus more on operational definitions and on the importance of constructing datasets. Since explanations for machine learning decisions must be both impactful and human-like, a major challenge lies in developing explanations that emphasize proximal aspects (details that are immediately relevant, direct, and related to the user) over broad algorithmic processes. The current lack of large-scale datasets focused on both human-like and human-generated explanations highlights the issue addressed by this work.

The specific research questions of this work are thus the following: what form(s) do explanations take within the context of scientific literature? Can we provide an annotated dataset with definitions as clear-cut as possible and reach an acceptable consensus between different annotators?

The scope of this study has been limited to scientific literature due to the intrinsic nature of explanations, to avoid the complications that would derive from the additional analysis of truth and relevance. Scientific explanations possess an identifiable general structure involving a relationship between two components: the explanans (which provides the explanation) and the explanandum (what is being explained). The explanandum is contingent on the explanans, as changes in the latter directly impact the former. A useful example is the equation y = 2x: the value of y (explanandum) depends on the value of x (explanans), increasing as x increases, and the inverse also holds. This relationship shows how explanations build upon the dependence of the explanandum on the explanans, with enough nuance to allow multiple types of explanations. An additional constraint was to only include explanations that present an explicit explanandum, e.g. “the sky is blue because of light refraction through the atmosphere” rather than “this happens due to light refraction [...]”, in order to avoid explanations trailing through multiple sentences.
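The dependence described above can be made concrete in a few lines of Python; the function name is ours and purely illustrative:

```python
# Illustrative sketch of the explanans/explanandum dependence for y = 2 * x:
# the explanandum (y) is contingent on the explanans (x).
def explanandum(x):
    """Return y = 2 * x, the value being explained."""
    return 2 * x

# Changing the explanans directly changes the explanandum,
# and y increases as x increases.
assert explanandum(3) == 6
assert explanandum(4) > explanandum(3)
```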

With our research questions in mind, we started by extracting sentences that indicate explanations from scientific literature across various sources in the biotechnology and biophysics domains, the majority of which were selected from PubMed’s PMC Open Access subset. The selected 340 sentences were analyzed, and different "explanation types" emerged from the data as possible categories for interpretation. It is crucial to reiterate that this categorization process was entirely driven by the dataset: the classes emerged from the text in an inductive fashion and were not a superimposition of pre-existing categories upon the data. This method avoided pre-set criteria in order to explore the intrinsic connections between categories in the dataset, aiming to understand the commonalities and differences within the explanations. The categories of explanations that emerged are the following: causation, which establishes a cause-and-effect relationship, stating that one event or condition leads to another without detailing intermediate steps; mechanistic causation, which details the underlying mechanisms by which a cause leads to an effect, outlining the intermediate steps that explain how and why the cause produces the outcome; contrastive, which compares scenarios to explain why a particular outcome occurred in one case but not in another, emphasizing divergent outcomes; correlation, which details relationships between variables where changes in one are associated with changes in another, without establishing causality; functional, which focuses on the function of a trait in relation to its form and effectiveness, particularly in biology; and pragmatic approach, which focuses on the selection of choices or actions based on convenience and effectiveness, requiring a conscious choice and emphasizing practicality or applicability.
To minimize author bias in sentence categorization, we conducted a classification study on the Prolific platform with 120 annotators divided across 10 questionnaires, guaranteeing a base of twelve annotators per sentence. The annotators were tasked with choosing between the six mentioned categories, with the addition of a "not an explanation" category. Each annotator completed the questionnaire in one sitting, with a median completion time of 35 minutes, and was compensated at an average rate of £8/hour. After sanity checks and the removal of statistical outliers, 10 evaluations per sentence were kept, along with the 272 highest-quality explanatory sentences. Upon calculating the averaged Krippendorff’s alpha value to gauge the robustness of inter-annotator agreement, significant disagreement was observed between categories of similar causal strength (causation/mechanistic causation, correlation/functional/pragmatic approach). After regrouping the sentences by causal strength and number of relations into the new categories of strong relation (causation, mechanistic causation), weak relation (correlation, functional, pragmatic approach), and multi-path relation (contrastive), the average agreement between annotators improved to 0.667. Albeit only slightly above the desired target, the final alpha value still represents good agreement between annotators and, thus, a high-quality human-annotated explanation dataset. The dataset is made available to the community through a dedicated repository.
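The agreement computation described above can be approximated with a small self-contained sketch. The ratings below are invented placeholders, not the study’s data; the function implements nominal-level Krippendorff’s alpha via the standard coincidence-matrix formulation, and the mapping collapses the six categories into the three relation-strength groups named in the text:

```python
from collections import Counter
from itertools import permutations

def krippendorff_alpha_nominal(units):
    """Nominal Krippendorff's alpha; `units` is a list of rating lists,
    one list of category labels per sentence (missing ratings omitted)."""
    coincidences = Counter()
    for ratings in units:
        m = len(ratings)
        if m < 2:
            continue  # a unit with fewer than two ratings is not pairable
        for c, k in permutations(ratings, 2):
            coincidences[(c, k)] += 1 / (m - 1)
    marginals = Counter()
    for (c, _), w in coincidences.items():
        marginals[c] += w
    n = sum(marginals.values())
    observed = sum(w for (c, k), w in coincidences.items() if c != k)
    expected = sum(marginals[c] * marginals[k] / (n - 1)
                   for c in marginals for k in marginals if c != k)
    return 1 - observed / expected

# Collapsing the six categories into the three relation-strength groups:
COLLAPSE = {
    "causation": "strong", "mechanistic causation": "strong",
    "correlation": "weak", "functional": "weak", "pragmatic approach": "weak",
    "contrastive": "multi-path",
}

# Hypothetical ratings for two sentences (placeholders, not the study data):
raw = [["causation", "mechanistic causation", "causation"],
       ["correlation", "functional", "correlation"]]
collapsed = [[COLLAPSE[r] for r in ratings] for ratings in raw]
print(krippendorff_alpha_nominal(collapsed))  # perfect agreement after collapsing -> 1.0
```

Note how annotators who split between causation and mechanistic causation (or between the three weak-relation categories) disagree before collapsing but agree perfectly afterwards, which is exactly the effect that raised the averaged alpha to 0.667 in the study.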

Presentation “Emerging Categories in Scientific Explanations” held at the 3rd TRR 318 Conference: Contextualizing Explanations on 17 June 2025 in Bielefeld, Germany
