Subhashree Panda, Marika Kaden, Thomas Villmann
Contextualizing Explanations

Stability of Model Explanations in Interpretable Prototype-based Classification Learning

Introduction

Learning vector quantization (LVQ), as originally proposed by Kohonen and mathematically justified as Generalized LVQ (GLVQ), constitutes a classifier for multi-class learning of vector data $\bold{x} \in \R^n$ based on the nearest prototype principle (NPP). GLVQ is known to be a robust classifier that inherently maximizes the hypothesis margin during training. GLVQ can be combined with class-related data embedding learning by a linear transformation $\bold{\Omega} \bold{x} \in \R^m$, proposed as Generalized Matrix LVQ (GMLVQ). The basic prototype learning scheme distributes the class-dependent prototype vectors $\mathcal{P} = \{ \bold{p}_1, \ldots, \bold{p}_M \} \subset \R^m$, where the prototypes are equipped with class labels $c (\bold{p}_k) \in \mathcal{C}$ indicating their class responsibilities such that each class is represented by at least one prototype. Both prototype and embedding learning are usually realized by stochastic gradient descent using an approximation of the overall classification error.
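The nearest prototype principle under a learned embedding can be illustrated by a minimal sketch (the function name `npp_classify` and the toy matrices are hypothetical and only serve to make the classification rule concrete):

```python
import numpy as np

def npp_classify(x, Omega, prototypes, labels):
    """Classify x by the nearest prototype principle (NPP) in the
    embedded space: return the class label of the prototype p_k
    minimizing (Omega x - p_k)^T (Omega x - p_k)."""
    z = Omega @ x                              # embed the data vector into R^m
    d = np.sum((prototypes - z) ** 2, axis=1)  # squared Euclidean distances
    return labels[np.argmin(d)]                # label of the winning prototype

# toy example: n = 3 features embedded into m = 2 dimensions,
# one prototype per class (labels 0 and 1); values are illustrative
Omega = np.array([[1.0, 0.0, 0.0],
                  [0.0, 1.0, 0.0]])
P = np.array([[0.0, 0.0],
              [2.0, 2.0]])
c = np.array([0, 1])
print(npp_classify(np.array([0.1, 0.2, 5.0]), Omega, P, c))  # -> 0
```

In full GMLVQ training, both `Omega` and the prototypes `P` would be adapted by stochastic gradient descent on the GLVQ cost function; the sketch above only shows the decision rule of an already trained model.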

Model interpretation is an important reason to prefer the shallow GMLVQ over deep networks. Thus, model certainty and stability are strongly desired properties, such that general causal implications about model decisions, in terms of feature contributions and relations, can be drawn from model inspection. Yet, from a machine learning perspective there is some evidence that there are many close-to-optimum solutions (CTOS). Different CTOS may realize their model performance according to substantially varying decision strategies. Hence, generalizations regarding causality drawn from a single model are questionable, although such generalizations are frequently demanded and expected from a user perspective. We present respective qualitative considerations for GMLVQ.

While the resulting performance of such solutions remains almost identical and close to the optimum (CTOS), their underlying decision strategies may differ. This is important for understanding how to interpret models causally. Moreover, this research also relates to a better understanding of how machines learn, which can be very different from how humans learn and understand things.

Description of the Experiments and Results

Let $d_\bold{\Omega} (\bold{x}, \bold{p}_k) = (\bold{\Omega} \bold{x} - \bold{p}_k)^\intercal \, (\bold{\Omega} \bold{x} - \bold{p}_k)$ be the prototype-to-data dissimilarity used for the NPP in GMLVQ with an embedding matrix $\bold{\Omega} \in \R^{m \times n}$, $m \le n$. From a trained model we can derive the classification correlation matrix (CCM) $\bold{\varLambda_\Omega} = \bold{\Omega}^\intercal \bold{\Omega}$, describing feature correlations supporting the classification, and the corresponding classification information profile (CIP) $\bold{\gamma_\Omega} = (\gamma_1^\bold{\Omega}, \ldots, \gamma_n^\bold{\Omega})^\intercal$ with $\gamma_k^\bold{\Omega} = \sum_j \big| [\bold{\varLambda_\Omega}]_{k,j} \big|$, estimating feature importance. Further, the relevance profile $\bold{\lambda_\Omega} = (\lambda_1^\bold{\Omega},\ldots, \lambda_n^\bold{\Omega})^\intercal$ with $\lambda_k^\bold{\Omega} = [\bold{\varLambda_\Omega}]_{k,k}$ is a simple measure of feature relevance. Thus, these quantities give qualitative information about the decision process of the model. Further, we compare the CIP profiles with corresponding Shapley values (Shap) for feature relevance evaluation, whereas permutation feature importance (PFI) values are easy to compute for feature sensitivity evaluation. Shapley values are calculated with respect to the GMLVQ cost function (ShapCosts) and with respect to the output (ShapOut, i.e. changes of the predicted labels). Similarity between profiles is judged by Spearman correlations.
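The explanation quantities above follow directly from the trained embedding matrix; a minimal sketch (the function name `explanation_profiles` and the example matrix are hypothetical):

```python
import numpy as np

def explanation_profiles(Omega):
    """From a trained embedding Omega (m x n), derive the
    classification correlation matrix (CCM) Lambda = Omega^T Omega,
    the classification information profile (CIP)
    gamma_k = sum_j |Lambda_{k,j}|, and the relevance profile
    lambda_k = Lambda_{k,k}."""
    Lam = Omega.T @ Omega              # CCM: feature correlations (n x n)
    gamma = np.abs(Lam).sum(axis=1)    # CIP: feature importance per feature
    lam = np.diag(Lam)                 # relevance profile: diagonal of the CCM
    return Lam, gamma, lam

# illustrative 2 x 3 embedding: feature 3 is ignored by the model
Omega = np.array([[1.0, -1.0, 0.0],
                  [0.0,  0.5, 0.0]])
Lam, gamma, lam = explanation_profiles(Omega)
print(gamma)  # feature 3 receives zero importance
```

Note that off-diagonal entries of the CCM contribute to the CIP but not to the relevance profile, which is why the CIP can capture feature interactions that the diagonal alone misses.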

We trained GMLVQ on several data sets and present a typical result: we considered an 11-dimensional medical tumor metabolite (MTM) data set (235 samples / 3 classes), adopted and modified from. Five-fold cross-validation yields high accuracies with only small deviations across folds. Yet, the visual analysis of the $\bold{\varLambda_\Omega}$-matrices reveals substantial deviations between the folds, as depicted in Fig. 1. Thus, we obtained qualitatively different CTOS in the individual folds, but with approximately the same performance. This underpins the above statement regarding the high variability of CTOS. In consequence, model interpretation is only valid for the specific model in use. Moreover, the correlation analysis of the feature relevance profiles yields the observation that Shapley values seem to be approximable by easier-to-generate profiles like the CIP.
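The fold-to-fold comparison can be sketched as a Spearman rank correlation between the relevance profiles of two folds; the implementation and the profile values below are illustrative (a simple rank-based formula assuming no ties, not the exact evaluation pipeline used in the experiments):

```python
import numpy as np

def spearman(a, b):
    """Spearman rank correlation: Pearson correlation of the ranks.
    (Simplified sketch assuming no tied values.)"""
    ra = np.argsort(np.argsort(a)).astype(float)  # ranks of a
    rb = np.argsort(np.argsort(b)).astype(float)  # ranks of b
    ra -= ra.mean()
    rb -= rb.mean()
    return float(ra @ rb / np.sqrt((ra @ ra) * (rb @ rb)))

# hypothetical CIP profiles from two cross-validation folds
gamma_fold1 = np.array([0.9, 0.1, 0.5, 0.3])
gamma_fold2 = np.array([0.8, 0.2, 0.6, 0.1])
rho = spearman(gamma_fold1, gamma_fold2)
print(rho)  # -> 0.8: similar but not identical feature rankings
```

A high correlation indicates that two folds rank the features similarly even when the absolute profile values differ; low correlations between folds signal qualitatively different decision strategies of the respective CTOS.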


Fig. 1. Upper row: CCM matrices $\bold{\varLambda_\Omega}$ obtained by GMLVQ for the five different folds of the MTM data set. Middle row: respective (selected) feature sensitivity profiles. Lower row: Correlation matrices between all considered feature sensitivity profiles.

Presentation "Stability of Model Explanations in Interpretable Prototype-based Classification Learning" held at the 3rd TRR 318 Conference: Contextualizing Explanations on 17 June 2025 in Bielefeld, Germany.
