Article title

Towards explainable classifiers using the counterfactual approach: global explanations for discovering bias in data

Content
Identifiers
Title variants
Publication languages
EN
Abstracts
EN
The paper proposes summarized attribution-based post-hoc explanations for detecting and identifying bias in data. A global explanation is proposed, and a step-by-step framework for detecting and testing bias is introduced. Since removing unwanted bias is often a complicated and laborious task, bias is instead inserted into the data automatically and then evaluated with the proposed counterfactual approach. The obtained results are validated on a sample skin lesion dataset. Using the proposed method, a number of possible bias-causing artifacts were successfully identified and confirmed in dermoscopy images. In particular, it was confirmed that black frames strongly influence the Convolutional Neural Network's predictions: in 22% of cases they changed the prediction from benign to malignant.
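For illustration, the counterfactual test described above reduces to a few steps: insert an artifact (e.g. a black frame) into otherwise unchanged images, re-run the classifier, and measure how many predictions flip. The sketch below is a minimal illustration under assumptions of our own, not the authors' implementation: a trained PyTorch classifier (model) that returns class logits, a batch of images of shape (N, C, H, W), and hypothetical class indices for benign and malignant.

import torch

def add_black_frame(images, width=16):
    # Counterfactual edit: paint a black frame of the given width
    # around each image; images is a float tensor of shape (N, C, H, W).
    framed = images.clone()
    framed[:, :, :width, :] = 0.0    # top rows
    framed[:, :, -width:, :] = 0.0   # bottom rows
    framed[:, :, :, :width] = 0.0    # left columns
    framed[:, :, :, -width:] = 0.0   # right columns
    return framed

@torch.no_grad()
def benign_to_malignant_flip_rate(model, images, benign=0, malignant=1):
    # Fraction of images predicted benign that flip to malignant once
    # the artifact is inserted (the class indices are assumptions).
    model.eval()
    before = model(images).argmax(dim=1)
    after = model(add_black_frame(images)).argmax(dim=1)
    benign_mask = before == benign
    flipped = benign_mask & (after == malignant)
    return flipped.sum().item() / max(benign_mask.sum().item(), 1)

Under these assumptions, a returned value of 0.22 would correspond to the 22% benign-to-malignant flips reported above.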
Year
Pages
51–67
Physical description
Bibliography: 42 items, figures
Authors
  • Department of Electrical Engineering, Control Systems and Informatics, Gdansk University of Technology, Poland
  • Department of Electrical Engineering, Control Systems and Informatics, Gdansk University of Technology, Poland
  • Department of Electrical Engineering, Control Systems and Informatics, Gdansk University of Technology, Poland
Bibliography
  • [1] P. Stock and M. Cisse, ConvNets and ImageNet beyond accuracy: Understanding mistakes and uncovering biases, in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2018, vol. 11210 LNCS, pp. 504–519, doi: 10.1007/978-3-030-01231-1_31.
  • [2] E. B. Kania, Chinese Military Innovation in Artificial Intelligence, 2019.
  • [3] F. Wang, L. P. Casalino, and D. Khullar, Deep Learning in Medicine - Promise, Progress, and Challenges, JAMA Internal Medicine, vol. 179, no. 3. American Medical Association, pp. 293–294, Mar. 01, 2019, doi: 10.1001/jamainternmed.2018.7117.
  • [4] J. Folmsbee, S. Johnson, X. Liu, M. Brandwein-Weber, and S. Doyle, Fragile neural networks: the importance of image standardization for deep learning in digital pathology, in Medical Imaging 2019: Digital Pathology, Mar. 2019, vol. 10956, p. 38, doi: 10.1117/12.2512992.
  • [5] S. Lapuschkin, S. Wäldchen, A. Binder, G. Montavon, W. Samek, and K. R. Müller, Unmasking Clever Hans predictors and assessing what machines really learn, Nature Communications, vol. 10, no. 1, 2019, doi: 10.1038/s41467-019-08987-4.
  • [6] R. M. J. Byrne, Counterfactuals in Explainable Artificial Intelligence (XAI): Evidence from Human Reasoning, 2019.
  • [7] M. T. Ribeiro, S. Singh, and C. Guestrin, ‘Why Should I Trust You?’: Explaining the Predictions of Any Classifier, 2016, doi: 10.1145/2939672.2939778.
  • [8] A. B. Arrieta et al., Explainable Artificial Intelligence (XAI): Concepts, Taxonomies, Opportunities and Challenges toward Responsible AI, Oct. 2019.
  • [9] S. Bach, A. Binder, G. Montavon, F. Klauschen, K.-R. Müller, and W. Samek, On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation, 2015, doi: 10.1371/journal.pone.0130140.
  • [10] R. R. Selvaraju, M. Cogswell, A. Das, R. Vedantam, D. Parikh, and D. Batra, Grad-CAM: Why did you say that? Visual explanations from deep networks via gradient-based localization, 2016.
  • [11] S. Wachter, B. Mittelstadt, and C. Russell, Counterfactual Explanations without Opening the Black Box: Automated Decisions and the GDPR, Harvard Journal of Law & Technology, vol. 31, no. 2, 2018.
  • [12] W. Samek, T. Wiegand, and K.-R. Müller, Explainable Artificial Intelligence: Understanding, Visualizing and Interpreting Deep Learning Models, Aug. 2017.
  • [13] J. Zhang, S. A. Bargal, Z. Lin, J. Brandt, X. Shen, and S. Sclaroff, Top-Down Neural Attention by Excitation Backprop, International Journal of Computer Vision, vol. 126, no. 10, pp. 1084–1102, Oct. 2018, doi: 10.1007/s11263-017-1059-x.
  • [14] G. Montavon, A. Binder, S. Lapuschkin, W. Samek, and K. R. Müller, Layer-Wise Relevance Propagation: An Overview, in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 11700 LNCS, Springer Verlag, 2019, pp. 193–209.
  • [15] W. Samek, A. Binder, G. Montavon, S. Lapuschkin, and K. R. Müller, Evaluating the visualization of what a deep neural network has learned, IEEE Transactions on Neural Networks and Learning Systems, vol. 28, no. 11, pp. 2660–2673, 2017, doi: 10.1109/TNNLS.2016.2599820.
  • [16] A. Torralba and A. A. Efros, Unbiased look at dataset bias, in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2011, pp. 1521–1528, doi: 10.1109/CVPR.2011.5995347.
  • [17] S. M. Lundberg et al., From local explanations to global understanding with explainable AI for trees, Nature Machine Intelligence, vol. 2, no. 1, pp. 56–67, Jan. 2020, doi: 10.1038/s42256-019-0138-9.
  • [18] B. Kim et al., Interpretability Beyond Feature Attribution: Quantitative Testing with Concept Activation Vectors (TCAV), 2018.
  • [19] G. Fidel, R. Bitton, and A. Shabtai, When Explainability Meets Adversarial Learning: Detecting Adversarial Examples using SHAP Signatures, Sep. 2019.
  • [20] A. M. Šimundić, Bias in research, Biochemia Medica, vol. 23, no. 1, pp. 12–15, Feb. 2013, doi: 10.11613/BM.2013.003.
  • [21] C. J. Pannucci and E. G. Wilkins, Identifying and avoiding bias in research, Plastic and Reconstructive Surgery, vol. 126, no. 2, pp. 619–625, Aug. 2010, doi: 10.1097/PRS.0b013e3181de24bc.
  • [22] R. Ambrosino, B. G. Buchanan, G. F. Cooper, and M. J. Fine, The use of misclassification costs to learn rule-based decision support models for cost-effective hospital admission strategies., Proceedings / the ... Annual Symposium on Computer Application [sic] in Medical Care. Symposium on Computer Applications in Medical Care, pp. 304–308, 1995.
  • [23] M. Thelwall, Gender bias in sentiment analysis, Online Information Review, vol. 42, no. 1, pp. 45–57, 2018, doi: 10.1108/OIR-05-2017-0139.
  • [24] P.-S. Huang et al., Reducing Sentiment Bias in Language Models via Counterfactual Evaluation, Nov. 2019.
  • [25] M. Hardt, E. Price, and N. Srebro, Equality of Opportunity in Supervised Learning, 2016.
  • [26] J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei, ImageNet: A large-scale hierarchical image database, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2009, pp. 248–255, doi: 10.1109/CVPR.2009.5206848.
  • [27] R. Geirhos, P. Rubisch, C. Michaelis, M. Bethge, F. A. Wichmann, and W. Brendel, ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness, in 7th International Conference on Learning Representations (ICLR), 2019.
  • [28] P. Tschandl, C. Rosendahl, and H. Kittler, The HAM10000 dataset, a large collection of multisource dermatoscopic images of common pigmented skin lesions, Scientific Data, vol. 5, Mar. 2018, doi: 10.1038/sdata.2018.161.
  • [29] X. Sun, J. Yang, M. Sun, and K. Wang, A Benchmark for Automatic Visual Classification of Clinical Skin Disease Images.
  • [30] A. Bissoto, M. Fornaciali, E. Valle, and S. Avila, (De)Constructing Bias on Skin Lesion Datasets, Apr. 2019.
  • [31] C. Barata, J. S. Marques, and M. E. Celebi, Deep Attention Model for the Hierarchical Diagnosis of Skin Lesions.
  • [32] B. Wang et al., Neural cleanse: Identifying and mitigating backdoor attacks in neural networks, in Proceedings - IEEE Symposium on Security and Privacy, May 2019, vol. 2019-May, pp. 707–723, doi: 10.1109/SP.2019.00031.
  • [33] C. J. Anders, T. Marinč, D. Neumann, W. Samek, K.-R. Müller, and S. Lapuschkin, Analyzing ImageNet with Spectral Relevance Analysis: Towards ImageNet un-Hans’ed, Dec. 2019.
  • [34] A. Mikolajczyk and M. Grochowski, Style transfer-based image synthesis as an efficient regularization technique in deep learning, in 2019 24th International Conference on Methods and Models in Automation and Robotics, MMAR 2019, 2019, pp. 42–47, doi: 10.1109/MMAR.2019.8864616.
  • [35] G. Huang, Z. Liu, L. van der Maaten, and K. Q. Weinberger, Densely Connected Convolutional Networks, Aug. 2016.
  • [36] G. Montavon, S. Lapuschkin, A. Binder, W. Samek, and K. R. Müller, Explaining nonlinear classification decisions with deep Taylor decomposition, Pattern Recognition, vol. 65, pp. 211–222, 2017, doi: 10.1016/j.patcog.2016.11.008.
  • [37] M. Balasubramanian and E. L. Schwartz, The Isomap Algorithm and Topological Stability, Science, vol. 295, no. 5552, p. 7, Jan. 2002, doi: 10.1126/science.295.5552.7a.
  • [38] D. Xu and Y. Tian, A Comprehensive Survey of Clustering Algorithms, Annals of Data Science, vol. 2, no. 2, pp. 165–193, Jun. 2015, doi: 10.1007/s40745-015-0040-1.
  • [39] T. M. Kodinariya and P. R. Makwana, Review on determining number of Cluster in K-Means Clustering, International Journal of Advance Research in Computer Science and Management Studies, vol. 1, no. 6, 2013.
  • [40] J. Jaworek-Korjakowska, A Deep Learning Approach to Vascular Structure Segmentation in Dermoscopy Colour Images, 2018, doi: 10.1155/2018/5049390.
  • [41] R. H. Johr, Dermoscopy: Alternative melanocytic algorithms - The ABCD rule of dermatoscopy, menzies scoring method, and 7-point checklist, Clinics in Dermatology, vol. 20, no. 3, pp. 240–247, May 2002, doi: 10.1016/S0738-081X(02)00236-5.
  • [42] T. Majtner, K. Lidayová, S. Yildirim-Yayilgan, and J. Y. Hardeberg, Improving skin lesion segmentation in dermoscopic images by thin artefacts removal methods, Dec. 2016, doi: 10.1109/EUVIP.2016.7764580.
  • [43] A. Sultana, I. Dumitrache, M. Vocurek, and M. Ciuc, Removal of artifacts from dermatoscopic images, 2014, doi: 10.1109/ICComm.2014.6866757.
  • [44] M. E. Celebi, H. Iyatomi, G. Schaefer, and W. V. Stoecker, Lesion border detection in dermoscopy images, Computerized Medical Imaging and Graphics, vol. 33, no. 2, pp. 148–153, Mar. 2009, doi: 10.1016/j.compmedimag.2008.11.002.
  • [45] C. Kim, K. Kim, and S. R. Indurthi, Small energy masking for improved neural network training for end-to-end speech recognition, Feb. 2020.
Notes
Record created with funding from the Polish Ministry of Science and Higher Education (MNiSW), agreement No. 461252, under the programme "Społeczna odpowiedzialność nauki" (Social Responsibility of Science), module: Popularisation of Science and Promotion of Sport (2021).
Document type
Bibliography
YADDA identifier
bwmeta1.element.baztech-a92e95ec-cea3-4051-95f4-2d074692ae16