1185305_Fryer,D_2021.pdf (15.77 MB)
Shapley values for feature selection: The good, the bad, and the axioms
journal contributionposted on 2022-01-24, 01:31 authored by Daniel FryerDaniel Fryer, I Strumke, Hien NguyenHien Nguyen
The Shapley value has become popular in the Explainable AI (XAI) literature, thanks, to a large extent, to a solid theoretical foundation, including four “favourable and fair” axioms for attribution in transferable utility games. The Shapley value is provably the only solution concept satisfying these axioms. In this paper, we introduce the Shapley value and draw attention to its recent uses as a feature selection tool. We call into question this use of the Shapley value, using simple, abstract “toy” counterexamples to illustrate that the axioms may work against the goals of feature selection. From this, we develop a number of insights that are then investigated in concrete simulation settings, with a variety of Shapley value formulations, including SHapley Additive exPlanations (SHAP) and Shapley Additive Global importancE (SAGE). The aim is not to encourage any use of the Shapley value for feature selection, but we aim to clarify various limitations around their current use in the literature. In so doing, we hope to help demystify certain aspects of the Shapley value axioms that are viewed as “favourable”. In particular, we wish to highlight that the favourability of the axioms depends non-trivially on the way in which the Shapley value is appropriated in the XAI application.
Pagination9p. (p. 144352-144360)
Rights StatementThe Author reserves all moral rights over the deposited text and must be credited if any re-use occurs. This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
CategoriesNo categories selected
Science & TechnologyTechnologyComputer Science, Information SystemsEngineering, Electrical & ElectronicTelecommunicationsComputer ScienceEngineeringFeature extractionGamesComputational modelingTask analysisAdditivesPredictive modelsData modelsExplainabilityfeature selectioninterpretabilityShapley valuevariable selectionXAI