La Trobe

File(s) stored somewhere else

Please note: Linked content is NOT stored on La Trobe and we can't guarantee its availability, quality, security or accept any liability.

Shapley value confidence intervals for attributing variance explained

journal contribution
posted on 2021-01-07, 04:30 authored by Daniel FryerDaniel Fryer, I Strümke, Hien NguyenHien Nguyen
© Copyright © 2020 Fryer, Strümke and Nguyen. The coefficient of determination, the (Formula presented.), is often used to measure the variance explained by an affine combination of multiple explanatory covariates. An attribution of this explanatory contribution to each of the individual covariates is often sought in order to draw inference regarding the importance of each covariate with respect to the response phenomenon. A recent method for ascertaining such an attribution is via the game theoretic Shapley value decomposition of the coefficient of determination. Such a decomposition has the desirable efficiency, monotonicity, and equal treatment properties. Under a weak assumption that the joint distribution is pseudo-elliptical, we obtain the asymptotic normality of the Shapley values. We then utilize this result in order to construct confidence intervals and hypothesis tests for Shapley values. Monte Carlo studies regarding our results are provided. We found that our asymptotic confidence intervals required less computational time to competing bootstrap methods and are able to exhibit improved coverage, especially on small samples. In an expository application to Australian real estate price modeling, we employ Shapley value confidence intervals to identify significant differences between the explanatory contributions of covariates, between models, which otherwise share approximately the same (Formula presented.) value. These different models are based on real estate data from the same periods in 2019 and 2020, the latter covering the early stages of the arrival of the novel coronavirus, COVID-19.


Publication Date



Frontiers in Applied Mathematics and Statistics



Article Number



17p. (p. 1-17)


Frontiers Research Foundation



Rights Statement

The Author reserves all moral rights over the deposited text and must be credited if any re-use occurs. Documents deposited in OPAL are the Open Access versions of outputs published elsewhere. Changes resulting from the publishing process may therefore not be reflected in this document. The final published version may be obtained via the publisher’s DOI. Please note that additional copyright and access restrictions may apply to the published version.