Multivariate permutation tests based on discrete optimal transport
Abstract
This contribution presents a new approach to multivariate permutation tests that utilizes the discrete optimal measure transport.
-
Department of Probability and Statistics, Faculty of Mathematics and Physics, Charles University, the Czech Republic [hudecova@karlin.mff.cuni.cz]
Keywords: multivariate permutation test, optimal measure transport
1 Univariate permutation tests
Permutation tests provide a robust and powerful tool for testing various statistical hypotheses. Their use for univariate test statistics is simple, and they yield valid inference under minimal assumptions about the underlying data distribution. Let be a univariate test statistic computed for a given data set. For a chosen number of repetitions, one randomly shuffles the labels of the observations and calculates the test statistics for the permuted data sets to obtain a permutation sample . The permutation p-value is computed as a proportion of , , that are more extreme than .
2 Multivariate permutation tests
There are various problems where the test statistic under investigation is multivariate, with the multiple testing problem being a simple example, as multiple test statistics can be viewed as components of a single multivariate test statistic. Unfortunately, the permutation principle does not straightforwardly extend to the multivariate setup. If the test statistic takes values in , , then the permutation sample can be obtained analogously, but it is not clear how to decide which values are more extreme than due to the lack of natural ordering in . Traditional approaches to multivariate permutation tests rely on a suitable transformation of a vector of componentwise permutation p-values. However, this approach has several drawbacks.
The problem of extremeness in is closely related to a definition of multivariate quantiles. Recently, Hallin et al. [2021] introduced a concept of multivariate quantiles derived from the optimal measure transportation (OMT). This contribution presents a new approach to permutation tests from Hlávka et al. [2024] that utilizes this concept.
3 Application of the optimal transport
The OMT permutation tests are based on the discrete optimal measure transport of the set of points in to a specified regular grid set of points in the unit ball in . The extremeness of is then evaluated via extremeness of its image among , , i.e. the permutation -value is computed as the relative frequency of , , such that .
The proposed approach is distribution-free and it requires only mild assumptions about exchangeability under the null hypothesis. Beside the final permutation p-value, the approach allows also to calculate and interpret contributions of the components of the vector test statistic to the rejection of the null hypothesis. Illustrative practical examples will be presented.
References
- Hallin et al. [2021] Marc Hallin, Eustasio del Barrio, Juan Cuesta-Albertos, and Carlos Matrán. Distribution and quantile functions, ranks and signs in dimension : A measure transportation approach. Ann. Statist., 49(2):1139 – 1165, 2021. https://doi.org/10.1214/20-AOS1996.
- Hlávka et al. [2024] Zdeněk Hlávka, Daniel Hlubinka, and Šárka Hudecová. Multivariate quantile-based permutation tests with application to functional data. J. Comput. Graph. Stat., 2024. https://doi.org/10.1080/10618600.2024.2444302.