Regional Statistics Conference 2026

Regional Statistics Conference 2026

Improved Inference by optimal transport when data are scarse.

Conference

Regional Statistics Conference 2026

Format: IPS Abstract - Malta 2026

Session: IPS 1261 - Advances in Methods for Scarce and Missing Data

Thursday 4 June 2:40 p.m. - 4:20 p.m. (Europe/Malta)

Abstract

When data are scare or samples very small, then exact inference is hardly possible. Moreover, if a given samples size is reasonably, that depends on the complexity of the estimation problem. We start from the simple problem of estimating a mean of a scalar variable or the correlation of two variables and providing a reliable confidence interval even if the sample is very small. Without distributional assumptions, bootstrap is so far the maybe most common approach. However, depending on the distribution of the original variable(s), its coverage properties can be very poor. We show how a relatively simple regularization by optimal transport can significantly improve on it. Our method is theoretically valid but fully data-driven in practice. This talk is based on joint work with Christophe Valvason and Eustasio del Barrio.