Bootstrapping Robust Nonparametric Regression Estimators

S. Van Aelst1 and M. Salibián-Barrera4
  • 1

    Department of Mathematics, KU Leuven, Leuven, Belgium [Stefan.VanAelst@kuleuven.be]

  • 2

    Department of Statistics, University of British Columbia, Vancouver, Canada [matias@stat.ubc.ca]

Keywords: Bootstrap – Nonparametric regression – Robustness

1 Robust Nonparametric Regression based on Splines

Consider the non-parametric regression model

Y=f(𝐱)+σε,

with 𝐱p a vector of explanatory variables and f an unknown regression function. We assume that the function f is smooth (e.g. differentiable). The errors ε are assumed to be independent and identically distributed for some symmetric distribution with center zero and scale 1 such that the parameter σ>0 is the error scale.

In this talk we consider spline based estimators, that is, the unknown function f is assumed to be of the form

f(𝐱)=j=1Kγjbj(𝐱),

with b1(𝐱),,bK(𝐱) a spline basis, e.g. truncated polynomials or B-splines while the parameter K controls the size of the spline space. The essential problem thus becomes estimating the parameter vector 𝜸=(γ1,,γK).

Based on a sample (𝐱1,Y1),,(𝐱n,Yn) we can then consider estimators 𝜸^n which are a minimizer of a penalized optimization problem of the form

Ln(𝜸;𝐱1,Y1),,(𝐱n,Yn))+λ𝜸𝐃𝜸.

For M-type estimators the loss function Ln is of the form

Ln(𝜸;𝐱1,Y1),,(𝐱n,Yn))=1ni=1nρ(yi-𝜸𝐛iσ^n),

with 𝐛i=(b1(𝐱i),,bK(𝐱i)) while for S-estimators Ln=σ^n2(𝜸) where σ^n2(𝜸) solves

1ni=1nρ(yi-𝜸𝐛iσ^n(𝜸))=δ,

for some δ(0,1). The smoothness of the estimator is regularized by the penalty term and the form of the matrix 𝐃. Note that such estimators can be rewritten in a penalized weighted least squares form whichis convenient to develop an iteratively weighted least squares algorithm for their computation. Several robust spline-based estimators fit within this framework, such as the M-type spline estimators of He and Shi [1995], Kalogridis and Van Aelst [2021, 2024], Kalogridis [2022, 2023] as well as the spline based S-estimator of Tharmaratnam et al. [2010].

2 Bootstrapping Robust Splines

For most robust spline-based estimators asymptotic results are not sufficiently developed to derive robust inference based on asymptotic theory. Therefore, we explore the use of bootstrap methods to construct robust inference. Application of the bootstrap for nonparametric regression often does not yield desirable results because the bias due to smoothing is not taken into account. See Kauermann et al. [2009] and references therein. In this talk we will adapt bootstrap methods to develop robust inference such as confidence bands based on robust spline-based estimators and investigate their performance.

References

  • He and Shi [1995] X. He and P. Shi. Asymptotics for M-type regression splines with auxiliary scale estimation. Sankhyā: The Indian Journal of Statistics, Series A (1961-2002), 57(3):452–461, 1995.
  • Kalogridis [2022] I. Kalogridis. Asymptotics for M-type smoothing splines with non-smooth objective functions. TEST, 31:373–389, 2022. doi: https://doi.org/10.1007/s11749-021-00782-y.
  • Kalogridis [2023] I. Kalogridis. Robust thin-plate splines for multivariate spatial smoothing. Econometrics and Statistics, 2023. doi: https://doi.org/10.1016/j.ecosta.2023.06.002.
  • Kalogridis and Van Aelst [2021] I. Kalogridis and S. Van Aelst. M-type penalized splines with auxiliary scale estimation. Journal of Statistical Planning and Inference, 212:97–113, 2021. doi: https://doi.org/10.1016/j.jspi.2020.09.004.
  • Kalogridis and Van Aelst [2024] I. Kalogridis and S. Van Aelst. Robust penalized spline estimation with difference penalties. Econometrics and Statistics, 29:169–188, 2024. doi: https://doi.org/10.1016/j.ecosta.2021.07.005.
  • Kauermann et al. [2009] G. Kauermann, G. Claeskens, and J. D. Opsomer. Bootstrapping for penalized spline regression. Journal of Computational and Graphical Statistics, 18(1):126–146, 2009.
  • Tharmaratnam et al. [2010] K. Tharmaratnam, G. Claeskens, C. Croux, and M. Salibián-Barrera. S-estimation for penalized regression splines. Journal of Computational and Graphical Statistics, 19(3):609–625, 2010. doi: https://doi.org/10.1198/jcgs.2010.08149.