References

Sparse Unbiased Estimating Equations via Likelihood Score Alignment

G. Bertagnolli ${}^{a}$ , Z. Huang ${}^{b}$ and D. Ferrari ${}^{a}$

${}^{a}$ Free University of Bozen-Bolzano, IT, ${}^{b}$ RMIT University, Melbourne, AU

Estimating equations are central to statistical inference, underpinning likelihood- and moment-based methods. Consider a parametric model $\mathcal{M}={f(\cdot;\theta):\theta\in\Theta\subseteq\mathbb{R}^{p}}$ and a sample $\left(Y^{(1)},\ldots,Y^{(n)}\right)$ . We study inference for $\theta$ in high-dimensional regimes ( $p\geq n$ ), based on an unbiased estimating function $S:\Theta\times\mathcal{Y}\to\mathbb{R}^{p}$ satisfying $\mathbb{E}\theta[S(\theta,Y)]=0$ and the associated system $\sum{i=1}^{n}S(\theta,Y^{(i)})=0$ . Such functions include moment conditions and composite likelihood scores. In high dimensions, the system is typically ill-conditioned, motivating a sparsity assumption: the true parameter $\theta_{0}$ has support $A_{0}={j:\theta_{0j}\neq 0}$ with $|A_{0}|\ll p$ . Building on optimal estimating function theory [1, 2], we look for an optimal estimating function $\tilde{S}=W_{0}S$ in the class of linear transformations of $S$ , where $W_{0}$ is the minimiser of the convex criterion

\mathcal{L}_{\lambda}(W)=\frac{1}{2}\operatorname{tr}\left(W\Sigma_{S}W^{\top}% \right)-\operatorname{tr}\left(H_{S}W^{\top}\right)+\operatorname{pen}_{% \lambda}(W).

with $\Sigma_{S}$ , $H_{S}$ being the variability and sensitivity matrices of $S$ respectively. The penality $\operatorname{pen}_{\lambda}(W)$ enforces sparsity directly at the estimating function level, also providing a link between the sparsity pattern of $W$ and the active set $A_{0}$ . Under standard conditions for penalised models, we establish selection consistency and asymptotic normality on the estimated active set. We also propose a blockwise proximal scoring algorithm for efficient computation of $(\widehat{W},\hat{\theta})$ , and illustrate the method on problems including linear regression and sparse multinomial models.

Keywords: High-dimensional statistics, Estimating equations, Sparsity

References

[1] Vidyadhar P Godambe. An optimum property of regular maximum likelihood estimation. The Annals of Mathematical Statistics, 31(4):1208–1211, 1960.
[2] Christopher C Heyde. Quasi-likelihood and its application: a general approach to optimal parameter estimation. Springer, 1997.