A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä
Large-sample properties of unsupervised estimation of the linear discriminant using projection pursuit (2021)


Radojičić, U., Nordhausen, K., & Virta, J. (2021). Large-sample properties of unsupervised estimation of the linear discriminant using projection pursuit. Electronic Journal of Statistics, 15(2), 6677-6739. https://doi.org/10.1214/21-EJS1956


JYU-tekijät tai -toimittajat


Julkaisun tiedot

Julkaisun kaikki tekijät tai toimittajatRadojičić, Una; Nordhausen, Klaus; Virta, Joni

Lehti tai sarjaElectronic Journal of Statistics

eISSN1935-7524

Julkaisuvuosi2021

Ilmestymispäivä01.01.2021

Volyymi15

Lehden numero2

Artikkelin sivunumerot6677-6739

KustantajaInstitute of Mathematical Statistics

JulkaisumaaYhdysvallat (USA)

Julkaisun kielienglanti

DOIhttps://doi.org/10.1214/21-EJS1956

Julkaisun avoin saatavuusAvoimesti saatavilla

Julkaisukanavan avoin saatavuusKokonaan avoin julkaisukanava

Julkaisu on rinnakkaistallennettu (JYX)https://jyx.jyu.fi/handle/123456789/79240

Rinnakkaistallenteen verkko-osoite (pre-print)https://arxiv.org/abs/2103.04678


Tiivistelmä

We study the estimation of the linear discriminant with projection pursuit, a method that is unsupervised in the sense that it does not use the class labels in the estimation. Our viewpoint is asymptotic and, as our main contribution, we derive central limit theorems for estimators based on three different projection indices, skewness, kurtosis, and their convex combination. The results show that in each case the limiting covariance matrix is proportional to that of linear discriminant analysis (LDA), a supervised estimator of the discriminant. An extensive comparative study between the asymptotic variances reveals that projection pursuit gets arbitrarily close in efficiency to LDA when the distance between the groups is large enough and their proportions are reasonably balanced. Additionally, we show that consistent unsupervised estimation of the linear discriminant can be achieved also in high-dimensional regimes where the dimension grows at a suitable rate to the sample size, for example, pn=o(n1∕3) is sufficient under skewness-based projection pursuit. We conclude with a real data example and a simulation study investigating the validity of the obtained asymptotic formulas for finite samples.


YSO-asiasanatmatematiikkaestimointi

Vapaat asiasanatclustering; kurtosis; linear discriminant analysis; projection pursuit; skewness


Liittyvät organisaatiot


OKM-raportointiKyllä

Raportointivuosi2021

JUFO-taso1


Viimeisin päivitys 2024-22-04 klo 18:53