Nonparametric Identification and Estimation of
the Number of Components in Multivariate Mixtures

Kasahara Hiroyuki
Shimotsu Katsumi

October 2012

Abstract

This article analyzes the identifiability of the number of components in k-variate, M-component finite mixture models in which each component distribution has independent marginals, including models in latent class analysis. Without making parametric assumptions on the component distributions, we investigate how one can identify the number of components from the distribution function of the observed data. When k>= 2, a lower bound on the number of components (M) is nonparametrically identifiable from the rank of a matrix constructed from the distribution function of the observed variables. Building on this identification condition, we develop a procedure to consistently estimate a lower bound on the number of components.

Full text

PDF Download (PDF: 548KB)