Type: Journal Publication
Abstract: This paper provides exact analytical expressions for the first and second moments of the true error for linear discriminant analysis (LDA) when the data are univariate and taken from two stochastic Gaussian processes. The key point is that we assume a general setting in which the sample data from each class do not need to be identically distributed or independent within or between classes. We compare the true errors of designed classifiers under the typical i.i.d. model and when the data are correlated, providing exact expressions and demonstrating that, depending on the covariance structure, correlated data can result in classifiers with either greater error or less error than when training with uncorrelated data. The general theory is applied to autoregressive and moving-average models of the first order, and it is demonstrated using real genomic data.
Cited as: Amin Zollanvaria, Jianping Hua, Edward R. Dougherty, "Analytical study of performance of linear discriminant analysis in stochastic settings", Pattern Recognition Volume 46, Issue 11, November 2013, Pages 3017–3029