Skip to main content
U.S. flag

An official website of the United States government

Stability and bias of classification rates in biological applications of discriminant analysis

January 1, 1990

We assessed the sampling stability of classification rates in discriminant analysis by using a factorial design with factors for multivariate dimensionality, dispersion structure, configuration of group means, and sample size. A total of 32,400 discriminant analyses were conducted, based on data from simulated populations with appropriate underlying statistical distributions. Simulation results indicated strong bias in correct classification rates when group sample sizes were small and when overlap among groups was high. We also found that stability of the correct classification rates was influenced by these factors, indicating that the number of samples required for a given level of precision increases with the amount of overlap among groups. In a review of 60 published studies, we found that 57% of the articles presented results on classification rates, though few of them mentioned potential biases in their results. Wildlife researchers should choose the total number of samples per group to be at least 2 times the number of variables to be measured when overlap among groups is low. Substantially more samples are required as the overlap among groups increases

Publication Year 1990
Title Stability and bias of classification rates in biological applications of discriminant analysis
Authors B.K. Williams, K. Titus, J. E. Hines
Publication Type Article
Publication Subtype Journal Article
Series Title Journal of Wildlife Management
Index ID 5222400
Record Source USGS Publications Warehouse
USGS Organization Patuxent Wildlife Research Center