Skip to main content
U.S. flag

An official website of the United States government

Singularity and Nonnormality in the Classification of Compositional Data

January 1, 1998

Geologists may want to classify compositional data and express the classification as a map. Regionalized classification is a tool that can be used for this purpose, but it incorporates discriminant analysis, which requires the computation and inversion of a covariance matrix. Covariance matrices of compositional data always will be singular (noninvertible) because of the unit-sum constraint. Fortunately, discriminant analyses can be calculated using a pseudo-inverse of the singular covariance matrix; this is done automatically by some statistical packages such as SAS. Granulometric data from the Darss Sill region of the Baltic Sea is used to explore how the pseudo-inversion procedure influences discriminant analysis results, comparing the algorithm used by SAS to the more conventional Moore-Penrose algorithm. Logratio transforms have been recommended to overcome problems associated with analysis of compositional data, including singularity. A regionalized classification of the Darss Sill data after logratio transformation is different only slightly from one based on raw granulometric data, suggesting that closure problems do not influence severely regionalized classification of compositional data.

Publication Year 1998
Title Singularity and Nonnormality in the Classification of Compositional Data
DOI 10.1023/A:1021705120065
Authors Geoffrey C. Bohling, J.C. Davis, Ricardo A. Olea, Jan Harff
Publication Type Article
Publication Subtype Journal Article
Series Title Mathematical Geology
Index ID 70020485
Record Source USGS Publications Warehouse