Machine learning and data augmentation approach for identification of rare earth element potential in Indiana Coals, USA

May 28, 2022

Rare earth elements and yttrium (REYs) are critical elements and valuable commodities because of their limited availability and high demand across a wide range of applications, especially high-technology products. Increased demand and geopolitical pressures motivate the search for alternative sources of REYs, and coal, coal waste, and coal ash are considered new sources for these critical elements. This research evaluates the REY potential of coals from Indiana (USA). Although the coal data revealed REY potential, few samples had complete REY measurements. Therefore, we explore the applicability of machine learning (ML) models and data augmentation techniques for evaluating REY potential in Indiana and in other coal-basin areas, using selected coal parameters (Al2O3, Fe2O3, C, Ash, S, P, Mo, Zn, and As contents) as covariates (indicators). Because of the relatively small number of samples with complete REY data in the Indiana Coal Database, two data augmentation techniques (Random Over-Sampling Examples and Synthetic Minority Over-Sampling Technique) were used. Four machine learning algorithms (linear discriminant analysis, support vector machine, random forest, and artificial neural networks) were applied to model REY potential as a classification problem. The results show that applying the Synthetic Minority Over-Sampling Technique before developing the support vector machine (SVM) models generated the best REY classification, with an accuracy of 95%. These encouraging results from Indiana coal data suggest that a similar approach could be used in other coal basins to screen locations for REY potential. Those locations can then be targeted for more detailed geochemical surveys to identify the most promising areas and evaluate overall REY resources.
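The core idea behind the Synthetic Minority Over-Sampling Technique named above can be sketched in a few lines: each synthetic sample is placed at a random point on the line segment between an under-represented sample and one of its nearest same-class neighbours. The following is a minimal standard-library illustration of that interpolation step only, not the authors' implementation; the function name `smote`, the parameter `k`, and the two-column covariate rows are hypothetical and are not data from the study.

```python
import math
import random


def smote(minority, n_new, k=3, seed=0):
    """Create n_new synthetic rows by interpolating between minority-class neighbours."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # Rank the other minority rows by Euclidean distance to the base row
        others = [row for row in minority if row is not base]
        others.sort(key=lambda row: math.dist(base, row))
        # Pick one of the k nearest neighbours at random
        neighbour = rng.choice(others[:k])
        # Place the new row at a random position on the segment base -> neighbour
        gap = rng.random()
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


# Hypothetical covariate rows (ash %, S %) for the under-represented class
minority_rows = [[8.1, 1.2], [9.4, 0.9], [7.6, 1.5], [10.2, 1.1]]
new_rows = smote(minority_rows, n_new=6)
```

In a workflow like the one described, the augmented rows would be combined with the original samples before a classifier such as an SVM is trained. Established libraries provide complete versions of this technique, so a hand-written sketch like this is useful mainly for seeing what the augmentation does.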

Citation Information

Publication Year 2022
Title Machine learning and data augmentation approach for identification of rare earth element potential in Indiana Coals, USA
DOI 10.1016/j.coal.2022.104054
Authors Snehamoy Chatterjee, Maria Mastalerz, Agnieszka Drobniak, C. Özgen Karacan
Publication Type Article
Publication Subtype Journal Article
Series Title International Journal of Coal Geology
Index ID 70232385
Record Source USGS Publications Warehouse
USGS Organization Geology, Energy & Minerals Science Center