Skip to main content
U.S. flag

An official website of the United States government

Machine Learning Model: Estimates of Metal Abundance in Global Seafloor Massive Sulfide Deposits

January 14, 2026

A multi-stage ensembled machine learning model was developed to estimate metal abundances in seafloor massive sulfide deposits worldwide. The modeling framework integrates (1) KMeans++ clustering to identify geochemical groupings based on enrichment controls, (2) Random Forest classification to assign geochemical labels to vent fields with incomplete or absent geochemical data, and (3) XGBoost regression to generate high-fidelity predictions of metal concentrations. This USGS model application data release includes all scripts, input files, and output files necessary to apply the model to estimate concentrations of cobalt, gold, and zinc. This model is not limited by spatial boundaries and is intended for application to any oceanic location with appropriate input data.

Publication Year 2026
Title Machine Learning Model: Estimates of Metal Abundance in Global Seafloor Massive Sulfide Deposits
DOI 10.5066/P13PYBJL
Authors Maria C Figueroa Matias
Product Type Data Release
Record Source USGS Asset Identifier Service (AIS)
USGS Organization Pacific Coastal and Marine Science Center
Rights This work is licensed under CC BY 4.0
Was this page helpful?