Images of two standard crude oils collected using a fluorescent camera device to train and optimize a machine learning model for real-time oil spill concentration assessment collected from November 7, 2023, to July 8, 2024
The data are a set of fluorescent images that were generated to support the development of a machine learning model. The approach combines fluorescence imaging, deep learning, a mobile application, and a data management system for automated and real-time oil spill assessment. The dataset is comprised of 1,530 fluorescence images from two distinct oil types, a napthalenic crude oil (NACO) and an aromatic-napthalenic crude oil (ANCO). The oil is diluted in hexane and the images represent concentrations ranging from 0 to 500 mg/L. The data are presented as JPEG files in two zip folders (one for each oil type) as well as a CSV file that describes the type and concentration of the oil photographed in each image. These images were used to train and evaluate a machine learning tool comprised of convolutional neural network architecture for feature extraction coupled with a custom regression model. Model description and code can be found at https://github.com/biplabpoudel25/Oil-spill-estimation.
Citation Information
| Publication Year | 2025 |
|---|---|
| Title | Images of two standard crude oils collected using a fluorescent camera device to train and optimize a machine learning model for real-time oil spill concentration assessment collected from November 7, 2023, to July 8, 2024 |
| DOI | 10.5066/P1SXVZX2 |
| Authors | Biplab Poudel, Jiacheng Xie, Congyu Guo, Dong Xu, Rishi J Patel, Olivia E Watt, Erin L Pulster, Jeffery A Steevens |
| Product Type | Data Release |
| Record Source | USGS Asset Identifier Service (AIS) |
| USGS Organization | Columbia Environmental Research Center |
| Rights | This work is marked with CC0 1.0 Universal |