Skip to main content
U.S. flag

An official website of the United States government

Improving predictions of hydrological low-flow indices in ungaged basins using machine learning

March 1, 2018

We compare the ability of eight machine-learning models (elastic net, gradient boosting, kernel-k-nearest neighbors, two variants of support vector machines, M5-cubist, random forest, and a meta-learning ensemble M5-cubist model) and four baseline models (ordinary kriging, a unit area discharge model, and two variants of censored regression) to generate estimates of the annual minimum 7-day mean streamflow with an annual exceedance probability of 90% (7Q10) at 224 unregulated sites in South Carolina, Georgia, and Alabama, USA. The machine-learning models produced substantially lower cross validation errors compared to the baseline models. The meta-learning M5-cubist model had the lowest root-mean-squared-error of 26.72 cubic feet per second. Partial dependence plots show that 7Q10s are likely moderated by late summer and early fall precipitation and the infiltration capacity of basin soils.

Publication Year 2018
Title Improving predictions of hydrological low-flow indices in ungaged basins using machine learning
DOI 10.1016/j.envsoft.2017.12.021
Authors Scott C. Worland, William H. Farmer, Julie E. Kiang
Publication Type Article
Publication Subtype Journal Article
Series Title Environmental Modelling and Software
Index ID 70198127
Record Source USGS Publications Warehouse
USGS Organization Lower Mississippi-Gulf Water Science Center