Structural Topic Models of water-quality related news articles and scientific abstracts in the Illinois River Basin, USA

May 19, 2026

This data release provides the input and output files, metadata, and R scripts used to apply a Structural Topic Model (STM) to public discourse (local news articles) and scientific discourse (abstracts) on water quality in the Illinois River Basin (ILRB). A total of 6,822 local news articles published from 2018 through 2022 and 190 scientific abstracts published from 2018 through 2023 were compiled for use in the STM, a statistical text-mining approach that identifies latent themes within a collection of documents while incorporating document-level metadata to explain topic prevalence. The model inputs included the text of each news article and scientific abstract along with associated metadata (for news articles, e.g., author, title, date, publisher, and location of the news publisher; for scientific abstracts, citation information). Due to proprietary restrictions, the full text of the local news articles (NewsArticles.xlsx) is not included in this data release. Instead, we provide the scientific abstracts input file (SCOPUS_scientificAbstracts.csv), the R scripts used to run the STM analysis for the news articles (STM_newsArticles.R) and the scientific abstracts (STM_scientificAbstracts.R), and the corresponding model output showing the main topics (STM_newsArticles_output.png and STM_scientificAbstracts_output.png). This data release supports the analyses presented in the corresponding journal article (https://www.doi.org/TBD). STM_newsArticles_output.png and STM_scientificAbstracts_output.png correspond to Figures 3 and 9, respectively, in that article.

Publication Year: 2026
Title: Structural Topic Models of water-quality related news articles and scientific abstracts in the Illinois River Basin, USA
DOI: 10.5066/P1JZFCVA
Authors: Catherine A Christenson, Jennifer C Murphy Blair, Jaqueline Ortiz
Product Type: Data Release
Record Source: USGS Asset Identifier Service (AIS)
USGS Organization: Upper Midwest Water Science Center - Madison, WI, Office
Rights: This work is marked with CC0 1.0 Universal