2023 CDI Workshop: Open Data for Open Science

The 2023 CDI Workshop was held May 2-5, 2023, at the National Conservation Training Center in Shepherdstown, West Virginia. The theme of the workshop was "Open Data for Open Science."

The 2023 Workshop had 394 registrants, 33 breakout sessions, 3 plenaries, 39 posters or demos, and 11 lightning talks. The workshop welcomed both virtual and in-person attendees.

The online agenda for the workshop is available on Sched.


Breakout Sessions 

Title | Session Lead(s)
Open Data for the Year of Open Science | Viv Hutchison
ArcGIS Online: Getting Started with Open Data and Initiatives in the ArcGIS Hub | Kristen Hocutt
Introduction to Posit Team (formerly RStudio Team) | Kaycee Faunce, Jayaram Hariharan
Develop and Advance a USGS POSSE (Participatory Open Science to Support Engagement) | Sophia B Liu, Nicole Herman-Mercer, Aparna Bamzai, Amanda Cravens
Highlights from the New USGS Advanced Scientific Computing Strategy & Roadmap | Janice Gordon
Effective Communication for Open Data | Amanda (Mandie) Carr
Bringing legacy data into the modern era – with a focus on compliance and best practices | Joe Bard, Marc Hunter
Where do my data go? To statistics and beyond! | Jacob Malcom, Nicole Athearn
Fitting square pegs into round holes: challenges in interdisciplinary science information management design | Jess Driscoll
Earth Science in the AI/ML Age: Leveraging Open Data and Science for Discovery | Neal Pastick, Pete Doucette, Alison Appling
Shared Software Development Resources | Carl Schroedl
Moving the Bar on USGS Data Management Planning | Madison Langseth
Using the Cloud to Navigate the Science Data Lifecycle | Dionne Zoanni, Eric Larson
Creating national products out of regional studies - processing on a larger scale | Jeanne Jones
Using the Microsoft Power Platform to enhance information sharing across USGS and potentially the public | Rebecca Bushon, Kenna Butler
USGS Metadata Transition: mdToolkit and Workflow Innovations | Dennis Walworth
Imagery as data – A showcase of how imagery can be used in increasing earth process understanding through both data delivery and analysis | Frank Engel, Cian Dawson
How do we move from data portals towards open science and decision support infrastructure? | Megan Hines, Dionne Zoanni
Continuous Science Deployment in the Cloud – Using emerging cloud technologies to create living web applications | Kevin Henry
Making the Leap: Adopting High Performance Computing for Your Science | Kyle Moran
How to use vocabularies to make your data interoperable | Fran Lightsom
Overview and updates to the Research Computing data storage infrastructure | Jeff Falgout, Drew Ignizio
USGS Basic ISO Metadata Content Standards | Dennis Walworth
Reproducible Data Pipelines in R: what are they, how to use them, and a hands on example using dataRetrieval and targets | Julie Padilla
High throughput data transformation on the cloud | Jeremy Newson
Open Science Napkin Drawings | Madison Langseth, Leslie Hsu

Posters and Demos

Title | Presenter(s)
Metadata development for the USGS National Water Census (and beyond!) | Megan Hines
PyHAT Demo | Itiya Aneece
Informing the use of native plant materials in restoration and rehabilitation with the Native Plant Seed Mapping Toolkit | Caitlin Andrews, Rob Massatti
An open-source process for maintaining a catalog | Leslie Hsu, Brandon Serna
PyHAT: An Open-Source Machine Learning tool for Earth and Planetary Spectral Datasets | Itiya Aneece
Increasing data accessibility and cross-community use through the GHISA visualization app | Itiya Aneece
The United States Geological Survey Science Data Catalog | Chirag Shah, Lisa Zolly
State of the Data: how FAIR are USGS data? | Tamar Norkin
ZenRiver Game Concept: Accelerating Creation of Machine Learning Imagery Training Datasets Using Citizen Science | Frank Engel, Joe Federer
Generalization quality assessment tools for geospatial data | Ethan Shavers
"ChesBay 24k": A Framework for Summarizing Landscape Data in the Chesapeake Bay Watershed and Beyond | Benjamin Gressler
Evaluation and recommendation of practices for publication of reproducible data and software releases in the USGS | Julie Padilla
Standardizing and Sharing Data for Chesapeake Studies | Stephanie Gordon
Datablasted: Can we refine publication pathways of imagery? | Jin-Si Over
Integrating continuous and categorical data to improve streamflow prediction | Konrad Hafen
Automating Sensor-to-Cloud Workflows for Riparian Resources in Grand Canyon | Joseph Thomas
Connecting with our Stakeholders - Developing a Better Understanding of Use and Usability for Science Products | Dawn Kotowicz, Kevin Henry
USGS Alaska Science Center Data Management Team: Promoting Science through Data | Laura McDuffie
A Tool for Rapid-Repeat High-Resolution Coastal Vegetation Maps to Improve Forecasting of Hurricane Impacts and Coastal Resilience | Kristin Byrd
Extracting data from maps: Applying lessons learned from the AI for Critical Mineral Assessment Competition | Margaret Goldman
NPSdataverse: Metadata authoring and data publication & access at NPS | Rob Baker
Managing Data Packages in the NPS DataStore Repository | Kristin Vanderbilt
Linking Orphaned Oil & Gas Wells with Groundwater Quality | Nick Gianoutsos
Communicating stream fish vulnerability to climate change | Taylor Woods
USGS Hydrography Markup Tools -- Origins and Future Directions | Marcelle Caturia
Hosting Published USGS Datasets - A Glance at ScienceBase | Drew Ignizio
USGS Enterprise Information Management and Technology Resources for Scientists | Jennifer Erxleben, Jessica Driscoll
Are U.S. Geological Survey researchers citing their data? | Madison Langseth, Grace Donovan
Evolution of the North Pacific Pelagic Seabird Database | Marla Hood
Geonarratives & Small Application Hosting at USGS | Milan Liu
USGS Data Strategy: An Opportunity to Engage | Viv Hutchison
Everglades Tree Island Indicator: Lessons Learned for Increasing Inclusivity | Saira Haider
Leveraging Existing USGS Streamgage Data to Map Flood-Prone Areas | Labeeb Ahmed
USGS Science Data Lifecycle in the Cloud | Dionne Zoanni, Eric Larson, Courtney Neu
Welcome to our community of Earth science data & technology enthusiasts! | Annie Burgess, Megan Carter
Towards open geophysics: The Geophysical Survey (GS) Data Standard and GSPy Toolbox | Burke Minsley
Community Engagement through Virtual Workshops: Lessons Learned from the Imagery Data Collaboration Area | Cian Dawson
Augmented Reality High Water Mark | Daniel Beckman
Cloud-hosted Web Applications | Peter Ng


Lightning Talks 

  • Annie Burgess: Welcome to our community of Earth science data & technology enthusiasts!

  • Birgit Peterson: Seeing the Forest in the Clouds: Developing application-ready 3D products from lidar and other remotely sensed data

  • Burke Minsley: Towards open geophysics: The Geophysical Survey (GS) data standard and GSPy toolbox

  • Cassandra Ladino: The Science Behind Business at USGS

  • Judd Patterson: NPS Data Packages

  • Sarah Wright: R packages for long-term monitoring data

  • Margaret Goldman: Using AI competitions to advance automation at the USGS

  • Nick Sievert: CreelCat: A database and application for inland creel and angler survey data

  • Sophia B Liu: Participatory Open Science at USGS: Foundational Overview Towards a Strategic Vision

  • Viv Hutchison: The USGS Data Strategy: You are invited to engage in its implementation!

  • Lindsay Powers: The Geological Materials Repository in oratory
